Gene Franean1_4174 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4174 
Symbol 
ID5672529 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4962464 
End bp4965115 
Gene Length2652 bp 
Protein Length883 aa 
Translation table11 
GC content73% 
IMG OID641243047 
Productapolipoprotein N-acyltransferase 
Protein accessionYP_001508464 
Protein GI158315956 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis
[COG0815] Apolipoprotein N-acyltransferase 
TIGRFAM ID[TIGR00546] apolipoprotein N-acyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0124093 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.508891 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTCGACA CCGCCCCCGT AACCGACACT GGACCGGTGA CCGACGCGCC CGCCACACCC 
GCCCTCGAGC CCGGCCCGCT CGGCCCCGCC CCGGCGGGCG CACCCCCCAC ACCCCCACAC
CCCACCGGCC GCGCCCTACC GCGCCGCCTC ACCCGGCCCG CCCTGGCCGT CCTCGCCGGC
GTCCTGCTCT ACCTCGCCTT CCCCCCGGTC GGCCTGTGGC CGCTGGCACC GGTCGCCCTG
GCCGTTCTCA CCCTGACCGT CCGCGGGCGG CGGCTACGCG CCTCCTACGG GCTGGGAATG
CTGTTCTCCC TCGCGTTCCT GCTACCCCTG CTGCGCTTCG TCTCCTTCGT CGGCGCCGAC
GGCTGGATCG TGCTGTCCGC CGCCGAGGCC GCCCTACTCG CCCTCGTAGC ACCCGCCACC
ACCCTCGTTC AGCGGCTACC CGCACCGTGG CTGTGGACGG GCGCCATCTG GGTCGCCCAG
GAGGCGCTAC GGGGCCGTGC GCCCTTCGGG GGCTTCCCCT GGGGGCGGAT CGCGTTCAGC
CAGCCGAACA GCCCCTACAC CGCCCTCGCA GCGCTCGGCG GCGCGCCTCT GGTCACCTTC
GCCGTCGCCA CCACCGCCGC GCTGCTCGCC ACAGCCGTCA CCCACGCCAC CACGACCACC
GCCGCCCGCG CCACGGGTAC AGATGCCGCC GGCCACGGCG CGGCAGGCGG CGCGGCGACC
CACATCCGGC CACTACTGGC CACTCTCACC GGCGCTCTCG CGCTCACCCT CACCGGCCTC
GCCGTTCCCC TGCCCACCAC CGCCCAGCAC GGCACCCTCA ACGTCGCCGC CGTCCAGGGC
AACGTCCCCG AAGCCGGCGG CCTGGGCGCC CTCGGCGAGG CCTTCCAGGT CACCGACAAC
CACGTCACCG GCACCGAGAA CCTCGCCGCC GCCGTCCGCG CCGGCCGCAC CCCCCAACCC
GACCTCGTCC TGTGGCCGGA GAACTCCTCC GACATCGACC CCCTCACCAA CCCCACCGCC
CACGCCGCCC TCACCCACGC CGCCACCGTC GCCGGCGCAC CACTGCTCGT CGGCGCCGTC
CTCGACGGCC CCGGCCCCCG CCACGTCCGC AACGCCGGCC TCATCTGGAC CACCGACGGC
CCCACCGGCG CCATGTACGT CAAACGCCAC CCCGTCCCCT TCGCCGAATA CCTCCCCGGC
CGCGCCCTGC TCGAAAAACT CATCAGCCGC TTCGCCGACG AGATGCCCAA CGACTTCCTC
GCCGGCACCG CCACCGGCGC CCTACCCGTC GCCGGCACCG TCATCGGCGA CGTCATCTGC
TTCGAAGTCG CCTACGACGG CCTCGTCCGC GACAACGTCA ACCGCGGCGC CGAACTCCTC
GTCATCCAAA CCAACAACGC CTCCTTCGGC CGCAAAGGCG AAAGCCAACA ACAACTCGCC
ATGAGCCGCC TGCGCGCCAT CGAACACGGC CGCGCCACCA TCCAGGTCTC CACCAGCGGC
CAGAGCGCCC TCATCACCCC CGACGGCACC ATCCTCACCC AGACCGGGCT CTACGAAGCC
GGCGTACTGT CCGCACAACT CCCACTACGC ACCACCCACA CCCTCGCCAC CCGCCTCGGT
ATCGTCCCGG AGGCGGTGCT CACCACCCTC GGCGCACTCG CCATGATTGC TGGACTGACC
CACCCCCGAC GCACCACCCA CCAACCCACC CCGACATCCA CCGACCGGAC CGGCGACGAC
CACCAGCACC TGGGGGAGAC GCACCAACCC GCCGCCGGCA CCGACCAGAA GAGGAGAGGC
GTGGAAGCCA CCCGCGTCGT CGTCTGCGTC CCGACCTACA ACGAACGGGA GAACCTGCCG
GACACCACGC GCCGGCTACG CCAGGCGAAC CCCGCGGTCC ACCTGCTCGT CATCGACGAC
GCAAGCCCCG ACGGCACCGG GAAAATCGCC GACGAACTCG CCGACGACGA CGACCACATC
CACGTCCTGC ACCGGCCCGG CAAATCCGGC CTCGGCTCCG CCTACATCGC CGGCTTCACC
TGGGCCCTGC AACACGGCTA CGACATCATC GTCGAAATGG ACGCCGACGG CTCCCACCAG
CCCGAACAGC TACCCCGCCT ACTCGACGCC CTCACCGACG CCGACCTGGC CATCGGCTCC
CGCTGGGTCC CCGGCGGCAC CGTCCACAAC TGGCCCCGCA GCCGGCTCGT CCTCTCCCGC
GGCGCCAACG CCTACGTCCG CGCCGCCCTC GGGGTGCCCC TCCACGACGC CACCGCCGGG
TTCCGCGCCT ACCGCGCCGA CGTCCTGCGC GCCCGCGACC TCGACCAGGT CGCCTCCCAG
GGCTACTGCT TCCAGGTCGA CCTCGCCTGG CGCTCCTGGC AGGCCGGGTT CCGCGTCGTC
GAAGTCCCCA TCGACTTCGT CGAACGCGAA CGCGGCGCGT CGAAGATGAG CCGCGCGATC
GTCGCCGAAG GATTCTGGCG CGTCGGCTGG TGGGCCCTGA CCTCCCTGCG CCGCGGCCCC
GCCAGCACCA GCCAGCACAC TGGCGCGGAC GCGGCGATCC CCGCCCCCGC CCGGCCCACC
GACCCCACCG CGGACAGCCT CACCACCCCG ACCGCCACCG GACCCGACAC CGTCGACGCC
GGCCGGCCCT GA
 
Protein sequence
MVDTAPVTDT GPVTDAPATP ALEPGPLGPA PAGAPPTPPH PTGRALPRRL TRPALAVLAG 
VLLYLAFPPV GLWPLAPVAL AVLTLTVRGR RLRASYGLGM LFSLAFLLPL LRFVSFVGAD
GWIVLSAAEA ALLALVAPAT TLVQRLPAPW LWTGAIWVAQ EALRGRAPFG GFPWGRIAFS
QPNSPYTALA ALGGAPLVTF AVATTAALLA TAVTHATTTT AARATGTDAA GHGAAGGAAT
HIRPLLATLT GALALTLTGL AVPLPTTAQH GTLNVAAVQG NVPEAGGLGA LGEAFQVTDN
HVTGTENLAA AVRAGRTPQP DLVLWPENSS DIDPLTNPTA HAALTHAATV AGAPLLVGAV
LDGPGPRHVR NAGLIWTTDG PTGAMYVKRH PVPFAEYLPG RALLEKLISR FADEMPNDFL
AGTATGALPV AGTVIGDVIC FEVAYDGLVR DNVNRGAELL VIQTNNASFG RKGESQQQLA
MSRLRAIEHG RATIQVSTSG QSALITPDGT ILTQTGLYEA GVLSAQLPLR TTHTLATRLG
IVPEAVLTTL GALAMIAGLT HPRRTTHQPT PTSTDRTGDD HQHLGETHQP AAGTDQKRRG
VEATRVVVCV PTYNERENLP DTTRRLRQAN PAVHLLVIDD ASPDGTGKIA DELADDDDHI
HVLHRPGKSG LGSAYIAGFT WALQHGYDII VEMDADGSHQ PEQLPRLLDA LTDADLAIGS
RWVPGGTVHN WPRSRLVLSR GANAYVRAAL GVPLHDATAG FRAYRADVLR ARDLDQVASQ
GYCFQVDLAW RSWQAGFRVV EVPIDFVERE RGASKMSRAI VAEGFWRVGW WALTSLRRGP
ASTSQHTGAD AAIPAPARPT DPTADSLTTP TATGPDTVDA GRP