Gene Franean1_4337 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4337 
Symbol 
ID5672692 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5180367 
End bp5182997 
Gene Length2631 bp 
Protein Length876 aa 
Translation table11 
GC content75% 
IMG OID641243210 
Producthypothetical protein 
Protein accessionYP_001508627 
Protein GI158316119 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4591] ABC-type transport system, involved in lipoprotein release, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.412663 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGAACG GCGGCGGACG GCGCGGCGCG GTGCTCGGGG AGACACCCGG CGGTGACGGC 
CATGCCGCGA CACCCGGCAA CGGGAACAGC GGCCGTGGCG CGGCGCCCGG CAGCGGCCGG
CGCGGCGCGG CATCCGGGGA CGACGGCGGG CCGGGCCGGG GCGGGGCGGG GTCGGCGGGG
CTGTGGCTGC GGTGGTCGTG GCGCGACCTG CGGGCGCGGC TGCTGCTGGT GGTGGCGCTC
GCCGCCGTCA TCGGCGAGGG AACCGGCCTG TACGCCGGCC TGACCAGCAC GTCGCGGTGG
CGCTACGAGT CCTACGACGC CAGCTTCGCC GGGCTGAACG TGCACGACCT GCGGATCAGC
GTGGACGCGG GCGCCACGGT GCCGCGGGGG CGGCTGCGCG ACGTCGTCGC GGCCCTGCCC
GACCCGGCCG CCGTGGATGC GAGCGCCGAG CGCCTGATGT TCCCCACCGA GATCGAGGCG
AGCCGACCAG GGCACAAGGA GGTGCTCGTC CGCGGCGAGG TCGTCGGCGT CGACCTGACC
GCGCGCCCGC TCGTGGACGG GATCTCGGTC GCCGCCGGCC GCGCGCTCAC CACGGCCGAC
CGGGGACGGC CCGTCGGCGT CCTGGACCAC GGGTTCGCCC AGGCCAACCA CCTGCCGGCG
ACGGGCACGG TACGGATCAG CGGGGACCGC GAGATCGGAT ACGTCGGCGA GGGGCAGTCC
CCGGAGTACT TCGTGCTGCC CGGCGAGCAG CCCGGACTGA TCACCCAGGC CGGCTTCGGC
GTGCTCTACA CCTCGCTGGA GACCGCGCAG AACCTCGCCG GCAAACCGGG AGCGGTCAAC
GACCTGGTCC TCACCGCGCG GCCCGGGACC GACGTCGCGG TCCTCCAGCG GCAGCTCACC
GCGGCCGTCA CCGAGCGGCT CCCCGGGGTG AGCACCACGA TCACCAACCG TGACGACATC
GCGTCCCGGC AGATCATGTA CGACTCGATC GAGAGCAACC AGGCGCTGTG GAACGCGCTC
GCGCTGCTCG TCCTGGTCGG TGCCATGTTC GCCGCGTTCA ACCTGGTGGG ACGGGTGGTG
GACGCGCAGC GACGGGAGAT CGGCATCGGC ATGGCGCTCG GGGTGCGGTC CCGGATGCTG
GCGTTGCGCC CGTTGCTGCT GGGGCTGCAG ATCGGGATCC TCGGGGTCCT CGCGGGGCTG
GTCACGGGGA TGATCATCAC TGCCGCGATG GGCGCGATGC TGCGCGACGT GTGGCCGCTA
CCGGACTGGC GCACCGGCTT CCAGGTCGGG GTGTTCGCCC GCGCCGCGGC GGTCGGCCTG
CTGCTGCCGC TGGTCGCGGC GGTCCACCCG GTGTGGCGGG CGGTGCGGGT CGAGCCGGTG
CAGGCGATCC ACGCGTCGGC CATCTCCGGC TCCACCCGCG CCCGTGCCCG CGCCCGCCCG
CGGCGGCGTC GGCGCGGGTT CCCGCTGCCC GGGGGGAGCC TGGCCCGAAT GCCGGCGCGC
AACCTGGCGC GCGCCCCGCG GCGGATGCTG CTCACCGCCC TGGGAATCGC CGCGGCGATC
ACGGCACAGG TCGTGTTCAC CGGCCAGCTC GACACCTTCA CCCGGACGAC CGACGCCGCC
GAGACCGAGC TGACGTCCAC CAGCCCCGAC CGGCTGCGGG TGACCCTGCC GTCGGTGCAG
CCCGTCACCT CGCCCACCGT CACCGCGGTC ACCGGCTCAC CGGCGGTCCG CGGCTCCGAC
GCCACGCTCG CCCTGCCCAC CAGGCTGCTG GGCCCGCCCG GTTCGGCACC GCACGAACCG
ATCGACACCC TGACCTACAT CCTGGACGTC GACAACCACA TCTGGTCACC CTCGATCACC
GAGGGCAGCG CGACAGGCGG CCTGCTGCTC GCCGCCAAGG CCGCCCACGA CCTGGGCGTG
GGCGTCGGCG GGACGGTCCT GCTGCGCCAT CCGCGCCGGG CCGGCGACGG CTACCAGACG
GTCGACACCC CGCTGCGCGT CGCCGGCATC CACAGCTTCC CGGTCCGCTC GGTGGCCTTC
CTCGACGCCG CCGACGCCGG CTCGTTCGGC CTCGCCGGCC TGACGAACGT GCTGACCGTC
CTGCCCGCGC CCGGCTACGA CCAGCTCGAC GCGATGCGCA CGCTCGCGGC CGTCCCCGGG
GTCGGCTCCG TCGTCCCGGC GACGGGGAGC ATCGAGGAGA TCCGTGCGCT GCTACGCACG
TTCGTCGGCA TCCTGCGGAT CGCCGAGATC GCGGTGCTCC TGCTCGCCCT GCTGATCGCC
TACAACGCGA TGAGCATCGC GATGGACGAG CGCCGCCGGG AGCAGGCGAC GATGCTCGCC
TTCGGGCTGG CCCCGCGCCG GGTGCTGGCG CTCGCCGTCG CCGAGAGCGC GCTGATCGGC
CTGCTCGGGA CGGTGATCGG CCTGGTAGCC GGCTACTGGA CGCTGCGCTG GACCGTCGAG
GTCCTGCTCG CCGACACCCT GCCCGACCTG GGCATCCGGG CCGTGCTGTC GGTTCCCACC
CTGCTGACCA CGCTGCTGCT GGGGGTCTTC GCGGTCGCCG TCGCCCCGCT GCTGTCCGCC
CGGCGGGTAC GCCGGATGGA CGTCCCGTCC ACCCTGCGGG TGATCGAGTA G
 
Protein sequence
MVNGGGRRGA VLGETPGGDG HAATPGNGNS GRGAAPGSGR RGAASGDDGG PGRGGAGSAG 
LWLRWSWRDL RARLLLVVAL AAVIGEGTGL YAGLTSTSRW RYESYDASFA GLNVHDLRIS
VDAGATVPRG RLRDVVAALP DPAAVDASAE RLMFPTEIEA SRPGHKEVLV RGEVVGVDLT
ARPLVDGISV AAGRALTTAD RGRPVGVLDH GFAQANHLPA TGTVRISGDR EIGYVGEGQS
PEYFVLPGEQ PGLITQAGFG VLYTSLETAQ NLAGKPGAVN DLVLTARPGT DVAVLQRQLT
AAVTERLPGV STTITNRDDI ASRQIMYDSI ESNQALWNAL ALLVLVGAMF AAFNLVGRVV
DAQRREIGIG MALGVRSRML ALRPLLLGLQ IGILGVLAGL VTGMIITAAM GAMLRDVWPL
PDWRTGFQVG VFARAAAVGL LLPLVAAVHP VWRAVRVEPV QAIHASAISG STRARARARP
RRRRRGFPLP GGSLARMPAR NLARAPRRML LTALGIAAAI TAQVVFTGQL DTFTRTTDAA
ETELTSTSPD RLRVTLPSVQ PVTSPTVTAV TGSPAVRGSD ATLALPTRLL GPPGSAPHEP
IDTLTYILDV DNHIWSPSIT EGSATGGLLL AAKAAHDLGV GVGGTVLLRH PRRAGDGYQT
VDTPLRVAGI HSFPVRSVAF LDAADAGSFG LAGLTNVLTV LPAPGYDQLD AMRTLAAVPG
VGSVVPATGS IEEIRALLRT FVGILRIAEI AVLLLALLIA YNAMSIAMDE RRREQATMLA
FGLAPRRVLA LAVAESALIG LLGTVIGLVA GYWTLRWTVE VLLADTLPDL GIRAVLSVPT
LLTTLLLGVF AVAVAPLLSA RRVRRMDVPS TLRVIE