Gene Franean1_5214 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5214 
Symbol 
ID5673548 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6257603 
End bp6260545 
Gene Length2943 bp 
Protein Length980 aa 
Translation table11 
GC content75% 
IMG OID641244068 
ProductNADH dehydrogenase (quinone) 
Protein accessionYP_001509478 
Protein GI158316970 
COG category[C] Energy production and conversion
[P] Inorganic ion transport and metabolism 
COG ID[COG1009] NADH:ubiquinone oxidoreductase subunit 5 (chain L)/Multisubunit Na+/H+ antiporter, MnhA subunit
[COG2111] Multisubunit Na+/H+ antiporter, MnhB subunit 
TIGRFAM ID[TIGR01131] ATP synthase subunit 6 (eukaryotes),also subunit A (prokaryotes) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.282248 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTCGCCG CCGTAGTCGC CCACCTTGTC CTCGCCGCGG TCCTGCCCGC GCTGACCGAC 
CGGCTGGGCC GCACGGCCTT CCTGCTCGCG GCCGCCGCGC CGGCAGCCAC GTTCGGCTGG
CTGGTGGTCC GGATACCGGC CGTACTCGAC GCGGCCACGG CCGCGCCGTC CGCGCTGGCC
ACACCAGCCG CGACTGGTGT GGCTGCCACG ACGGCCGCCG TCGGAGGGCA CGGCGCCCCC
GCCGGTGACC TGCTGGTCGA GACTCTCACC TGGGCGCCGA CCGTCGAGCT GGAGATCGTG
TTCCGGCTCG CCCCGCTGGC ACTGCTGATG GCGCTGCTCG TCACCGGCGT CGGCGCGGCG
GTGCTGGTCT ACTCGTTCGC CTACCACGCG CCGCACGCGG CCGACGGCGG CGTGCCCGTG
CCGGGCTCGG CGGGGCCGGG CCGGGCGAGC GCCGCGCTGC TCGCGTTCGC GGGCGCGATG
CTGGGGCTGG TCCTGGCCGA CGACCTGTTC ACGCTCTACC TCTTCTGGGA GCTCACCACC
GTCTTCTCCT TCCTGCTGAT CGGGCAGGAC GGCGTGAGCG CCCCCGGCCG GCGGTCCGCC
GTCCAGGCGT TGCTGATCAC CAGCGTCGGG GGCCTGGCGA TGCTGTTCGG CTTCGTCCTG
CTGGGGCAGG CAGCGGGCAC CTACCGGATC TCGCGGATTG TGGCGGCGCC GCCGTCCGGC
GCGGTGGTCA CCGCGGCGCT GGTGCTCGTC CTGCTGGGCG CGGCTACGAA GTCGGCCCAG
ATCCCGTTCC ACTCCTGGCT GCCGGCCGCC ATGGTCGCGC CCACCCCGGT CAGCGCGTAC
CTGCACGCCG CGGCGATGGT GAAGGCCGGG GTGTTCCTCG TGGCGACGCT GACCCCGGCC
TTCGCCGGCG TGGTGGGCTG GCAGGTACCG GCGGTGGCCC TCGGCGCGGC GACGATGCTG
CTCGGCGGCC TGCGCGCGCT GGTGCAGACC GACCTCAAAC GGCTGCTCGC CTTCGGTACC
GTCAGCCAGC TCGGGTTCCT CACCGCGCTG GTCGGGTTCG GGTCGCGCAC CGCGGCGCTG
GCCGGCGCCA CGCTGATCCT CGCGCACGGG CTGTTCAAGG CCGCGCTGTT CATGGTCGTC
GGCATCGTCG ACCACCAGGC CGGGACCCGT GACCTGCGGG ACCTGTCCGG CCTGTGGCGA
AGCGTCCCGG TGGTGTGCGG GGGAGCCGTC CTCGCCGCCG CCTCGATGGT CGGGCTGCCG
CCCTTCCTGG GCTACCTGGG CAAGGAGGCC GCCTTCGAGG CGCTCGTCCA CGGCGGGGCC
GGCGAGCTGG CGCTGCTCGC CGTGTTCGCG GCGGGCTCCT GCCTGACCAC CGGGTACGCG
CTGCGGTTCC TGTGGGGTGC GTTCGGCGCG CGGCCGGGCG CGCCGCCGAG CGCGGTCGCC
CGTCCGGCGG CGGTGTTCGT GGCGCCGGTG GTGGTGCTGG CCCTCGCCGG ACTCGGCCTC
GGGATCGCGC ACCGGGGCGT CGACCGGCTC GTCGCCGCCT ACGCCGACGG CTTCCCCGCC
GGCGCCGCGC GCTACCACCT CGCGCTCTGG CACGGGCCGG GCTGGCCGAT CGCGCTGTCC
GCCGTCGCGG TCGCCGGGGG CGTGCTGGTC TTCGCCGCGG CGGCGGGGCC TCCGCGGCGG
CTGTCGCCGC GCCTGCCCGA TCTCCTGGAC GCGCAGCGCG GCTACGAGCG CGCCGTGCGG
GCACTGGACT GGTCGGCGGT CGCGGTCACC GGCCGGCTGC AGACCGGCTC GCTGCCGGCG
TACCTCGGCG TCATCCTGCT CACGGTGCTC GCGGTGCCGG GCACCGCGCT GGTCACCGGG
ACCTCCTGGC CCGACGACCT GCCCTGGTGG GACTACCGCA TCCAGCTCCC GCTCGCGGTG
GGCATCCTGC TCGCCTCGCT GGCGGTCGTC CGCGCCCGCA GCAGGCTCAC CGCGACGCTG
CTGCTCGGCG CGGTCGGATA CGGGATCGGC GCGCTGTTCG TCGTCGACGG CGCGCCCGAC
CTCGCGCTGG CCCAGTTCCT CGTCGAGACC CTGTCGCTGA TCGTTTTCGT CTTCGTGCTG
CGGCGGATGC CGGTCCGGTT CTCGACGGCC GACCGTTCGT CGCCGCTGCG GCTGCCGCGG
ATCGCCGTCG CCGTCGCCGT CGGGGTTTTC GTGGCCGGGT TCGCGATCGT GACCAGCGGC
TCGCGCGCCG GGATGGCGCA GCCGAGCCGG GAGTTCATCG CACGCTCGCC CGGCGAGACC
GGGGCGACGA ACGTGGTGAA CGCCATCCTG GTCGACTTCC GGGCGTTCGA CACCCTCGGC
GAGATCGCGG TGCTCGCGGT GGCCGCGCTC GGCGTGGCCT CGCTGCTGCT GCTCGTCCGC
ACCCCGCAGG GGCGCACGAT CGACCAGCTG CTCGGCCCGG CGGACCGACG GGCCGAGGAG
CCGGTGCGGG CACGGTCGGT CCTGCTGGAG GTCACGACCC GGGCGGTGTT CCCGGTGGTG
CTCGTGTTCT CGCTGTACCT GCTCTTCGCC GGCCACACCA GGACGGGGGG CGGCTTCTCC
GGCGGGCTCG TCGCCGGCCT GGCGTTCGTC CTGCGTTATG TGGCGGGACG GCGCCGGCGG
GTGGGCGCGG CCGTCCCGGT GGTGCCGACC GCGGTGATCG GCACCGGGCT GGTGACCGCG
GCGGCCGCGG GGCTCGCGCC CGCGCTGCTC GGCGATCCCG TCCTGGAGAG CTATGTGTTC
AAGGGTGATC TGCCGATCCT GGGTCACGTC GAGCTGGTGA CGAGCCTGTT CTTCGACGTC
GGGGTCTACC TGCTGATCAT CGGCGTGGTG CTCGAGCTGC TGCGCACGCT CGGCACCGCC
GTCGACAAGG AGGCCGACGC CGAGATCGAG GCGGGCCGCG AGCGCGTGGA CGAGATCGCA
TGA
 
Protein sequence
MFAAVVAHLV LAAVLPALTD RLGRTAFLLA AAAPAATFGW LVVRIPAVLD AATAAPSALA 
TPAATGVAAT TAAVGGHGAP AGDLLVETLT WAPTVELEIV FRLAPLALLM ALLVTGVGAA
VLVYSFAYHA PHAADGGVPV PGSAGPGRAS AALLAFAGAM LGLVLADDLF TLYLFWELTT
VFSFLLIGQD GVSAPGRRSA VQALLITSVG GLAMLFGFVL LGQAAGTYRI SRIVAAPPSG
AVVTAALVLV LLGAATKSAQ IPFHSWLPAA MVAPTPVSAY LHAAAMVKAG VFLVATLTPA
FAGVVGWQVP AVALGAATML LGGLRALVQT DLKRLLAFGT VSQLGFLTAL VGFGSRTAAL
AGATLILAHG LFKAALFMVV GIVDHQAGTR DLRDLSGLWR SVPVVCGGAV LAAASMVGLP
PFLGYLGKEA AFEALVHGGA GELALLAVFA AGSCLTTGYA LRFLWGAFGA RPGAPPSAVA
RPAAVFVAPV VVLALAGLGL GIAHRGVDRL VAAYADGFPA GAARYHLALW HGPGWPIALS
AVAVAGGVLV FAAAAGPPRR LSPRLPDLLD AQRGYERAVR ALDWSAVAVT GRLQTGSLPA
YLGVILLTVL AVPGTALVTG TSWPDDLPWW DYRIQLPLAV GILLASLAVV RARSRLTATL
LLGAVGYGIG ALFVVDGAPD LALAQFLVET LSLIVFVFVL RRMPVRFSTA DRSSPLRLPR
IAVAVAVGVF VAGFAIVTSG SRAGMAQPSR EFIARSPGET GATNVVNAIL VDFRAFDTLG
EIAVLAVAAL GVASLLLLVR TPQGRTIDQL LGPADRRAEE PVRARSVLLE VTTRAVFPVV
LVFSLYLLFA GHTRTGGGFS GGLVAGLAFV LRYVAGRRRR VGAAVPVVPT AVIGTGLVTA
AAAGLAPALL GDPVLESYVF KGDLPILGHV ELVTSLFFDV GVYLLIIGVV LELLRTLGTA
VDKEADAEIE AGRERVDEIA