Gene Francci3_0549 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0549 
Symbol 
ID3904200 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp636337 
End bp638322 
Gene Length1986 bp 
Protein Length661 aa 
Translation table11 
GC content67% 
IMG OID637877878 
ProductNADH dehydrogenase subunit L 
Protein accessionYP_479662 
Protein GI86739262 
COG category[C] Energy production and conversion
[P] Inorganic ion transport and metabolism 
COG ID[COG1009] NADH:ubiquinone oxidoreductase subunit 5 (chain L)/Multisubunit Na+/H+ antiporter, MnhA subunit 
TIGRFAM ID[TIGR01974] proton-translocating NADH-quinone oxidoreductase, chain L 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.322787 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCTG ACCCGCTTCT GCTCGCTGCC GCCGAACATG GCGGTAGCAA GATCCATTAT 
GCGGCCGCGT CGGGGGTCTT CTCGCTGACC TGGCTGCTCA TCGCGCTTCC ATTGGCCGGA
GCCGCGGTGC TGTTGCTCGG CGGTCGGCGG ACCGACGGGT TCGGGCATCT GCTCGGCACA
CTGACCTCCG CCGCCAGCTT CGTGATCGGT CTGGTGCTGT TCGCCGGCCT GCTCGACCGC
CCCGGTGACG ACCGGGCGCT GTCCCAGCAG CTGTACTCCT GGATTCCGGT CAACGGTTTC
CGGGTGGACG TCGGTCTGCT GGTCGACCAG CTCTCGGTCG TCTTCGTGCT GCTGATCACC
GGGGTCGGGA CACTGATCCA CATCTACTCG ATCGGCTACA TGGCGCACGA CCCGGCCCGG
CGGAAGTTCT TCGCCTACAT GAACCTCTTC CTGGCGTCGA TGCTGCTGCT CGTCCTGGGC
GACAACTTCC TGTCGCTCTA TGCCGGCTGG GAGCTGGTCG GGCTGTCGTC CTTCCTGCTC
ATCAAGTTCT GGGAGTACAA GCCGGCCGCC GCCACCGCGG CGAACAAGGC CTTCTACATG
AACCGGGTCG GGGACGTCGG GCTGGCGCTC GCGATCATGT TCATGTTCGC CACCGTCGGG
TCGACGAGCT ACGCCGACGT GTTCGGCTCG GCCGCCGCGG GCGTCATCGG TTACGGCACG
ATCACCGCGA TCGCGCTGCT GCTCCTGCTC GCTGCGTGCG GCAAGTCCGG CCAGTTCCCG
TTGCAGGCCT GGCTGCCGGA CGCCATGGAG GGCCCGACCC CGATCTCCGC GCTCATCCAC
GCGGCGACCA TGGTCACCGC CGGGGTCTAT CTCATCGTGC GGGCCGGGCC GATCTTCAAC
GAGACGCAGG CCGCCCGTAC CGTCGTGGTG ATCATCGGGG CGGTCACGAT CCTGATCGGC
TGCGTCATCG GCTGTGCCTA CGATGACATC AAGAAGGTGC TGGCCTACTC GACAGTCAGC
CAGATCGGGT ACATGTTCCT CGCGGTGGGG CTGGGACCGG CCGGCTACGC GATCGGCATC
ATGCACCTGC TCGCCCACGG CTTCTTCAAG GCCGGTCTGT TCCTCGGATC CGGCTCGGTG
ATCCACGCGA TGAATGACGA CCAGGACCTG CGTCACTACG GCGGGCTCTG GCGGTATATG
AAGATCACCT GGGTGACCTT TGGTGTCGGC TACCTGGCGA TCATCGGGTT CCCCGGGTTC
TCCGGCTTCT TCACCAAGGA CCGGATCATC GAGACCGCCT TCGACAAGGG CGGGACGTCG
GGATACCTGC TCGGCTCGAT CGCGCTGCTG GGCGCGGGCA TCACCGCCTT CTACATGTCC
CGCCTGTTTC TGATGACCTT CCACGGCAGG CCGCGCTGGA CGACGGAGGG CGAGCACGCT
CGGCATCCGC ACGAGTCCCC GGGGTCGATG ACCGGTCCGA TGATCCTGCT GGCCGTCGGC
TCGCTCCTCG CCGGTGGCCT GTTCGTCCTC GGCCATTCGC TGCAGGACTG GCTCGAACCG
GTCGCCGGGG CGGGGGTCGA GGGTGTCCAC ACCTTCTCGC CGCTCGTCCT CACCCTGCTC
ACGCTGGTGG TCACCGCCGG CGGCTTTGCC GGGGCCTATG TGCGCTACCA GCTGCGCCCG
GTCGAGGCGA CGGCCCCGCC GGACTCCGAG GTCACCCTGG CCACCGTCGC CGCCCGTCAC
GACCTGTTCG CGAACACCTT CAACGAGACC GTCGCGATGC GGCCCGGCCA GTACCTGACC
CGTTTCCTGG TCTGGCTGGA CCTCGTCGGC GTTGACGGGC TCGTGCGGGG CAGCGCCGCC
GCCATCGGCG GCCTGTCGGG GCGGATGCGC CGCCTCCAGA CCGGCTTCGT CCGGTCCTAC
GCTCTGTCGA TGTTGGGAGG TGCCGTCCTC GTGGTCGGCG CGCTGCTGCT GGTGAGGGCG
GGCTGA
 
Protein sequence
MNADPLLLAA AEHGGSKIHY AAASGVFSLT WLLIALPLAG AAVLLLGGRR TDGFGHLLGT 
LTSAASFVIG LVLFAGLLDR PGDDRALSQQ LYSWIPVNGF RVDVGLLVDQ LSVVFVLLIT
GVGTLIHIYS IGYMAHDPAR RKFFAYMNLF LASMLLLVLG DNFLSLYAGW ELVGLSSFLL
IKFWEYKPAA ATAANKAFYM NRVGDVGLAL AIMFMFATVG STSYADVFGS AAAGVIGYGT
ITAIALLLLL AACGKSGQFP LQAWLPDAME GPTPISALIH AATMVTAGVY LIVRAGPIFN
ETQAARTVVV IIGAVTILIG CVIGCAYDDI KKVLAYSTVS QIGYMFLAVG LGPAGYAIGI
MHLLAHGFFK AGLFLGSGSV IHAMNDDQDL RHYGGLWRYM KITWVTFGVG YLAIIGFPGF
SGFFTKDRII ETAFDKGGTS GYLLGSIALL GAGITAFYMS RLFLMTFHGR PRWTTEGEHA
RHPHESPGSM TGPMILLAVG SLLAGGLFVL GHSLQDWLEP VAGAGVEGVH TFSPLVLTLL
TLVVTAGGFA GAYVRYQLRP VEATAPPDSE VTLATVAARH DLFANTFNET VAMRPGQYLT
RFLVWLDLVG VDGLVRGSAA AIGGLSGRMR RLQTGFVRSY ALSMLGGAVL VVGALLLVRA
G