Gene Mjls_1547 details


Gene Information

Locus tag: Mjls_1547
Symbol:
ID: 4877279
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. JLS
Kingdom: Bacteria
Replicon accession: NC_009077
Strand:
Start bp: 1655263
End bp: 1656603
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 67%
IMG OID: 640138852
Product: NADH dehydrogenase subunit D
Protein accession: YP_001069836
Protein GI: 126434145
COG category: [C] Energy production and conversion
COG ID: [COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7
TIGRFAM ID: [TIGR01962] NADH dehydrogenase I, D subunit
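
The start and end coordinates, gene length, and protein length listed above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it reproduces the stated 1341 bp) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(1655263, 1656603)
print(length_bp)                  # 1341
print(protein_length(length_bp))  # 446
```

Both values agree with the Gene Length and Protein Length fields of this record.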


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.795023
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.142178
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCACTC CCCCCGGCCC GCCCAACGCG GGCGGTGACG CGCGCACCGG CACCGACACG 
GTGATCGTCG TCGGCGGTGA GGACTGGGAG CAGGTCGTCG CCGCCGCCGA ACAGGCGCAG
GCCGGTGAGC GCATCGTCGT GAACATGGGA CCGCAGCACC CCTCGACACA CGGCGTGCTG
CGGTTGATCC TCGAGATCGA GGGCGAGACG ATCACCGAAG CCCGTTGCGG TATCGGCTAT
CTGCATACCG GCATCGAGAA GAACCTGGAG TACCGGAACT GGACGCAGGG CGTCACCTTC
GTCACCCGGA TGGACTACCT GTCGCCGTTC TTCAACGAGA CCGCCTACTG CCTGGGTGTG
GAGAAACTGC TCGGCGTCAC CGACGCGATC CCGGAGCGCG TCAACGTGAT CCGGGTGATG
TTGATGGAAC TCAACCGGAT CTCCTCGCAT CTGGTCGCAC TGGCGACCGG CGGGATGGAA
CTCGGGGCGA TGAGCGCGAT GTTCTACGGC TTCCGGGAGC GCGAGGAGAT CCTGTCGGTG
TTCGAGATGA TCACCGGGTT GCGGATGAAC CACGCGTACA TCCGGCCCGG CGGGCTGGCC
GCCGACCTGC CCGACGGTGC GGTCCCCCGC ATCCGCGAAC TGCTCGCGCT GCTCCCCGGG
CGGCTGCGCG ACCTGGAGAA CCTGCTCAAC GAGAACTACA TCTGGAAGGC CCGCACGCAG
GGCATCGGCT ACCTCGACCT GGCCGGCTGC ATGGCACTCG GCATCACGGG CCCGGTGCTG
CGCTCGACCG GGCTGCCGCA CGATCTGCGC CGGGCCCAAC CGTACTGCGG TTACGAGGAC
TACGAATTCG ACGTGATCAC CGACGACGGC TGCGACGCCT ACGGCCGCTA CCTCATCCGG
GTGAAGGAGA TGCGTGAATC GCTCAAGATC GTCGAACAGT GTGTGGACCG ATTGAAGCCC
GGACCGGTGA TGATCGCGGA CAAGAAGCTC GCCTGGCCGG CCGACCTCGA ACTGGGACCC
GACGGCCTCG GCAACTCCCC CGCCCACATC GCCCGCATCA TGGGGCAGTC GATGGAGGGC
CTGATCCACC ACTTCAAGCT GGTGACCGAG GGTATCCGGG TGCCGGCCGG GCAGGTGTAC
ACGGCCGTGG AGTCGCCCCG CGGCGAACTG GGGGTGCACA TGGTCTCCGA CGGTGGAACC
CGGCCCTACC GCGTCCACTA CCGCGACCCG TCGTTCACGA ATCTGCAAGC GGTGGCGGCG
ATGTGCGAGG GCGGGATGGT CGCCGACGCC ATCTCGGCGG TCGCGTCGAT CGACCCGGTC
ATGGGCGGGG TGGATAGGTG A
 
Protein sequence
MTTPPGPPNA GGDARTGTDT VIVVGGEDWE QVVAAAEQAQ AGERIVVNMG PQHPSTHGVL 
RLILEIEGET ITEARCGIGY LHTGIEKNLE YRNWTQGVTF VTRMDYLSPF FNETAYCLGV
EKLLGVTDAI PERVNVIRVM LMELNRISSH LVALATGGME LGAMSAMFYG FREREEILSV
FEMITGLRMN HAYIRPGGLA ADLPDGAVPR IRELLALLPG RLRDLENLLN ENYIWKARTQ
GIGYLDLAGC MALGITGPVL RSTGLPHDLR RAQPYCGYED YEFDVITDDG CDAYGRYLIR
VKEMRESLKI VEQCVDRLKP GPVMIADKKL AWPADLELGP DGLGNSPAHI ARIMGQSMEG
LIHHFKLVTE GIRVPAGQVY TAVESPRGEL GVHMVSDGGT RPYRVHYRDP SFTNLQAVAA
MCEGGMVADA ISAVASIDPV MGGVDR
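
The gene and protein sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how one might strip that layout, translate the gene sequence, and compute its GC fraction follows. It uses only the Python standard library; the helper names `clean`, `translate`, and `gc_fraction` are hypothetical. The codon table is the standard one, which is assumed to be sufficient here because translation table 11 assigns the same amino acids to codons and differs in the permitted start codons (this gene starts with ATG, so no special handling of alternative starts is included).

```python
from itertools import product

# Standard codon assignments, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence in this record.
print(translate("ATGACCACTC CCCCCGGCCC GCCCAACGCG"))  # MTTPPGPPNA
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above once its spaces are removed, and `gc_fraction` can be compared against the GC content field.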