Gene Mkms_1601 details


Gene Information

Locus tag: Mkms_1601
Symbol:
ID: 4614070
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 1719789
End bp: 1721129
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 67%
IMG OID: 639791273
Product: NADH dehydrogenase subunit D
Protein accession: YP_937599
Protein GI: 119867647
COG category: [C] Energy production and conversion
COG ID: [COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7
TIGRFAM ID: [TIGR01962] NADH dehydrogenase I, D subunit
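
The coordinate and length fields above agree with one another, which a little arithmetic can confirm. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive coordinates, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1719789, 1721129)
print(length)                  # 1341
print(protein_length(length))  # 446
```

Both results match the Gene Length and Protein Length fields reported for Mkms_1601.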


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCACTC CCCCCGGCCC GCCCAACGCG GGCGGTGACG CGCGCACCGG CACCGACACG 
GTGATCGTCG TCGGCGGTGA GGACTGGGAG CAGGTCGTCG CCGCCGCCGA ACAGGCGCAG
GCCGGTGAGC GCATCGTCGT GAACATGGGA CCGCAGCACC CCTCGACACA CGGCGTGCTG
CGGTTGATCC TCGAGATCGA GGGCGAGACG ATCACCGAAG CCCGTTGCGG TATCGGCTAT
CTGCATACCG GCATCGAGAA GAACCTGGAG TACCGGAACT GGACGCAGGG CGTCACCTTC
GTCACCCGGA TGGACTACCT GTCGCCGTTC TTCAACGAGA CCGCCTACTG CCTGGGTGTG
GAGAAACTGC TCGGCGTCAC CGACGCGATC CCGGAGCGCG TCAACGTGAT CCGGGTGATG
TTGATGGAAC TCAACCGGAT CTCCTCGCAT CTGGTCGCAC TGGCGACCGG CGGGATGGAA
CTCGGGGCGA TGAGCGCGAT GTTCTACGGC TTCCGGGAGC GCGAGGAGAT CCTGTCGGTG
TTCGAGATGA TCACCGGGTT GCGGATGAAC CACGCGTACA TCCGGCCCGG CGGGCTGGCC
GCCGACCTGC CCGACGGTGC GGTCCCCCGC ATCCGCGAAC TGCTCGCGCT GCTCCCCGGG
CGGCTGCGCG ACCTGGAGAA CCTGCTCAAC GAGAACTACA TCTGGAAGGC CCGCACGCAG
GGCATCGGCT ACCTCGACCT GGCCGGCTGC ATGGCACTCG GCATCACGGG CCCGGTGCTG
CGCTCGACCG GGCTGCCGCA CGATCTGCGC CGGGCCCAAC CGTACTGCGG TTACGAGGAC
TACGAATTCG ACGTGATCAC CGACGACGGC TGCGACGCCT ACGGCCGCTA CCTCATCCGG
GTGAAGGAGA TGCGTGAATC GCTCAAGATC GTCGAACAGT GTGTGGACCG ATTGAAGCCC
GGACCGGTGA TGATCGCGGA CAAGAAGCTC GCCTGGCCGG CCGACCTCGA ACTGGGACCC
GACGGCCTCG GCAACTCCCC CGCCCACATC GCCCGCATCA TGGGGCAGTC GATGGAGGGC
CTGATCCACC ACTTCAAGCT GGTGACCGAG GGTATCCGGG TGCCGGCCGG ACAGGTGTAC
ACGGCCGTGG AGTCGCCACG CGGCGAACTG GGGGTGCACA TGGTCTCCGA CGGTGGAACC
CGGCCCTACC GCGTCCACTA CCGCGACCCG TCGTTCACGA ATCTGCAAGC GGTGGCGGCG
ATGTGCGAGG GCGGGATGGT CGCCGACGCC ATCTCGGCGG TCGCGTCGAT CGACCCGGTC
ATGGGCGGGG TGGATAGGTG A
 
Protein sequence
MTTPPGPPNA GGDARTGTDT VIVVGGEDWE QVVAAAEQAQ AGERIVVNMG PQHPSTHGVL 
RLILEIEGET ITEARCGIGY LHTGIEKNLE YRNWTQGVTF VTRMDYLSPF FNETAYCLGV
EKLLGVTDAI PERVNVIRVM LMELNRISSH LVALATGGME LGAMSAMFYG FREREEILSV
FEMITGLRMN HAYIRPGGLA ADLPDGAVPR IRELLALLPG RLRDLENLLN ENYIWKARTQ
GIGYLDLAGC MALGITGPVL RSTGLPHDLR RAQPYCGYED YEFDVITDDG CDAYGRYLIR
VKEMRESLKI VEQCVDRLKP GPVMIADKKL AWPADLELGP DGLGNSPAHI ARIMGQSMEG
LIHHFKLVTE GIRVPAGQVY TAVESPRGEL GVHMVSDGGT RPYRVHYRDP SFTNLQAVAA
MCEGGMVADA ISAVASIDPV MGGVDR
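
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The following is a minimal sketch rather than a full translation tool. It relies on the fact that translation table 11 assigns the same amino acids to sense codons as the standard genetic code (the tables differ in which start codons they permit), and it assumes the sequence is written in space-separated blocks as shown on this page. The `translate` helper is a hypothetical name introduced for illustration.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence above (60 nucleotides, 20 codons).
first_line = "ATGACCACTC CCCCCGGCCC GCCCAACGCG GGCGGTGACG CGCGCACCGG CACCGACACG"
print(translate(first_line))  # MTTPPGPPNAGGDARTGTDT

# Last 21 nucleotides of the gene sequence, ending in the TGA stop codon.
print(translate("ATGGGCGGGG TGGATAGGTG A"))  # MGGVDR
```

The two outputs correspond to the first 20 and last 6 residues of the protein sequence listed above.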