Gene Mvan_2416 details

Gene Information

Locus tag: Mvan_2416
Symbol:
ID: 4644838
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 2570953
End bp: 2572008
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 66%
IMG OID: 639805899
Product: NADH ubiquinone oxidoreductase, 20 kDa subunit
Protein accession: YP_953235
Protein GI: 120403406
COG category: [C] Energy production and conversion
COG ID: [COG1740] Ni,Fe-hydrogenase I small subunit
TIGRFAM ID: [TIGR00391] hydrogenase (NiFe) small subunit (hydA)
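
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(2570953, 2572008)
print(length, protein_length_aa(length))  # prints "1056 351"
```

Both results match the Gene Length and Protein Length fields of this record.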


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0177394
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.195445
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCCGAGCG AGGCAGCAGT CAAAGCGGAA GAGGCCTTGA TCCACGTGCT GTGGATCAAC 
GCCGGTTTGA GTTGTGACGG CGATTCAGTG GCGTTGACTG CCGCCACCCA ACCCAGCATC
GAGGAGATCG CGCTCGGCGC GCTCCCCGGC CTGCCCAGGA TCGCCGTCCA CTGGCCCCTG
ATCGACTTCG AATGCGGACC GAGCGGGGGC GCCGACGACT TCCTCGAATG GTTCTTCAGA
GCGGACAGAG GCGAATTGGA ACCGTTCGTA CTGGTCGTCG AGGGATCCAT CCCGAACGAG
AAGATCAAGG ACGAAGGTTA CTGGTGCGGG TTCGGCAACG ACCCGGCCAC CGGTCAGCCC
ATGACCACCA GCGAATGGTT GGATCGGCTC GCCCCGAAGG CGACCGCGGT GGTGGCGGTC
GGGACGTGCG CCACCTACGG CGGCATCCAT GCGATGGCGG GCAACCCGAC GGGCGCGATG
GGTGTCCCCG ACTACCTCGG ATGGGACTGG AAGACCAAGG CGGGCATCCC GATCGTGTGC
GTGCCCGGCT GCCCGATCCA CCCGGACAAC CTGGCCGAGA CGCTGACCTA CCTGCTGTAC
ATGGCCACCG ACCAGGCGCC GATGATCCCA CTCGACGACG CGCTGCGGCC GAAGTGGCTG
TTCGGCGCGA CCGTGCACGA AGGCTGCGAC CGCGCCGGCT ACTACGAGCA GGGCGACTTC
GCGACCGAGT ACGGCTCCCC GAAATGCATT GTGAAACTGG GGTGTTGGGG TCCGGTGGTG
AAGTGCAACG TACCCAAACG CGGGTGGATC AACGGTATCG GCGGATGCCC GAACGTCGGC
GGTATCTGCA TCGGCTGCAC CATGCCGGGC TTTCCCGACA AGTTCATGCC CTTCATGGAC
GAACCGCCCG GCGGCAAGTT GTCCTCGACG GCGTCGGGAC TGTACGGCTC GGTGATCCGC
GGCCTGCGCC ACATCACCGG GCGCACCGTC GACAAAGAAC CCCGGTGGCG ACATCCGGGC
ACCACACTCG AGACGGGCGC GACCCGCACC TGGTAG
 
Protein sequence
MPSEAAVKAE EALIHVLWIN AGLSCDGDSV ALTAATQPSI EEIALGALPG LPRIAVHWPL 
IDFECGPSGG ADDFLEWFFR ADRGELEPFV LVVEGSIPNE KIKDEGYWCG FGNDPATGQP
MTTSEWLDRL APKATAVVAV GTCATYGGIH AMAGNPTGAM GVPDYLGWDW KTKAGIPIVC
VPGCPIHPDN LAETLTYLLY MATDQAPMIP LDDALRPKWL FGATVHEGCD RAGYYEQGDF
ATEYGSPKCI VKLGCWGPVV KCNVPKRGWI NGIGGCPNVG GICIGCTMPG FPDKFMPFMD
EPPGGKLSST ASGLYGSVIR GLRHITGRTV DKEPRWRHPG TTLETGATRT W
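
The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how that could be done, together with a simple GC-fraction calculation. The helper names `normalize` and `gc_fraction` are hypothetical, and the example uses only the first line of the gene sequence, so its GC value is not expected to equal the whole-gene figure reported in the record.

```python
def normalize(seq_text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    # str.split() with no arguments splits on any run of spaces or newlines.
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = "ATGCCGAGCG AGGCAGCAGT CAAAGCGGAA GAGGCCTTGA TCCACGTGCT GTGGATCAAC"
seq = normalize(first_line)
print(len(seq), seq[:3], round(gc_fraction(seq), 2))
```

Passing the full gene sequence through the same two functions would give a value to compare against the GC content field.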