Gene Mjls_2120 details

Gene Information

Locus tag: Mjls_2120
Symbol:
ID: 4877840
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. JLS
Kingdom: Bacteria
Replicon accession: NC_009077
Strand:
Start bp: 2221979
End bp: 2223046
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 70%
IMG OID: 640139417
Product: hydrogenase expression/formation protein HypE
Protein accession: YP_001070397
Protein GI: 126434706
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0309] Hydrogenase maturation factor
TIGRFAM ID: [TIGR02124] hydrogenase expression/formation protein HypE
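
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but the numbers agree under that assumption) and that the gene length includes one stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values copied from the record.
length = gene_length_bp(2221979, 2223046)
print(length)                     # 1068
print(protein_length_aa(length))  # 355
```

Both results match the "Gene Length" and "Protein Length" fields of the record.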


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0569732
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.0990172
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTGAGA CCCGCGCGGC GATCGACATG GAGAGCTGGG TGTGCCCGGC GCCGCTGCGG 
GATTCGCCGA ACGTCGTGAT GGGTCACGGC GGTGGCGGCG CGATGTCGGG TGAGCTGATC
GAGCATCTGT TCCTGCCCGC GTTCGGTCCG GCCGCGGACG CGGCGATGGG CGATTCGGCC
GTCGTCGAAA TCGGCGGGAC CCGGCTGGCG TTCTCCACCG ATTCGTTCGT CGTCAAGCCG
ATGGTGTTCC CGGGCGGCAC GATCGGCGAG CTGGCGGTCA ACGGCACGGT CAACGACCTC
GCGATGGCCG GCGCGACGCC GATGGTGCTG TCGACGGCGT TCATCCTCGA GGAAGGCACC
TCACTCGACG ATCTGGCGCG GGTCGCTCAT GCGGTCGGCA CCGCGGCCTT GGCCGCCGGC
GTCAAACTCG TCACCGGCGA CACCAAGGTC GTCGATTCCG GGCACGGCGA CGGAATCTAT
GTGAACACCA CCGGTATCGG GGTGATCGAC CGGCGGGCCG ACATCCGGCC ACAGCGCGCC
ACCGAGGGCG ACGCGGTCAT CGTCAGCGGC GACATCGGCG TCCACGGGGT CGCCGTTATG
AGCTGCCGCG AAGGTCTGGA GTTCGCGACC AGCATCGCCA GCGACACCGC GCCCCTGCAC
GGTCTGGTGG CGGCGATGAT CGAGACCGGC GCCGACATCC ACGCACTTCG CGACCCCACC
CGCGGCGGGA TGGCCGCCAC TCTGAACGAG ATCGCCAAGG CCGCCGAGGT GGGCATGGTG
CTCGACGAAC GATCGATTCC GGTGCCACCG GAGGTGCGCG ACGCCTGCGG CCTGCTCGGC
CTCGATCCGA TGTATGTGGC CAACGAGGGC AAGCTGGTGG CGTTCGTGCC GGCCGCCGAC
GCCGATCGTG TGGTCGAGGC GATGCGGGCA CACCCGCTGG GCGCCCACGC CGCCGTCATC
GGCACCTGCG TCTCCGACCA CCCCGGGATG GTCGTCGCCC GCACCGCACT GGGTGGTACG
CGGGTGGTCG ACCTGCCGAT CGGCGAACAG CTACCCCGGA TCTGTTGA
 
Protein sequence
MRETRAAIDM ESWVCPAPLR DSPNVVMGHG GGGAMSGELI EHLFLPAFGP AADAAMGDSA 
VVEIGGTRLA FSTDSFVVKP MVFPGGTIGE LAVNGTVNDL AMAGATPMVL STAFILEEGT
SLDDLARVAH AVGTAALAAG VKLVTGDTKV VDSGHGDGIY VNTTGIGVID RRADIRPQRA
TEGDAVIVSG DIGVHGVAVM SCREGLEFAT SIASDTAPLH GLVAAMIETG ADIHALRDPT
RGGMAATLNE IAKAAEVGMV LDERSIPVPP EVRDACGLLG LDPMYVANEG KLVAFVPAAD
ADRVVEAMRA HPLGAHAAVI GTCVSDHPGM VVARTALGGT RVVDLPIGEQ LPRIC
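
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any downstream use. The sketch below shows one way to do that and to compute a GC fraction comparable to the "GC content" field. The helper names `join_blocks` and `gc_fraction` are hypothetical, and the demo uses only the first two blocks of the gene sequence rather than the full 1068 bp.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence, used only as a small demo.
demo = join_blocks("ATGCGTGAGA CCCGCGCGGC")
print(len(demo))                    # 20
print(round(gc_fraction(demo), 2))  # 0.75
```

Applied to the full gene sequence, `len(join_blocks(...))` would be expected to equal the stated gene length, and `gc_fraction` should land near the reported 70%, assuming the record rounds its percentage.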