Gene Mlg_2028 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2028 
Symbol 
ID4268144 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2298497 
End bp2300284 
Gene Length1788 bp 
Protein Length595 aa 
Translation table11 
GC content58% 
IMG OID638126784 
Productnickel-dependent hydrogenase, large subunit 
Protein accessionYP_742860 
Protein GI114321177 
COG category[C] Energy production and conversion 
COG ID[COG0374] Ni,Fe-hydrogenase I large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGTCG TTACAACGTC GAACGGATTT ACTCTTAACG ATGGCGGCCG CCGTTTGGTG 
GTTGATCCGG TAACCCGCAT CGAAGGGCAT ATGCGTTGCG AGGTGAACCT GGATGAAAAC
AACGTCATTA CCAACGCTGT TTCCACAGGG ACCTTGTGGC GTGGGTTGGA GGTGATCCTG
AAAGGGCGGG ATCCTCGCGA TGCCTGGGCA TTTACGCAGC GCATCTGCGG GGTGTGCACC
GGAACCCACG CCCTGACTTC GGTGCGTGCG GTGGAAGATG CCCTCGGCAT CAGTATCCCG
GAGAATGCCA ACTCCATTCG CAACATTATG CAACTGGCCC TTCAGGTGCA TGACCATCTG
GTGCATTTTT ACCATCTGCA TGCCTTGGAC TGGGTTGATG TGGTGGATGC GCTTAAGGCG
AGCCCTCAGG CGACGTCCAG TCTTGCCCAG AGCATTTCGA GTTGGCCACT TTCCTCGCCA
GGGTACTTTC GGGACATTCA GCACCGACTG AAGCGATTCG TGGAGTCCGG GCAGCTCGGG
CCCTTCGCCA ATGGCTATTG GGGTAATCCG GCTTATAAGT TGCCGCCGGA AGCCAACCTG
ATGGCGGTTG CACACTACTT GGAGGCCCTC GATTTCCAGA AGGAAATCAC AAAGATTCAT
GCAGTCTATG GCGGAAAGAA CCCGCACCCG AATTGGTTGG TTGGTGGCGT GCCGTGCCCA
ATCAATATGG ACGATACTGG GGCAGGCGGA GCGATCAACA TAGAGCGCCT TAACCTGGTC
TCGGAAATCA TTGATCGTTG CATCCAGTTC GTCGATCAGG TTTATCTCCC TGACCTCAAA
GCCATTGCTT CGTATTATAC GGACTGGCTA TACGGAGGTG GCCTGGCCAG CCGGAGTCTT
CTCTCCTATG GGGATGTGCC GGAGCATGCC AACGATTACA ACAGCCTGCT GATGCCGAAA
GGCGCCATCA TCAACGGTAA GCTTGACGAG GTCCACCCGG TGGACCTGCG GGACCCTGAT
GAAATTCAGG AGTTCGTCAC TCATTCTTGG TACCGCTACG GTGATGACGA TATCGGCCTG
CACCCGTGGG ATGGTATCAC CGAGGCCGAT TACCGGTTGG GCCCCAATAC CAAAGGACGC
CACGATAATA TTCAACAATT GGACGAGGCG GGCAAGTACT CCTGGATTAA GGCGCCGCGT
TGGCGGGGCC ACGCTATGGA AGTCGGCCCT CTGGCCCGCT ATGTGATCGG TTACATGCAG
GGAAACCCGG AATTCAAGGA ACCGACCGAC GCCTTTCTTC ATGACTTGGG TGTTCCGTTG
GAGGCCCTGT TCTCAACCCT CGGCCGGACG GCCGCACGGG GCCTGGAGGC GTCCTGGGCC
GCCCATAAGA TGCGGTACTT CCAGGATAAG CTGGTCGCGA CTATCCGGGC CGGGGATACA
GCCACCGCCA ACGTGGCCAA ATGGGAACCC GACAAATGGC CCTCGGAGAC CCGTGGAGTT
GGGTTCACAG AAGCGCCTCG GGGCGCGTTG GGACACTGGG TGGTAATCAA GAATCGGAAG
ATCGAGAACT ACCAGTGCGT TGTGCCTACC ACATGGAATG GGTCGCCCCG CGACGCCAGG
GGTGAGATCG GGCCGTTCGA AGCCTCCCTG CTGAATACGC CGCTAGCGAA GCGCGACCAA
CCGCTCGAGA TCCTGCGGAC ATTGCACAGC TTCGACCCCT GCCTCGCCTG TTCTACCCAT
GTGATGGGCG ATGACGGCCG GGAACTCACC CGAGTGAAGG TGCGCTAA
 
Protein sequence
MSVVTTSNGF TLNDGGRRLV VDPVTRIEGH MRCEVNLDEN NVITNAVSTG TLWRGLEVIL 
KGRDPRDAWA FTQRICGVCT GTHALTSVRA VEDALGISIP ENANSIRNIM QLALQVHDHL
VHFYHLHALD WVDVVDALKA SPQATSSLAQ SISSWPLSSP GYFRDIQHRL KRFVESGQLG
PFANGYWGNP AYKLPPEANL MAVAHYLEAL DFQKEITKIH AVYGGKNPHP NWLVGGVPCP
INMDDTGAGG AINIERLNLV SEIIDRCIQF VDQVYLPDLK AIASYYTDWL YGGGLASRSL
LSYGDVPEHA NDYNSLLMPK GAIINGKLDE VHPVDLRDPD EIQEFVTHSW YRYGDDDIGL
HPWDGITEAD YRLGPNTKGR HDNIQQLDEA GKYSWIKAPR WRGHAMEVGP LARYVIGYMQ
GNPEFKEPTD AFLHDLGVPL EALFSTLGRT AARGLEASWA AHKMRYFQDK LVATIRAGDT
ATANVAKWEP DKWPSETRGV GFTEAPRGAL GHWVVIKNRK IENYQCVVPT TWNGSPRDAR
GEIGPFEASL LNTPLAKRDQ PLEILRTLHS FDPCLACSTH VMGDDGRELT RVKVR