Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_2029 |
Symbol | |
ID | 4268145 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 2300285 |
End bp | 2301400 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638126785 |
Product | hydrogenase (NiFe) small subunit HydA |
Protein accession | YP_742861 |
Protein GI | 114321178 |
COG category | [C] Energy production and conversion |
COG ID | [COG1740] Ni,Fe-hydrogenase I small subunit |
TIGRFAM ID | [TIGR00391] hydrogenase (NiFe) small subunit (hydA) [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGGTG AAACAATGGA AACATTCTAC GATGTCATGC GGCGTCAGGG GATAACGCGC CGCAGCTTCA TGAAGTTCTG CAGCCTTACG GCGGCGGCTC TAGGTCTGGG GCCTCAATTC GCCGGCCGTA TAGCCCACGC CATGGAGAAC AAGCCCCGCA CCCCGGTGCT TTGGGTGCAC GGGCAGGAAT GCACATGCTG CTCCGAGTCC TTCATCCGTT CGGCCCACCC TCTGGCCAAG GACGTCGTGC TGTCGATGAT CTCGCTGGAT TACGATCCAC TGCTGATGGC CGCGGCGGGG GATGATGCGG AGGCCGCCTT GGAGGCGGCG ATAGAGAAGT ACCATGGAAA CTACATCCTG GCCGTAGAGG GTACCCCGGC ATTGGGCCAC GACGGCATGG CGTGCGTTGT GGGCGGGCGG CCATTTCTGG ATCAGCTTAA GCACACTGCG GAAGGTGCCA AGGCGATCAT TTCATGGGGC TCCTGCGCGT CCTGGGGTTG CGTGCAGGCG GCGCGGCCCA ACCCAACCCA GGCTGTCCCT GTCCACAAGG TCATCCGGGA CAAGCCAATC ATCAAGGTCC CGGGCTGTCC GCCCATCGCT GAGGTAATGA CCGGCGTCAT CACCTACATG CTGACGTTCG ATCGCTTGCC AGCATTGGAC CGGCAGGGGC GACCGAAGAT GTTCTATGGG CAACGAATTC ACGATAAGTG TTATCGGCGC CCCCACTTTG ATGCCGGTCA ATTTGCCGAG CAGTGGGACG ATGAGGGGGC CCGTCGTGGA TACTGCCTAT ACAAGTTGGG CTGCAAGGGG CCGACGACCT ATAACGCCTG TTCCACCATG CGTTGGAACA GCGGGGTTTC GTTCCCAATA CAGTCCGGCC ATGGGTGCAT CGGGTGTTCC GAGGATGGCT TCTGGGACAA GGGCTCTTTC TATGACCGGG TGACGAATAT CCACCAGTTC GGTATCGAAT CCAATGCCGA TCGCATCGGC AAGACCGCGG CCGGTGTGGT GGGTGCCGCC GTTGCCGCAC ACGCAGCGGT GAGTGTGGCC AAGCATACCG CGAATAAACG GAAAGAGGTC CGGGAACAGA CCGCCGACCA GGAGGGGGAG AAGTAA
|
Protein sequence | MAGETMETFY DVMRRQGITR RSFMKFCSLT AAALGLGPQF AGRIAHAMEN KPRTPVLWVH GQECTCCSES FIRSAHPLAK DVVLSMISLD YDPLLMAAAG DDAEAALEAA IEKYHGNYIL AVEGTPALGH DGMACVVGGR PFLDQLKHTA EGAKAIISWG SCASWGCVQA ARPNPTQAVP VHKVIRDKPI IKVPGCPPIA EVMTGVITYM LTFDRLPALD RQGRPKMFYG QRIHDKCYRR PHFDAGQFAE QWDDEGARRG YCLYKLGCKG PTTYNACSTM RWNSGVSFPI QSGHGCIGCS EDGFWDKGSF YDRVTNIHQF GIESNADRIG KTAAGVVGAA VAAHAAVSVA KHTANKRKEV REQTADQEGE K
|
| |