Gene Moth_2176 details


Gene Information

Locus tag: Moth_2176
Symbol:
ID: 3831646
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2275672
End bp: 2276688
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 50%
IMG OID: 637830098
Product: hydrogenase expression/formation protein HypE
Protein accession: YP_431008
Protein GI: 83590999
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0309] Hydrogenase maturation factor
TIGRFAM ID: [TIGR02124] hydrogenase expression/formation protein HypE
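
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2275672
end_bp = 2276688

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 bp; the final codon is assumed to be a stop.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1017
print(protein_length)  # 338
```

Under these assumptions the computed values agree with the listed Gene Length (1017 bp) and Protein Length (338 aa).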


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.239453
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGAGA ATGGCAGGGA TATTGTTTTG CTGGCTCACG GTGACGGGGG TGCCCTTACC 
CATGAACTCA TTAATAACCT TTTTCTTCGT CACTTTGAAA GCAATATCCT GAAAACGCTC
ACCGATGCGG CGGTTTTTTC CGGAAATGAA GGGAGAATGG CCATCACTAC GGATTCCTTT
GTTGTCGATC CGCCTTTCTT CCCCGGGGGG GATGTGGGAA AACTGGCGGT TTGCGGTACG
GTAAATGATC TGGCGGTCAG CGGCGCCCGG CCCATCTATC TCACGGCTTC TTTTATTTTA
GAAGAAGGAC TGCCCCTTGC CGATCTTGAA AGCATTGTCC AATCTATGGC CGGGGCCTGC
GGGGTCGCGG GCGTAGAAAT TATCACCGGG GATACCAAGG TGGTTGAAAA AGGTCATTTA
GATAAAATTT TTATTAACAC CACCGGTGTC GGTTATATTT TTCCAGGAGT AGATTTGGGT
TACCACCGGA TTAAACCCGG TGATAAAGTT TTAGTCAACG GCAGCCTCGG GCAACACGGT
ATCGCGGTAC TTTGTAAACG GTATGGCTTT GATTTTGCAG GACAGGTCGT GAGCGATTGT
GCCCCGCTGA ACGGCATAAC CAGCCTGCTT TTAAAGGAAA TCCGGGGGAT CAAGATAATG
CGGGATCTTA CCCGGGGTGG CCTGGCAACA ACCGCCAAAG AAATTGCGTC AGCCTGCGGT
TTAGATATCT GGCTGGACGA AAATTGTATT CCTGTCGATG CCGGGGTAAA GGGTGCCGCC
GAGATGCTGG GACTGGATCC CCTCTACCTT GCCAACGAGG GGAAGTTTAT GGTTATTGTC
AGCCCGGAAG AAGCAGAAAA AGCTGTAGCG GTAATGCAAG GGCATGAATT GGGAAGGGAT
GCTAGTATTA TAGGCGAAGT CAAACCTGGC AAGGGGAACG TTTATCTCTC CACCTCCCTT
GGAGGTACTA AACTCTTAGA TCTTATGGCT GGAAGTCCTC TACCACGAAT TTGTTGA
 
Protein sequence
MKENGRDIVL LAHGDGGALT HELINNLFLR HFESNILKTL TDAAVFSGNE GRMAITTDSF 
VVDPPFFPGG DVGKLAVCGT VNDLAVSGAR PIYLTASFIL EEGLPLADLE SIVQSMAGAC
GVAGVEIITG DTKVVEKGHL DKIFINTTGV GYIFPGVDLG YHRIKPGDKV LVNGSLGQHG
IAVLCKRYGF DFAGQVVSDC APLNGITSLL LKEIRGIKIM RDLTRGGLAT TAKEIASACG
LDIWLDENCI PVDAGVKGAA EMLGLDPLYL ANEGKFMVIV SPEEAEKAVA VMQGHELGRD
ASIIGEVKPG KGNVYLSTSL GGTKLLDLMA GSPLPRIC
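
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It is not the tool used to produce this record. It assumes the amino-acid assignments of translation table 11, which match the standard code for elongation codons, and it strips the spaces that separate the 10-base blocks. The helper names (`clean`, `translate`, `gc_content`) are hypothetical. Only the first and last rows of the gene sequence are used here; the last row starts at base 961, so it is in frame.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()

def translate(seq):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def gc_content(seq):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)

first_row = "ATGAAAGAGA ATGGCAGGGA TATTGTTTTG CTGGCTCACG GTGACGGGGG TGCCCTTACC"
last_row = "GGAGGTACTA AACTCTTAGA TCTTATGGCT GGAAGTCCTC TACCACGAAT TTGTTGA"

print(translate(first_row))  # MKENGRDIVLLAHGDGGALT
print(translate(last_row))   # GGTKLLDLMAGSPLPRIC
```

The two outputs correspond to the first 20 and last 18 residues of the protein sequence above. Passing the full gene sequence to `gc_content` would give the value that the GC content field reports as a rounded percentage.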