Gene Athe_1093 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1093 
Symbol 
ID7409650 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1186720 
End bp1187721 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content37% 
IMG OID643715459 
Producthydrogenase expression/formation protein HypE 
Protein accessionYP_002572967 
Protein GI222529085 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0309] Hydrogenase maturation factor 
TIGRFAM ID[TIGR02124] hydrogenase expression/formation protein HypE 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAGGA CTATTCGTAA GGAACATGGC AATGGCGGAA AACAGACATA TGAGCTGATT 
GACAACCTTT TCAAGCCAAT ATTTGGTTCT GAAATATTGA AAAGCGGTGA TGATTCAACA
GTATTTTCTT TAAGCGGTAT GAAAGTGGGA ATATCAACAG ATTCATTTGT AGTAAAGCCA
TATTTTTTTA AAGGAGGAGA TATTGGAAAG CTTTCAGTAT GTGGTACGGT AAATGACTTA
GCTGTTGCAG GACTTGAACC AAAGTATATT ACCGTTTCTT TTATAATTGA AGAAGGATTT
TCAATAGATG ATTTAGAGAA AATTACAAGA TCAATAAAGA AATATGCAGA TTTGGCAAAG
GTTGAAGTTG TTGCAGGGGA TACAAAGGTT GTAGAAAAAG GTGCGGCAGA TGGGATATTT
ATAAATACGA CTGGATTGGG TGTTGCAAGA GATACCCAAA AAATGCCTTT GATTTCAAGA
ATAAAAGGTA ATCAAGTTGT AATTGTCAGT GGGGATATAG GAAGACATGG AGCATGCATA
TATTCTCACA ATGAAGACCT TGGGTTTGAA GACAGGATAG AATCCGATTG TGGGCTTTTG
TTAGATGTAA TTTCAGAGCT GATGCAAGAG GTAGATGTTG CGTATATGAA AGACCTTACA
AGAGGTGGGC TTGCAACAGC TTTAAATGAG ATTGTCCAAA AGTCTGGCTT TGATATAAAA
GTTGAGGAAG AAAAAATACC TGTGGCAGAT GAGGTAAAAG CTCTGTGCGA TATTCTGGGG
CTGGATAGTT ATTATCTTGC TTGTGAAGGT AGGTTTGTTG CCATAGTAAA TGGTGATGAA
AAAGAGAAAG CTATTGAGAT TTTGAAAAAG TACAACAGGT TTGCATGTGA GATTGGGAAG
GTAGAAGAAA GTAGGGAAAA GAGAGTATAT CTGTCAACCA CCTTTGGCGG TACGAGAATT
TTGGACATGC TTTATTATGA AATGTTACCA AGGATTTGCT AA
 
Protein sequence
MMRTIRKEHG NGGKQTYELI DNLFKPIFGS EILKSGDDST VFSLSGMKVG ISTDSFVVKP 
YFFKGGDIGK LSVCGTVNDL AVAGLEPKYI TVSFIIEEGF SIDDLEKITR SIKKYADLAK
VEVVAGDTKV VEKGAADGIF INTTGLGVAR DTQKMPLISR IKGNQVVIVS GDIGRHGACI
YSHNEDLGFE DRIESDCGLL LDVISELMQE VDVAYMKDLT RGGLATALNE IVQKSGFDIK
VEEEKIPVAD EVKALCDILG LDSYYLACEG RFVAIVNGDE KEKAIEILKK YNRFACEIGK
VEESREKRVY LSTTFGGTRI LDMLYYEMLP RIC