Gene Tmz1t_3926 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3926 
Symbol 
ID7873572 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp4321989 
End bp4323050 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content73% 
IMG OID643700863 
Producthydrogenase expression/formation protein HypE 
Protein accessionYP_002890886 
Protein GI237654572 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0309] Hydrogenase maturation factor 
TIGRFAM ID[TIGR02124] hydrogenase expression/formation protein HypE 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.224134 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACACCG TCAAAACCGG CTACGTCCGC CCCCTGGACC TGCGCAAGGG CCGCGTCGAC 
CTCGGCTTCG GCGCCGGCGG GCGGGCGATG GCGCAATTGA TCTCCGAACT TTTCCTGCGC
GCGTTCGCCA ACGACTGGCT CGCCCGCGGC GACGACGGCG CGGTGCTGCC CGCTCCCGCA
GCAGGCGAGC GCCTGGTGAT GGCGACCGAC GCCCACGTGG TGAGCCCGCT GTTCTTCCCC
GGCGGCGACA TCGGCAGCCT GTCGGTGCAT GGGACGGTGA ACGACCTCGC GGTGATGGGC
GCACGCCCGC TGTACCTCGC CGCCAGCTTC ATCCTGGAAG AAGGCTATGC GCTCGCCGAC
CTCGCCCGCA TCGTCGAATC GATGGCCTCC GCGGCGCGCG CGGCCGGGGT GGCGGTGGTG
ACCGGCGACA CCAAGGTGGT CGAACAGGGC AAGGGCGACG GCGTGTTCAT CACCACCACC
GGGGTGGGGG CGCTGCCGGC CGGGCGCGAT CCGGGCGGCG CACGGGCGCG GCCGGGCGAC
GTGGTGCTGG TGTCGGGGCG CATCGGCGAC CATGGCATGG CAATCATGGC GCAGCGCGAG
TCGCTCGCCT TCGACTCCGA GATCGTCTCC GACAGCGCGG CGCTGCACGG CCTGGTCGAG
GCGCTCTACG CCGCGGTGCC GGCCGAGGCG GTCCGCGTGC TGCGCGACCC CACGCGCGGC
GGACTGGCGA CCACCTTGAA CGAGATCGCC GCGCAGTCGG GCGTGGGCAT GGAGCTCGAC
GAGGCGGAGA TCCCGGTGTC GGCGCAGGTG CAGGCTGCCT GCGAGCTGCT CGGGCTCGAC
CCGCTCTACG TCGCCAACGA GGGCAAGCTG GTGGTGCTGT GCGCTCCGGA GCACGCCGGC
GCGGCGCTCG CCGCGCTGCG CGCGCATCCG CTCGGCACCG AGGCGGCGCG GATCGGATGC
GCCACCGCCG ACCCGCAGCA CTTCGTGCAG CTGCGCACCG GCCTGGGCGG GCGGCGCATG
GTGGACTGGA TCGCCGGCGA GCAGCTGCCG CGGATCTGTT GA
 
Protein sequence
MNTVKTGYVR PLDLRKGRVD LGFGAGGRAM AQLISELFLR AFANDWLARG DDGAVLPAPA 
AGERLVMATD AHVVSPLFFP GGDIGSLSVH GTVNDLAVMG ARPLYLAASF ILEEGYALAD
LARIVESMAS AARAAGVAVV TGDTKVVEQG KGDGVFITTT GVGALPAGRD PGGARARPGD
VVLVSGRIGD HGMAIMAQRE SLAFDSEIVS DSAALHGLVE ALYAAVPAEA VRVLRDPTRG
GLATTLNEIA AQSGVGMELD EAEIPVSAQV QAACELLGLD PLYVANEGKL VVLCAPEHAG
AALAALRAHP LGTEAARIGC ATADPQHFVQ LRTGLGGRRM VDWIAGEQLP RIC