Gene Tmz1t_3631 details


Gene Information

Locus tag: Tmz1t_3631
Symbol:
ID: 7873136
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 3988319
End bp: 3989410
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 66%
IMG OID: 643700572
Product: GTP-dependent nucleic acid-binding protein EngD
Protein accession: YP_002890601
Protein GI: 237654287
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0012] Predicted GTPase, probable translation factor
TIGRFAM ID: [TIGR00092] GTP-binding protein YchF
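
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. Both are assumptions, though they agree with the values listed in this record. The function names are illustrative only.

```python
def cds_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_bp, has_stop=True):
    """Number of amino acids encoded by a CDS of cds_bp bases.

    Assumes the CDS is a whole number of codons; the final codon is
    treated as a stop codon (not translated) when has_stop is True.
    """
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_bp // 3
    return codons - 1 if has_stop else codons


length = cds_length(3988319, 3989410)
print(length)                  # 1092, matches "Gene Length"
print(protein_length(length))  # 363, matches "Protein Length"
```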


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCTGA AATGCGGAAT CGTCGGCCTG CCCAACGTCG GCAAGTCGAC CCTCTTCAAC 
GCGCTGACCA AGGCCGGCAT CCAGGCCGAG AACTACCCCT TCTGCACCAT CGAGCCCAAC
GTCGGCATCG TCGAGGTGCC GGATCCGCGC CTGGCCGCGC TGTCCGAGAT CGTCAAGCCG
CAGAAGATCC AGCCCGCCAT CGTCGAGTTC GTCGACATCG CCGGCCTGGT TGCCGGCGCC
TCCAAGGGAG AAGGCCTGGG CAACCAGTTC CTCGCCAACA TCCGCGAGAC CGACGCCATC
GTGCACGTCG TGCGCTGCTT CGCGGACGAC AACGTGATCC ACGTCTCCGG CAGTGTCGAC
CCGATCCGCG ACATCGAGGT CATCGACACC GAGCTCGCCC TCGCCGACAT GGCCACCGTG
GAGAAGGCGC TCAACCGCTA CAAGCGCCCT GCCGCCTCGG GTGACAAGGA GGCCAAGATC
CTCGTCGCCG TGCTCGAGAA GTGCTTCGCC CAGCTCGACC AGGGCAAGGC CGTGCGCGCG
CTCGACCTGT CGAAGGAAGA ATGGGCCAGC CTCAAGCCCT TCTGCCTGAT CACCGCCAAG
CCGGTGCTCT ACGCCGCCAA CGTCGCCGAG GACGGCTTCG AAAACAACCC GCACCTCGAC
GCCGTGCGCG CCCACGCCGC CGCCGAGGGC GCCGAAGTGG TCGCGCTGTG CGCCGCGATC
GAGGCCGAGA TCGCCGACCT CGAGGACGCC GACAAGAAGG AATTCCTCGA GACCATGGGC
CTGGAAGAAC CCGGCCTCGA CCGCCTGATC CGCGCCGGCT ACAAGCTGCT CGGCCTGCAG
ACCTACTTCA CCGCCGGCGT CAAGGAAGTG CGCGCGTGGA CCATCCACGT CGGCGACACC
GCCCCGCAGG CCGCCGGCGT CATCCACACC GACTTCGAGC GCGGCTTCAT CCGCGCCCAG
ACCATCGCCT ACGACGACTT CATCCAGTAC AAGGGTGAGG CCGGCGCCAA GGAAGCGGGC
AAGATGCGCG CGGAAGGCAA GGAATACGTG GTCAAGGACG GCGACGTGCT GAACTTCCTG
TTCAACGTCT GA
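
The record lists a GC content of 66% for this gene. A minimal sketch of how such a figure could be recomputed from a sequence in the space-separated layout used above is shown here. The helper name `gc_content` is hypothetical, and the database may compute or round its value differently.

```python
def gc_content(dna):
    """Percentage of G and C bases in a DNA string.

    Whitespace (spaces and newlines between the 10-base groups)
    is ignored; case is normalised to upper case.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Small illustrative input, not the gene above.
print(gc_content("ATGC"))  # 50.0
```

Pasting the full gene sequence into `gc_content` gives a value that can be compared with the listed 66%.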
 
Protein sequence
MSLKCGIVGL PNVGKSTLFN ALTKAGIQAE NYPFCTIEPN VGIVEVPDPR LAALSEIVKP 
QKIQPAIVEF VDIAGLVAGA SKGEGLGNQF LANIRETDAI VHVVRCFADD NVIHVSGSVD
PIRDIEVIDT ELALADMATV EKALNRYKRP AASGDKEAKI LVAVLEKCFA QLDQGKAVRA
LDLSKEEWAS LKPFCLITAK PVLYAANVAE DGFENNPHLD AVRAHAAAEG AEVVALCAAI
EAEIADLEDA DKKEFLETMG LEEPGLDRLI RAGYKLLGLQ TYFTAGVKEV RAWTIHVGDT
APQAAGVIHT DFERGFIRAQ TIAYDDFIQY KGEAGAKEAG KMRAEGKEYV VKDGDVLNFL
FNV
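
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below builds the codon table by hand rather than using a library. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons), and it assumes the sequence is given on the coding strand and in frame. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon assignments in TCAG order; identical for tables 1 and 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGAGCCTGA AATGCGGAAT CGTCGGCCTG CCCAACGTCG GCAAGTCGAC CCTCTTCAAC"
print(translate(first_line))  # MSLKCGIVGLPNVGKSTLFN
```

The output matches the first 20 residues of the protein sequence, and the last line of the gene (`TTCAACGTCT GA`) translates to `FNV` followed by the `TGA` stop codon.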