Gene Tmz1t_1800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_1800 
Symbol 
ID7085770 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp2021677 
End bp2023326 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content74% 
IMG OID643698822 
ProductErfK/YbiS/YcfS/YnhG family protein 
Protein accessionYP_002355448 
Protein GI217970214 
COG category[S] Function unknown 
COG ID[COG2989] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.081998 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAAGCA AGCCCGGTCT GGGCAAATGT TTTGTAACTG CCGCGGCCCT CGCCGCGCTG 
GCCCTCATGG CGTCCGGCGT GCATGCGACA GCGGCTGGCG AGGCCGCGGT GGTCGCGGTC
GAGCCGGCCG CGAGCGACTC CGCGCTCGCG CTGCGCATCG AGCAGCGCCT GCGCGAGCGC
GCCGCGACGG CCGACGACCC CATCGCGCTG TTCTACCTCG CGCGTGCCTA CCGCTCGGTG
TGGACCGAAC CTGCGCGCGT GCACGCCCTG CTCGCGGCGG TCGAGGCGGT ACGCGGGCAT
GGGCTGGATG CGGCGGATTT CGCCCCCGCC CGCCTGCGCG CCGGTGCCGT CCCCGCGGCC
GATCCCGAGC GTGCGGCCGA GCGCGAGCTC TTGCTCACCG ACACCCTGGC CGCGCTGCTC
TTCCAGCTCC GCCATGGCAA GGTCGACCCG CGCGCGCTCT ACCGCGAGTG GAACTTCACC
CCGCCGCCCA GGCCCTACGA GCGTGCGGCC GAGCTTGCAC GCGTGCTGCA GGCGCCCGAT
CTGGCCGCCG CGGTCGACGC GTACGCGCCG GACCTGCCGC TCTACCGCGC GCTGCGCGCC
GAGCTGCTCG CCCAGCAAGG CCGGCTCGCC GTGGGCGACT GGCCCAAGGT CGCCGCCGGG
CCCACCCTCA AGCCCGGTGC AAGCAGCTCG CGCGTGGCCT CGTTGCGCGC GCGCCTGGCC
GCCGCGGGCG AGCGCGTGTC CGAGGCGCGC GACAAGTCCC ACTACGACGA AGCTCTGGTC
GAGGCGGTCA AGCGCTTCCA GGCCGCGCAC GGCCTGCAGG CCGACGGCGT GCTCGGGGCG
CAGACCCTGG AGGCGCTCAA CGCCAGCCCG GCGCAGCGCG TGGCGCAGAT CCGCGCCAAC
CTCGAGCGCC TGCGCTGGGT GGCGAGCGAC CTGCAGGGCG ACCGCCTGCT GGTGGACATC
GTCGGCTACC ACGCCGACCT CGTGCTCGAC GGCCAGCCGG TGTGGTCCTC GCGGGTGATC
GTCGGCAAGC CCAAGCGGCG CACCCCCTCG CTGCTCGACA GCGTCACCCA TCTGGTGCTC
AACCCGAAGT GGGTGGTGCC ACCCACCATC CTGCGCGAGG ACGTGATTCC GGGCGCAGCG
CGCAACCCGT CCTATCTCGC CAACCGGCGC CTGCGCGTGG TCGATCGCAG CGGGCAGACG
GTGGACCCCG CCACCATCGA CTGGAGCGGG GCGCGCCAGA GCGGTTTTCC CTATCGCGTC
GAGCAGCAGT CCGGTGCCGA CGGCTCGCTC GGGCGGATCA AGTTCTCGCT CTCCAACCCC
TACGTGATCT ACCTGCACGA CACCAACGCG CGCTCCCTGT TCAAGCGCGC CGAGCGTGCG
CTCAGCTCGG GCTGCGTGCG CGTGGAGAAG CCCGAGGAGC TGGCGGTGCT GCTGCTCGCC
GACAGCGGGC GCTGGAGCGC GCAGGCGCTG CAGGCGGCGC TCGACAGCGG GCGCACGCGC
ACCGTGGACG TGGGGCGCGA CGTCAAGGTG TTGCTGCACT ACGCCACCGC GGCGCTCGAC
GAGGCGGGCA GGGTGCTGCT GCGCAACGAC ATCTACGGCT ACGACGCGGC GATCGTGGCC
GCGCTCGATG CGCCCGCGCC GGCGCGCTGA
 
Protein sequence
MQSKPGLGKC FVTAAALAAL ALMASGVHAT AAGEAAVVAV EPAASDSALA LRIEQRLRER 
AATADDPIAL FYLARAYRSV WTEPARVHAL LAAVEAVRGH GLDAADFAPA RLRAGAVPAA
DPERAAEREL LLTDTLAALL FQLRHGKVDP RALYREWNFT PPPRPYERAA ELARVLQAPD
LAAAVDAYAP DLPLYRALRA ELLAQQGRLA VGDWPKVAAG PTLKPGASSS RVASLRARLA
AAGERVSEAR DKSHYDEALV EAVKRFQAAH GLQADGVLGA QTLEALNASP AQRVAQIRAN
LERLRWVASD LQGDRLLVDI VGYHADLVLD GQPVWSSRVI VGKPKRRTPS LLDSVTHLVL
NPKWVVPPTI LREDVIPGAA RNPSYLANRR LRVVDRSGQT VDPATIDWSG ARQSGFPYRV
EQQSGADGSL GRIKFSLSNP YVIYLHDTNA RSLFKRAERA LSSGCVRVEK PEELAVLLLA
DSGRWSAQAL QAALDSGRTR TVDVGRDVKV LLHYATAALD EAGRVLLRND IYGYDAAIVA
ALDAPAPAR