Gene Tmz1t_1114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_1114 
Symbol 
ID7084643 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp1216364 
End bp1217659 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content69% 
IMG OID643698129 
Productpolysaccharide export protein 
Protein accessionYP_002354769 
Protein GI217969535 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1596] Periplasmic protein involved in polysaccharide export 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0366205 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCGCT TCACCACCGA CCGGATCGCC CACTGCGGCG CATCCAATGG CGCCGTGCGC 
GCCACCGACG CCAGTGGCCT TGACGCCCGC GCGGCCGGTC CCGCCACCCG CGCGGACGCC
ATCCGCAGCG GCGGTCTCGT CCGCACGGCC GTGATCGCCT TCGCCGCCGC AGCCATCGGC
GGCTGCACCC TCATCCCCGG CACCAGCGGC AGCCTCATGC GCGAGGAGTC CAGCGTGCCG
CTGCCGGTCA CCGAGGGCAC CGACACCGTC CCGGCCAACG TCAAGCTCAA GCCCATCACC
GCCGAGCTGA TCATCGAACA GCACCAGGCC AAGGAGTCGC GCTTCCGCAG CGGCAAGGCC
GGCGGCCAGA CCGCCACCCG CGCGAGCGAT CCCAAGAAGA TCCCCGGCTT CGACTACCAG
GACTACAAGC TCGGCCCCGG CGACATCATC AACGTCATCG TCTGGGACCA CCCCGAGATC
ACCATCCCGG CCGGCTCCTA CCGCTCGGCC GAGCAGTCCG GCACCCTGGT CGCCGAAGAC
GGCACCATCT TCTTCCCCTT CGCCGGCGTG GTGAAGGTCG CCGGCCTCAC CACCCGCGAG
GTGCGCCAGG TCCTCGCCAA GCGCATGGCC AGCGTCATCG AAAACGTCCA GCTCGACGTG
CGCATCGTCT CCTTCCGCAG CAAGCGCGTG TATGTCGTCG GCGAAGTCGC CAAGCCCGGC
CTGCAGCCGA TCGACGACAT CCGCATGACC CTGGTCGAGG CCATCAACCG CGCCGGCAAC
ATCACCGAAG AGGCCGACCA CGGCAACGTG CTGCTCACCC GCAACGGGCA GACCTGGCGG
GTCGACCTGC AGGCGCTGTA CGAAGAGGGC GACGTCAGCC AGAACGTGCT GCTGCAGCCG
GGCGACATCA TCAACGTGCC CGACCGCCAG CTCAACAAGG TCTTCGTGCT CGGCGAGGTG
CGCAACCCGG GCTCGTTCGT CATGAACAAG CGCCGCACCA CGCTCGCCGA GGCGCTGTCC
GACGCCGGCT TCGTCAACCA GAGCACGTCC GACCCGGCCT GGGTGTATGT GATGCGCAGC
GACAACGGCA GCGCCGAGCT CTTCCACCTC AACGCCCGCT CCCCCGACGC GCTGCTGCTG
GCCGAGCGCT TCCCGCTGCT GCCGCGGGAC GTGGTGTATG TCGATGTGGC GGCGATCGCG
CGCTGGAACC GCGTGGTCAG CAACATCCTG CCGACCTCGC AGATGCTGCA GCTCACCAGC
GAGACGCGGT ATCCGCTGTT CGGCGGCCGG CAGTAA
 
Protein sequence
MSRFTTDRIA HCGASNGAVR ATDASGLDAR AAGPATRADA IRSGGLVRTA VIAFAAAAIG 
GCTLIPGTSG SLMREESSVP LPVTEGTDTV PANVKLKPIT AELIIEQHQA KESRFRSGKA
GGQTATRASD PKKIPGFDYQ DYKLGPGDII NVIVWDHPEI TIPAGSYRSA EQSGTLVAED
GTIFFPFAGV VKVAGLTTRE VRQVLAKRMA SVIENVQLDV RIVSFRSKRV YVVGEVAKPG
LQPIDDIRMT LVEAINRAGN ITEEADHGNV LLTRNGQTWR VDLQALYEEG DVSQNVLLQP
GDIINVPDRQ LNKVFVLGEV RNPGSFVMNK RRTTLAEALS DAGFVNQSTS DPAWVYVMRS
DNGSAELFHL NARSPDALLL AERFPLLPRD VVYVDVAAIA RWNRVVSNIL PTSQMLQLTS
ETRYPLFGGR Q