Gene Tmz1t_3790 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3790 
Symbol 
ID7874032 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp4176691 
End bp4179102 
Gene Length2412 bp 
Protein Length803 aa 
Translation table11 
GC content70% 
IMG OID643700732 
Productpolysaccharide export protein 
Protein accessionYP_002890756 
Protein GI237654442 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1596] Periplasmic protein involved in polysaccharide export 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATACCA CCGAACATTG CAACCGCCTG CGGAGCGCTG CGAAGGCGCT GATCCTGGGC 
TTGGCACTGG CTTGCTCGTC CGTGACGATG GCGCAGAACG CCCCCGCTGC AGCGGGCTTC
GCGGCGGCGG GCGGTGCTCC GGCTGCCGTG GGGAGCGGCG GGGTGGGCAT GGGGGCCCCT
ACCTTGTCCG GTGGCGCGCC GGGTGCGGCC GTCGGTGCCG CCAGCGGTGC TGCGGCATCT
GCCGGTGCCG CGGACGGATT CGAGACCCAG GCTGCTCCGA CCGTGATCGG CGGTGAAGGC
CTGAGCCCCT CCCAGGATGC CGCGCTCGAC CAGGCGGTGA CGGTCGACGC CGACGGCAAC
GACCGCGTGG GCGGCACGGT GCGCGCCGCG CGGGTGCTCA ACGACTTCCA GCAATTCGTG
GCGCGCAGCA CCGGGCAGGT GCTGCCGCTG TACGGTTCGG GCTTCTTCGC GGGCTCGCGA
GTGTTCAACA GCCCCACCGC GCCGGTCGCC GACGACTACG TGCTCGGCCC GGGCGACCAG
GTGCTGGTGC GGATCTGGGG CGCCTTCGAG TCGCAGACCC GGGCGCAGAT CGACCGCAGC
GGCATGATCA CGCTGCCGAC CATCGGCCCG GTGAGCCTGG CCGGGGTGCG GATCGCCAAC
GCGGTGCAGG TGATCGAGAA TCAGGTGGGC CGCATCTACC GCGACGTGAG CGTCAGCGTG
AGCCTGGACC GGGTGCGCGG CATCACCGTG TTCGTGGTGG GCCAGGCGCG GCGTCCGGGC
ACCTACACGG TGTCGGGCAA TTCGACCCTG ATCGGCGCGC TGTTCCAGAG TGGCGGCCCG
GGGGCCAATG GTTCGCTGCG CCGCGTGCAG GTCAAGCGCG ACAACCAGGT GATCACCGAG
ATCGACCTGT ACCGCTTCCT CGCCAACGGC GACACCTCGG CCGACATCCG CCTGGTCGAT
GGCGACGTGA TCGTGATCCC CCCGGCGCAC GGCTACGTGG CGCTGACCGG GCAGGTGAAG
GCGCCGGCGA TCTACGAGCT CAAGGACCGT ACGGACACGC TGCGCAGCGT GCTGACGGTG
TCGGGCGGGC TGCCCGTCGT GGCCGACCCG CGCCTGGCCT TCGTCGAGCG CCTGGACCCG
AGCGCGGACC AGCCGCGCTC GGTGTTCGAG GTGTCGCTGC AGCCGGGGCA GCCCGACTTC
GTGCTGAAAT CCGGCGACCT GGTCGCGGTG CAGCCCATCC TGGCCGAGTT CGCCAATGCG
GTGACGCTGC GCGGTGGGGT GAGCGCACCG GTGCGCCTGC CCTACCGCGC CGGCATGCGG
ATCTCGGACC TGATTCCGGA CAAGGCCACC CTGATCAACC GCTACGTGGT GGACAACCAG
AACCGCAGCC TGCTCGATCG CGGCAGCTTC GTCGGCGATG TCGGCAACCT CTTCGTCGAC
ATCAACATGG ATTACGCGGT GGTCGAGCGG CTCGAGCGCC CGCAGATGGC GCTCAAGCTG
ATCCCGTTCA GCCTCAACGG GCTGTTCGCG GACCCCAACG GCCCGGACAA CCTGCGCCTG
CAGGCGGGCG ACACGATCTC GATCTTCACC GCGGGCGACG TGCGCGTGCC GGTGAGCCGC
CGCCGCGTGG TGATGCGGGT GGAGGGCGAG GTGAACCGCC CCGGGGTGTA TGTGGCCGAG
CCGGGCGAGA CCCTGGTGAA CATCATCGAG AAGGCGGGCG GGCCGACCGC GGACGCCTAC
CTGTTCGGCG CCGAGTTCTA CCGCGAGTCG GTGCGCAGGT CGCAGCAGGC CAACCTCGAC
AAGCTGGTGC AGCGCCTGGA GCAGCAGGCA GTTGCCGAGT CGGCGCGGGT TTCCGCCAAC
ATCATCGGGG ACGCCCAGGC CGTGGCCCAG GCGCAGGCGC AGCTGCGCGC CGAGCGCGAG
GCGCGTGGGC GTTTCCTCGC CCGCATGCGC ACGCTGAAGT CCTCGGGCCG GATGTCGCTC
GGCCTGCCGG CGGACGAGCC GAGCTTCGCG CAGATCCCCG GCTTCCGGCT GGAGAACGGC
GATCGCCTGG TGATCCCGAA TCGGCCCGAT TTCGTGCAGG TGTTCGGTGC GGTCAATACC
GAGTCTGCCC TGCTGTGGCG GCCCAGCCGC ACCGTTTCCG ACTACCTGGA GCAGGCCGGC
ATGAGCCGCG AGGGCGACCG CAGCGCGGCC TTCGTGCTGC GCGCCGACGG CACGGTGGTG
GCCGAAACGG GCAGTTGGTT CAGCAGCGTG ATGGGGACGA CGGTGCTGCC GGGCGACATC
ATCGTGATTC CGGAGCTGAT CGATCGCGAG TCGGGCTGGA CGGCCTTCGC GCGCATTGCC
AAGGACTGGA CGCAGATCTT CGCCAACCTC GGCCTCGGTG TGGCCGCGGT GCGCTCGATC
GAGAACGACT GA
 
Protein sequence
MNTTEHCNRL RSAAKALILG LALACSSVTM AQNAPAAAGF AAAGGAPAAV GSGGVGMGAP 
TLSGGAPGAA VGAASGAAAS AGAADGFETQ AAPTVIGGEG LSPSQDAALD QAVTVDADGN
DRVGGTVRAA RVLNDFQQFV ARSTGQVLPL YGSGFFAGSR VFNSPTAPVA DDYVLGPGDQ
VLVRIWGAFE SQTRAQIDRS GMITLPTIGP VSLAGVRIAN AVQVIENQVG RIYRDVSVSV
SLDRVRGITV FVVGQARRPG TYTVSGNSTL IGALFQSGGP GANGSLRRVQ VKRDNQVITE
IDLYRFLANG DTSADIRLVD GDVIVIPPAH GYVALTGQVK APAIYELKDR TDTLRSVLTV
SGGLPVVADP RLAFVERLDP SADQPRSVFE VSLQPGQPDF VLKSGDLVAV QPILAEFANA
VTLRGGVSAP VRLPYRAGMR ISDLIPDKAT LINRYVVDNQ NRSLLDRGSF VGDVGNLFVD
INMDYAVVER LERPQMALKL IPFSLNGLFA DPNGPDNLRL QAGDTISIFT AGDVRVPVSR
RRVVMRVEGE VNRPGVYVAE PGETLVNIIE KAGGPTADAY LFGAEFYRES VRRSQQANLD
KLVQRLEQQA VAESARVSAN IIGDAQAVAQ AQAQLRAERE ARGRFLARMR TLKSSGRMSL
GLPADEPSFA QIPGFRLENG DRLVIPNRPD FVQVFGAVNT ESALLWRPSR TVSDYLEQAG
MSREGDRSAA FVLRADGTVV AETGSWFSSV MGTTVLPGDI IVIPELIDRE SGWTAFARIA
KDWTQIFANL GLGVAAVRSI END