Gene Tmz1t_3636 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3636 
Symbol 
ID7873141 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp3992842 
End bp3994482 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content73% 
IMG OID643700577 
Producthypothetical protein 
Protein accessionYP_002890606 
Protein GI237654292 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCCCGC CAGGCGGGAC GGCGGGTGCG CCGGCCGCGG CGGCATCCGT CGGGGCGCAT 
GGCGCGATCG CGCGCGGGCT CGCCCTCGCC GTGCTGGCGC TGGCCGGGGT GTTCGTGCTG
CTCACCTTCG ACCAGCACGG CATCAGCAAC GACGAGGAGG TCCAGCATGT CTACGGCCGC
CTGCTGCTCG ACTTCTACGC CTCCAGCTTC GCCGACCGCC AGGCCTTCGA ATACAAGAAC
CTCTATCTTT ACGGGGGCTT CTTCGACCTC CTCGCGGCGG CCTTCGACCG CGCCGGGGTG
GCCGAAGGGC CGGCGCTGTG GGACCTGCGC CACCTGATCT CGGCCGTCTT CGGTCTCCTC
GGGCTGGCCG GCACCTGGCT GCTCGCGCGC CGGCTCGCGG GCGAGTGGGC GGGGCTGGCG
GCGCTCGTGC TGTTGTCGAT CACCGGCTCG TGGTCGGGCG CGATGTTCAC CCACACCAAG
GACATCCCCT TCGCCACTAC GATGCTGTGG GCGCTGTACT TCAGCGTGCG CGTGCTCGAC
ACGCTTCCCG CGCCGCCGTG GCGCGTGCTC GCGGGGCTGG GTGTGGCGCT CGGCTGCGCC
TTCGGCCTGC GCATCGGTGC GGTGTTCGCG GTGTTCTACC TTGGCGTCGG CGTGCTCGCG
GCGACGGCCT TGCAGCCGGG CGGGCGGGTG CGTTTCCTGC TTCGCGGCGT GCTGGCGCTG
CTGCCGGCGG CGGCGATCGC GCTGGCGCTG GGCGCGCTGT TCTGGCCGTG GGCGGCGATG
GAGCCGGGTA ACGTGCTCAC GGCGATGCGC GCGTTCTCGC ATTTCAGCTT CGAGCTCGAC
ACCGTGCTGG CCGGGCGCGT GATGAACGTC GGCGAGGTCC CGGGGCATTA CCTCGCGGCC
TACCTGCTGG TGCGCCTGCC GGAGCTTTTC CTCGCCGGGC TCGCGCTCGC GCTGCTGCTC
GGCGTGCGCG CCGTGCCGGC GCTCGCTGGC GAGCAGGCCC TGCGCGCGGC CCTGCCGTGG
CTGCCGGTGG TGCTGGCAGC GCTGTTCCCG CTCGTCTACA CCCTGCTTGC GGCGCCGCCG
CTGTACAACG GGCTGCGCCA TTTCAGCTTC GTGCTGCCGC CACTGGCGGT GCTGGCGGGC
ATGGGGCTCG TGCGCGCGTG GCACGGCCTG AGCGTCCGTC CGCCGCTGCT GCGCCGCGCC
GTGCTGGGCG CGTGCGCCCT GGCGGTCCTC GGCCAACTCG GCCAGCTCGC CCGCCTGCAT
CCCTACGAGT ACCTCGCCTA CAACCGCCTC GCGGGCGGCG TGCAGGGGGC GGTCGGGCGC
TGGGAGCAGG ACTACTGGGC GAGCAGCCTG CGCGAGGCGG TGCACGCCCT CAACGCCCTG
GTCGCGCGCG AGGGGGGGGC AGGGCGACAC TATTCCGTGG CCGTGTGCGC CGAGCCGCTG
CAAGCCCAGG TGTGGCTCGC GCCCGGGTTG CGCGCGACGC GCGACTGGTG GGGGGCGGAC
TTCTACCTCT CCCCCACCCA CATGGGTTGC GACGAGGCCA TGCGGGGCCG CGTGGTGGCG
CAGGTCGAGC GTGCGGGGCT GGTGCTGGCG GTGGTCAAGG ATCGGCGCGC GCTGGTTGGC
GAGGAGCGGC GGCCCCGATG A
 
Protein sequence
MIPPGGTAGA PAAAASVGAH GAIARGLALA VLALAGVFVL LTFDQHGISN DEEVQHVYGR 
LLLDFYASSF ADRQAFEYKN LYLYGGFFDL LAAAFDRAGV AEGPALWDLR HLISAVFGLL
GLAGTWLLAR RLAGEWAGLA ALVLLSITGS WSGAMFTHTK DIPFATTMLW ALYFSVRVLD
TLPAPPWRVL AGLGVALGCA FGLRIGAVFA VFYLGVGVLA ATALQPGGRV RFLLRGVLAL
LPAAAIALAL GALFWPWAAM EPGNVLTAMR AFSHFSFELD TVLAGRVMNV GEVPGHYLAA
YLLVRLPELF LAGLALALLL GVRAVPALAG EQALRAALPW LPVVLAALFP LVYTLLAAPP
LYNGLRHFSF VLPPLAVLAG MGLVRAWHGL SVRPPLLRRA VLGACALAVL GQLGQLARLH
PYEYLAYNRL AGGVQGAVGR WEQDYWASSL REAVHALNAL VAREGGAGRH YSVAVCAEPL
QAQVWLAPGL RATRDWWGAD FYLSPTHMGC DEAMRGRVVA QVERAGLVLA VVKDRRALVG
EERRPR