Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3636 |
Symbol | |
ID | 7873141 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3992842 |
End bp | 3994482 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643700577 |
Product | hypothetical protein |
Protein accession | YP_002890606 |
Protein GI | 237654292 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCCCGC CAGGCGGGAC GGCGGGTGCG CCGGCCGCGG CGGCATCCGT CGGGGCGCAT GGCGCGATCG CGCGCGGGCT CGCCCTCGCC GTGCTGGCGC TGGCCGGGGT GTTCGTGCTG CTCACCTTCG ACCAGCACGG CATCAGCAAC GACGAGGAGG TCCAGCATGT CTACGGCCGC CTGCTGCTCG ACTTCTACGC CTCCAGCTTC GCCGACCGCC AGGCCTTCGA ATACAAGAAC CTCTATCTTT ACGGGGGCTT CTTCGACCTC CTCGCGGCGG CCTTCGACCG CGCCGGGGTG GCCGAAGGGC CGGCGCTGTG GGACCTGCGC CACCTGATCT CGGCCGTCTT CGGTCTCCTC GGGCTGGCCG GCACCTGGCT GCTCGCGCGC CGGCTCGCGG GCGAGTGGGC GGGGCTGGCG GCGCTCGTGC TGTTGTCGAT CACCGGCTCG TGGTCGGGCG CGATGTTCAC CCACACCAAG GACATCCCCT TCGCCACTAC GATGCTGTGG GCGCTGTACT TCAGCGTGCG CGTGCTCGAC ACGCTTCCCG CGCCGCCGTG GCGCGTGCTC GCGGGGCTGG GTGTGGCGCT CGGCTGCGCC TTCGGCCTGC GCATCGGTGC GGTGTTCGCG GTGTTCTACC TTGGCGTCGG CGTGCTCGCG GCGACGGCCT TGCAGCCGGG CGGGCGGGTG CGTTTCCTGC TTCGCGGCGT GCTGGCGCTG CTGCCGGCGG CGGCGATCGC GCTGGCGCTG GGCGCGCTGT TCTGGCCGTG GGCGGCGATG GAGCCGGGTA ACGTGCTCAC GGCGATGCGC GCGTTCTCGC ATTTCAGCTT CGAGCTCGAC ACCGTGCTGG CCGGGCGCGT GATGAACGTC GGCGAGGTCC CGGGGCATTA CCTCGCGGCC TACCTGCTGG TGCGCCTGCC GGAGCTTTTC CTCGCCGGGC TCGCGCTCGC GCTGCTGCTC GGCGTGCGCG CCGTGCCGGC GCTCGCTGGC GAGCAGGCCC TGCGCGCGGC CCTGCCGTGG CTGCCGGTGG TGCTGGCAGC GCTGTTCCCG CTCGTCTACA CCCTGCTTGC GGCGCCGCCG CTGTACAACG GGCTGCGCCA TTTCAGCTTC GTGCTGCCGC CACTGGCGGT GCTGGCGGGC ATGGGGCTCG TGCGCGCGTG GCACGGCCTG AGCGTCCGTC CGCCGCTGCT GCGCCGCGCC GTGCTGGGCG CGTGCGCCCT GGCGGTCCTC GGCCAACTCG GCCAGCTCGC CCGCCTGCAT CCCTACGAGT ACCTCGCCTA CAACCGCCTC GCGGGCGGCG TGCAGGGGGC GGTCGGGCGC TGGGAGCAGG ACTACTGGGC GAGCAGCCTG CGCGAGGCGG TGCACGCCCT CAACGCCCTG GTCGCGCGCG AGGGGGGGGC AGGGCGACAC TATTCCGTGG CCGTGTGCGC CGAGCCGCTG CAAGCCCAGG TGTGGCTCGC GCCCGGGTTG CGCGCGACGC GCGACTGGTG GGGGGCGGAC TTCTACCTCT CCCCCACCCA CATGGGTTGC GACGAGGCCA TGCGGGGCCG CGTGGTGGCG CAGGTCGAGC GTGCGGGGCT GGTGCTGGCG GTGGTCAAGG ATCGGCGCGC GCTGGTTGGC GAGGAGCGGC GGCCCCGATG A
|
Protein sequence | MIPPGGTAGA PAAAASVGAH GAIARGLALA VLALAGVFVL LTFDQHGISN DEEVQHVYGR LLLDFYASSF ADRQAFEYKN LYLYGGFFDL LAAAFDRAGV AEGPALWDLR HLISAVFGLL GLAGTWLLAR RLAGEWAGLA ALVLLSITGS WSGAMFTHTK DIPFATTMLW ALYFSVRVLD TLPAPPWRVL AGLGVALGCA FGLRIGAVFA VFYLGVGVLA ATALQPGGRV RFLLRGVLAL LPAAAIALAL GALFWPWAAM EPGNVLTAMR AFSHFSFELD TVLAGRVMNV GEVPGHYLAA YLLVRLPELF LAGLALALLL GVRAVPALAG EQALRAALPW LPVVLAALFP LVYTLLAAPP LYNGLRHFSF VLPPLAVLAG MGLVRAWHGL SVRPPLLRRA VLGACALAVL GQLGQLARLH PYEYLAYNRL AGGVQGAVGR WEQDYWASSL REAVHALNAL VAREGGAGRH YSVAVCAEPL QAQVWLAPGL RATRDWWGAD FYLSPTHMGC DEAMRGRVVA QVERAGLVLA VVKDRRALVG EERRPR
|
| |