Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_0162 |
Symbol | |
ID | 9154296 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 165933 |
End bp | 167888 |
Gene Length | 1956 bp |
Protein Length | 651 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | putative galactofuranosyltransferase |
Protein accession | YP_003645155 |
Protein GI | 296137912 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGTTCCA CCGATACTCA GGCGAAAACC CTTCTGCAGC GCGTCATCCT GCCGCGCCCG GGTGAGCCGC TCGACGTGCG CTCGCTGTAT CTCGAAGAGG CCGAGACCAA CTCACGTCGC TCGCACGCGC CCACGCGCAC GTCGCTGAAC ATCGCCGGAG AGTCCGAGGT CAGCTTCGCC ACCTACTTCA ACGCCTTCCC GGCGTCGTAT TGGCGCCGCT GGACGGTCCT GAAAGACGTG GTGCTGCGGA TCGAGCTCAA GGGCACGGCC CGGGTCGATC TGTACCGCTC CAAGGTCGAC GGTGCGCGGA TCGCTCTGGG CGGCAACCTC GTCGAGACCG ACGCCACCGG CTACGGCGTC GCCGAATTCG CCACCGATCT CGGCCCGTTC GAGGACGGCG GCTGGATCTG GTTCGACGTG ACCGCCGATT CCGATACCGA GATCATCTCC GCAGGCTGGT ACGCCACCGT CTCCGAGGAG GGCCTGCCGG ACAAGCGCGT CACCGTCGGC ATCCCCACCT TCAACCGGCC CGACGACGCT GTGGCCGCGA TCCGCGCGCT CACCAGCGAC CCGCTGGTCG ACGAGGTGAT CGACGCCGTC CTCATGCCCG ATCAGGGCAA CAAGAAGGTG ATCGACCACC CGGATTACGC CGACGCCATC GCTCCCCTCG GTGACCGGTA CCAGCGGTTC GAGCAGGGCA ACCTCGGCGG CTCCGGCGGC TACGGCCGCA TCATGTACGA GGCGCTGCGG CTCACCGACA GCCCCTACGT GCTGTACATG GACGACGACA TCGCCATCGA GCCGGATTCG ATCCTGCGCG CACTGCAATT CGCCCGCTTC GCCAAAACCC CGATGCTCGT CGGCGGCCAG ATGCTGAACC TGCAGGACCG CAGCCACCTG CATTGCATGG GCGAGGTCAT CGACCGGGGC GCCTTCATGT GGACCGCCGC CCCCTTCGTC GAATACGACC ACGACTTCTC GAAGTACCCG CTCTCCGACA AGGAGAACAG CAAGAACCTG CACCGGCGCA TCGATGTCGA CGGCAACGGC TGGTGGATGT GCCTCATCCC GCGCGTCGCC GCCGAAGAGA TCGGTCTGCC GATGCCGCTG TTCATCAAGT GGGACGACTG GGACTACGGC CTGCGGGCCG CCGAACACGG CTACCCCACG GCGACGGTGC CCGGCATCGC GATCTGGCAC ATGGCCTGGT CCGATAAGGA TGACGCGATC GATTGGCAGG CCTACTTCCA TCTGCGCAAC CGACTGGTGG TCGCGGCGCT GCACCACGAG GGCAGCACCC GCGGCATCAT GTCCAGCTCG ATCAAGGCGC TCATGAAGCA CCTGCTGTGC CTGGAGTACT CGACGGTCGC GATTCAGATC GAGGCCATGC GCGACTTCCT GCGCGGCCCC GAGGCGTTGT ACGAGCTGCT TCCGACGGCT CTGCCCAAGG TCGCGGCCAT GCGTAAGGAG TACCCGGATG CCGTGGTGCT CCCCAGCGCC ACCGAGCTTC CGCGCACCAC CGGCGCCGCG ACCGCGCTCG GCACGAAGAT CCCGCTGAAC CCGATCACCA AGGTCAAGAC CCTCGCAGCG GCGGTGCGCA ACAACCTGCG CCCTGCCGAT CCGCACAACC ACGAGGTGCC GCAGGCCAAC TACCCGCCGC TCGAGGCGCG GTGGTTCAGC TTGGGCCGGG TGGACGGCGT CACCGTCACC ACGGCCGATG GTCGCGGCGT GGTCTATCGC CAGCGCGATC GGGAGAAGAT GTTCGCGCTG ATGCGCGAGA GCCTCACGGT GCACCGCGAG GTCGATCGCC GCTTCGAGGA GATGAAGCAG CGCTACCGGG CCGCGTACGG CGACCTGACC AGCCGCGAGG CGTGGTCGAA GATCTTCGAG CCCACGACGC CGGGAGCGGA GCGAGCGGGG CGAATCGAAC CCGAGAACGC GGGGGAGCAG AAGTGA
|
Protein sequence | MSSTDTQAKT LLQRVILPRP GEPLDVRSLY LEEAETNSRR SHAPTRTSLN IAGESEVSFA TYFNAFPASY WRRWTVLKDV VLRIELKGTA RVDLYRSKVD GARIALGGNL VETDATGYGV AEFATDLGPF EDGGWIWFDV TADSDTEIIS AGWYATVSEE GLPDKRVTVG IPTFNRPDDA VAAIRALTSD PLVDEVIDAV LMPDQGNKKV IDHPDYADAI APLGDRYQRF EQGNLGGSGG YGRIMYEALR LTDSPYVLYM DDDIAIEPDS ILRALQFARF AKTPMLVGGQ MLNLQDRSHL HCMGEVIDRG AFMWTAAPFV EYDHDFSKYP LSDKENSKNL HRRIDVDGNG WWMCLIPRVA AEEIGLPMPL FIKWDDWDYG LRAAEHGYPT ATVPGIAIWH MAWSDKDDAI DWQAYFHLRN RLVVAALHHE GSTRGIMSSS IKALMKHLLC LEYSTVAIQI EAMRDFLRGP EALYELLPTA LPKVAAMRKE YPDAVVLPSA TELPRTTGAA TALGTKIPLN PITKVKTLAA AVRNNLRPAD PHNHEVPQAN YPPLEARWFS LGRVDGVTVT TADGRGVVYR QRDREKMFAL MRESLTVHRE VDRRFEEMKQ RYRAAYGDLT SREAWSKIFE PTTPGAERAG RIEPENAGEQ K
|
| |