Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_1939 |
Symbol | |
ID | 9156094 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 2023155 |
End bp | 2024402 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | glycosyl transferase family 2 |
Protein accession | YP_003646891 |
Protein GI | 296139648 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.314426 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAACCG AGACCGCACC CGACGGTACG CGCTTGGCCG CGGCCGAATC CCGACATGCG GCATCGCCCG CCGCCCCCGT CCTCGACATC GTGATCCCCG TGTACAACGA GGCGCACACC ATCGCCCACT GCGTGGAGAC CTTGCACGCC TACCTCACCG ACACCCTGCG GGTCCCTGCG CGCATCACCA TCGCCGACAA TGCGAGCACC GACGAAACCC TGCGCGTGTC CCACTCCCTG GCGAGCGCCA TCGACGGAGT CCGCGTGGTG CACCTGGACG CGAAAGGCCG TGGCCGGGCG CTGCGCCGAG TGTGGTCGGA GTCCGATGCG CAGGTGCTCG TGTACATGGA CGTCGACCTG TCCACTGACC TCAACGCCCT GCTCCCGTTG GTCGCTCCCC TCATCTCCGG ACACAGCGAC CTCGCGATCG GCACCCGGCT GGGCCGTGGT GCCCGAGTGC GACGGGGCCC CAAGCGGGAA TTCATCTCCC GCGGCTACAA CGTGCTGTTG CACACCGCGC TGCGCGTGCG CTTCTCCGAC GCCCAGTGCG GATTCAAGGC GATCCGCACC GACGTCGCGC GGGAGTTGCT ACCCCTGGTG GAGGACGGTG AATGGTTCTT CGACACCGAA CTACTGGTGC TGGCGGAGCG CGCCGGACTG CGCATCCACG AGGTCCCGGT CGATTGGACC GACGATCCGG ACAGCCGGGT CGACATCGTC GATACCGTGG CCAAAGATCT GCGAGGCATG GCCCGTGTGG GTCGGGCGCT GGCGGCCGGG CGGCTGCCAC TCGACGACGT ACGTCGTGCG GTCGGACGCG ACGAACCCCG GATCGCCGGC GTGCCGCACG GGATGATCGG TCAGCTCGCC CGGTTCGCGG TCGTCGGTTT GGCGAGCACG GTGGCCTACG CAGTGCTGTA TCTGGCGCTG CACTCGGCGA TCGGCGCACA GGCCGCGAAC TTCGCAGCGC TCCTCATCAC CGCCGTGGGC AACATCGCCG CGAACCGCGC ATTCACCTTC GGTGTGCGAG GCCGGCGCGG CGCGATGCGG CACCACACGC AGGGCCTGGT GGTCTTCCTC GTCACGTGGG CACTCACCGC CGGAAGTCTG GCACTGCTGG CGACGGCGGC GCCCGCAGCA TCCCGGGAGC TGCAGCTCGC GGTGCTGGTG ATCGCGAATC TGGTGGCGAC GGTGCTGCGC TTCGTGGGCA TGCGGCTGAT CTTCCGGTCC CCCGGGGCCG CCCCGTGA
|
Protein sequence | MTTETAPDGT RLAAAESRHA ASPAAPVLDI VIPVYNEAHT IAHCVETLHA YLTDTLRVPA RITIADNAST DETLRVSHSL ASAIDGVRVV HLDAKGRGRA LRRVWSESDA QVLVYMDVDL STDLNALLPL VAPLISGHSD LAIGTRLGRG ARVRRGPKRE FISRGYNVLL HTALRVRFSD AQCGFKAIRT DVARELLPLV EDGEWFFDTE LLVLAERAGL RIHEVPVDWT DDPDSRVDIV DTVAKDLRGM ARVGRALAAG RLPLDDVRRA VGRDEPRIAG VPHGMIGQLA RFAVVGLAST VAYAVLYLAL HSAIGAQAAN FAALLITAVG NIAANRAFTF GVRGRRGAMR HHTQGLVVFL VTWALTAGSL ALLATAAPAA SRELQLAVLV IANLVATVLR FVGMRLIFRS PGAAP
|
| |