Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_2868 |
Symbol | |
ID | 9157036 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 2975502 |
End bp | 2976620 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003647805 |
Protein GI | 296140562 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGAGG GGTCCCGGAC GGCGCGGCGC ACCGCGCTGG CCGCGTCGGT CGCCGCCGTG CTGTTGATCT CGGCGTGCGC CGATTTCAGC GGCAGCGAGT CGGCGCCCTT CACCGGACCG CCCACCGGCG GAGCGCCGAC GACCACACCG CCGCAGTTGC CCACCAGTGC GGCTCCGAAA CCACCCGGAC CGTGCATCGA CCAGGATCCG CTGGTGCTCG CCACCTGCCT CACCGACCCC ACGTCGCTGA TCACCACTCC GGACCTCGAG CACGCCTACG TCGCCGAACG CGGCGGCACC GTCCAGTACA CGACCACCGA TCAGGCCAGC CGGGAGGTGC TGCGGATCGG CGTGGACACG GCCGGCGACG GCGGCCTCAC CTCGATCGCA CTCTCCCCCA CCTACGACCA GGACCGGCTG TACTACGCCT ACGTGAGCAC TCCCACCGAC AATCGGGTGG TGCGGGTGAC ACCGGGTGAC GAGCCGAAGG TGATCCTCGC CGGCATACCC AAGGGGCCGC TGGGCAATGC GGGATCACTG CTGTTCGTGG GCCGTGATCT CATCGTCGCC ACCGGTAACG CCGGGGATCA GGCCGCCGCA CGGGATCCGC GGTCCTTGGC GGGCAAGGTA TTGCTGCTGC CCTCACCCGG CACCGTCACA CCGACCGTGC CGGAGGTACT GGCGACCGAC GGCGGAATAC GCGCCTCACT GTGCCAGGCC GGGCCGAAGG GACCGGTGTT CGTCGCCGAC CAAGGCGCCA CGGAGGACCG CCTGCGGGTG GTGACCCCCG GGTCGCCCAC CGGCGTCGCT TGGACCTGGC CCGACCGTCC GGGGATCGCC GGGTGCGCCG TGGTCGACGG TGGTGTGGTG GTCAGTCACA GCCGGGCCGG CCGCGTGGAA TTCGTTGTGC TGCCCAAGGG TTCGACGACC GCCGATAAGG AACCGCTGCC GATGCTCGAT CGCAAGCGGT ACGGCGTCTT CGGTCGGCTC GCGACCGGGC CGAAGGGGCT GCCGCAGGGC GTGACCACGA ACAAGGCCAC CGGCCCCGTC GCCCCGACGG ACGATCGCGT CGTGCTGCTC CCCCTCCCGG GTGGGGACGC CGCCTCCGGC GAGGATTAG
|
Protein sequence | MSEGSRTARR TALAASVAAV LLISACADFS GSESAPFTGP PTGGAPTTTP PQLPTSAAPK PPGPCIDQDP LVLATCLTDP TSLITTPDLE HAYVAERGGT VQYTTTDQAS REVLRIGVDT AGDGGLTSIA LSPTYDQDRL YYAYVSTPTD NRVVRVTPGD EPKVILAGIP KGPLGNAGSL LFVGRDLIVA TGNAGDQAAA RDPRSLAGKV LLLPSPGTVT PTVPEVLATD GGIRASLCQA GPKGPVFVAD QGATEDRLRV VTPGSPTGVA WTWPDRPGIA GCAVVDGGVV VSHSRAGRVE FVVLPKGSTT ADKEPLPMLD RKRYGVFGRL ATGPKGLPQG VTTNKATGPV APTDDRVVLL PLPGGDAASG ED
|
| |