Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_4212 |
Symbol | |
ID | 9158400 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 4342914 |
End bp | 4344014 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | |
Product | solute-binding protein |
Protein accession | YP_003649119 |
Protein GI | 296141876 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.850897 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACACTCC GATATTCGCG CACCCTCCGG GTTTGTATCA CCGGCCTGGC GGTGGCCCTC GGGGCCGCGT CGCTGACGGC GTGCAATGCC AACAGCTCGC AGAAGAGCGG TGACGCGGGC GGTGGCAACA CCATCGCACT GCTGCTCCCG GAGTCGAAGA CCACGCGCTA CGAGGCTTTC GACAAGCCGC TCTTCGAGAA GAAGGTTGCC GAGCTGTGCA GCAGCTGCAA GGTCAGCTAT TACAACGCCG ACCAGGACGA GGCCAAGCAG TCGCAGCAGG TCGAGACCGC GATCACCGCG GGCGCGAAGG CGATTGTGCT GGGCCCCGTC AACGGCGACG GCGCGGGCGG CATGGTCGCT TCGGCGAAGA ACGCCAAGAT CCCGGTGATC GCCTACGACC GATTCATCGC CGGTGCCGAC TACTACCTGT CCTTCGACAA CGAGCAGGTG GGCCGGATTC AAGGCAAGTC ACTCGTCGAT GCCCTCGGCG GCCGGGGCAA CATCCTCATG CTCAACGGTG CACCGTCGGA TCCCAATGCC GCGCAATTCA AAGCGGGTGC ACGCAGTGTG ATCGGGCAGA GCGGGATCAG GGTGATCGGC GAGTACGACA ACCCGGACTG GAGCCCGGAC AAGGCGCAGC AGTTCGTCAC CGATCAGCTC AGCAGGAACA AGCCATCGGA TATCCAGGGC GTGTACGCCG CGAACGACGG GCAGGCGGGC GGTGTGGTCG CGGCACTCAC CGGTGCCGGC GTGAGCGCCG ACGCGCTGCC GCCGATCACG GGTCAGGACG CCGAATTGGC CGCGGTGCAG CGGATCGTCG CCGGCCGCCA GTCCATGACG GTGTACAAGC CCATCGCCAT CGAGGCGAAC ACCGCGGCGG AGGTGGCTGT CGCATTGGCG ACCGGTAAGG ACGTGCCGAC CACGACTGCC TCGGGGATCC AGCGGTCCAC GTTCAAGGAC GTATCCTCGT ACATTTTCAC CCCGATCGCG GTGACCAAGG CCAATGTCGC GAGCACCGTC GTGGCGGACA AGTTCTACAC CGCCGACCAG ATCTGCACGC CTGAGTACAA GGACGCGTGC GCGAAGGCGG GCATCAACTG A
|
Protein sequence | MTLRYSRTLR VCITGLAVAL GAASLTACNA NSSQKSGDAG GGNTIALLLP ESKTTRYEAF DKPLFEKKVA ELCSSCKVSY YNADQDEAKQ SQQVETAITA GAKAIVLGPV NGDGAGGMVA SAKNAKIPVI AYDRFIAGAD YYLSFDNEQV GRIQGKSLVD ALGGRGNILM LNGAPSDPNA AQFKAGARSV IGQSGIRVIG EYDNPDWSPD KAQQFVTDQL SRNKPSDIQG VYAANDGQAG GVVAALTGAG VSADALPPIT GQDAELAAVQ RIVAGRQSMT VYKPIAIEAN TAAEVAVALA TGKDVPTTTA SGIQRSTFKD VSSYIFTPIA VTKANVASTV VADKFYTADQ ICTPEYKDAC AKAGIN
|
| |