Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_3415 |
Symbol | |
ID | 9157590 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 3504839 |
End bp | 3506083 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003648338 |
Protein GI | 296141095 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.497108 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGCGG CCACCGTCCC GGCGACGATG CTCGCCGCGC GAGCCGCGGC CGACCCGGGG TCGGGTGCGG GCTCGGGTGG AACGGTACCG GGGACCCCGG GAGCGGTGCC GTCGGTTCCC GCGACCCGGT GGACTCCCGA GCGGGCGCAG CAGTGGCGCG AGCAGGCCGG CTGGATGGTG GGCTGCAACT TCATCAATGC GAACGCGGGC AACCAGTTCG AGATGTTCCA GGCGCAGACC TTCGATACCA ACCGGATCAA CACCGAGCTG GCGTGGGCTC GCGGCCTCGG GATGTCGGTG ATCCGGGTTT TCCTGCAGGA CCAGTTGTGG ACCGCTGACC CCGCCGGGTT CACGCAGCGC CTGGACACCT TCCTGTCGAT CGCCTCGGCC AATGGCATCC GCACCATGTT CGTGCTGTTC GACTCCTGTT GGGACCCGAA CCCCAAGCCC GGGGTGCAGC GCGAGCCCAC ACCCGGCGTG CACAACTCGA CCTGGGTGCA GAGCCCGGGG GCCGCCGGCC TCACCAACGC CGACACCAGT GCCCTGCAGG CCTATGCCAC CGGTGTGGTC AAGGCCTTCG CGAACGACCC GCGCGTGGTC GCCTGGGACG TGTGGAACGA GCCGGAGAAC CTCGCGGACT CCTACCCGCT GAGCCCGCCT GACAAGGTGG CCCGCGTGGC GCAGTTGCTG CCCAAGGCGT TCGAGTGGGC CCGTGCCGGG AACCCGTCGC AGCCACTCAC CTCCGGCGTG TGGGCCGACA CCCGGCCCGA GATCCGCACG ATCCAGCTGG AGCAGTCCGA TGTGATCAGC TTCCACAGCT ACGATCCGCC GGAGAAGTTC CGCTCGATGG CCGCAGACCT CGCCAAGGAG GGGCGCCCGC TGCTGCTCAC CGAGTACATG GCCCGCGCGC AGGGCAGCAC CATCGAGACC ATCCTGCCGA TCTGCAAGGA ACTGAAGATC GACGCGATGC AGTGGGGCTT CGTGGCCGGC CGCAGCCAGA CCTACTATCC GTGGGACTCG TGGAAGCAGC CTTATGTGGG TGCCCGCCAG CCGCGCGAGT GGTTCCACGA CATCCTCTGG CCCGACGGTA GGCCGTACCG TGACTCCGAG GTCGCGACGA TCCGGCAGCT CACTGCGGGC ACCACTGCGC CGGCTCCCGC AACCCAGGCG CCCGCGCAGC CCGCGCCACC GCAGCAGCAG GCCGCCGCCC CGCAGGCGCC ACAGCAGCCG CCCACGGCGC AGTAG
|
Protein sequence | MAAATVPATM LAARAAADPG SGAGSGGTVP GTPGAVPSVP ATRWTPERAQ QWREQAGWMV GCNFINANAG NQFEMFQAQT FDTNRINTEL AWARGLGMSV IRVFLQDQLW TADPAGFTQR LDTFLSIASA NGIRTMFVLF DSCWDPNPKP GVQREPTPGV HNSTWVQSPG AAGLTNADTS ALQAYATGVV KAFANDPRVV AWDVWNEPEN LADSYPLSPP DKVARVAQLL PKAFEWARAG NPSQPLTSGV WADTRPEIRT IQLEQSDVIS FHSYDPPEKF RSMAADLAKE GRPLLLTEYM ARAQGSTIET ILPICKELKI DAMQWGFVAG RSQTYYPWDS WKQPYVGARQ PREWFHDILW PDGRPYRDSE VATIRQLTAG TTAPAPATQA PAQPAPPQQQ AAAPQAPQQP PTAQ
|
| |