Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_4334 |
Symbol | |
ID | 9158516 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014159 |
Strand | - |
Start bp | 97524 |
End bp | 98861 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003649234 |
Protein GI | 296141992 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.163253 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAGCA AGAATTTAGG TCCGGCCGAA CACGCTTCAA GCGATCCTGA AACTCTTGTT TTGACTCCCC AGTGGTCGGC AGCACAGAAG ATCTGTTTCC GACTGCTATT CACTCTGGGA TCTGGGCTGC TTATTCTTGT AATTTACAGT AACGCTGGCC TTGCACCAAT ACTGCGCATT ACCGGAGTGC AGTGGGTTGT TTCCCAGATT GGCAGTTACG TTAGCCGTGG AGACTCCGTC GCTATCGGTA CGGGCAGCGA CCAGGCATGG CAGTGGTACA TGCTCTTAGG CTGGGCGATC CTCGCTGTCA TCATCACAGC GGCCTGGACA GCACTGGACC GTTCCCATCA GAACTATCGC GGCCTCGCTG GTCTGCTCCA TGTATTCACC CGCGTAGGGC TTGCACTATC TTTGATCATC TACGGGCTCG CTAAGGTTCT TCCGACGCAG ATGGGGTACA TGGCCTTGCC TGCATATCAG CTACAACTCG TCGGCGATAC TAGCTTGTTC CACACGCTGT GGGGTTTCAT GGGAGCCTCG ACGCCCTACT CGGTAGCTGT TGGCCTAGTC GAGCTGGTCT CTGGCATACT CTTGCTGTGC CGTCGCACAT GGCTCTTCGG GGCCACCCTG GCCATCATCG CAACTGTCCA AGTTCTACTA CTAAACCTTT TCTATGACGT ACCGGTAAAG ATCGTGGCCA GTGCTTTGAC CATTGCAGCA TTTGTAGTCC TTGCCCCCGG TTGGCGAAAT CTCTGGTACG CACTAACGAA CCAGCCGGGC GTCCCGCCAC TGACGCTTTG GCCAGCTAGC GGTTCAGGCC GCGCTGGCAT ATCGGTCGTT GGAACAATAG TGAAATGGCT GGCAGCGAGC CTTATTATAA TCAACCACGG CACCGCTGGC GCAATCGGAC TATACATTCT GCACACTCCT CGGAGCGACT TAGATGGAGT ATGGAGCGCC AACGAGTTCA CAATTAATGG TTCACCTGCA CGCGTCGAAG ATCGCCCTTG GACGAATATG TCTATCACGC TGCGCGGCAG CGACACCAAC CCCGCTTTAG CTCCCGCGAG CAAACGCTAC GATACCCTCG TTTCCCAGGA TTCCACTGGT CATACTGTCG CCTGGCGGCT TGAGCAGACA GGAGTCGAAT TAAACCTCCG AGCCGGAGGC AATGGGCCAC GCGTAAAGAT AACGACAAAT CAAGTCAGTC ATGACTTGCT GTATATTTCA GGCACCATCG GCGGCAATAA GATCGAAGGA AAATATGTCC GGCGTGCGAT GCAACGAGAG AACTCCATAA GGCTGGTCCA GCCAGATGCG CAAAATTCTT CGCGCTGA
|
Protein sequence | MNSKNLGPAE HASSDPETLV LTPQWSAAQK ICFRLLFTLG SGLLILVIYS NAGLAPILRI TGVQWVVSQI GSYVSRGDSV AIGTGSDQAW QWYMLLGWAI LAVIITAAWT ALDRSHQNYR GLAGLLHVFT RVGLALSLII YGLAKVLPTQ MGYMALPAYQ LQLVGDTSLF HTLWGFMGAS TPYSVAVGLV ELVSGILLLC RRTWLFGATL AIIATVQVLL LNLFYDVPVK IVASALTIAA FVVLAPGWRN LWYALTNQPG VPPLTLWPAS GSGRAGISVV GTIVKWLAAS LIIINHGTAG AIGLYILHTP RSDLDGVWSA NEFTINGSPA RVEDRPWTNM SITLRGSDTN PALAPASKRY DTLVSQDSTG HTVAWRLEQT GVELNLRAGG NGPRVKITTN QVSHDLLYIS GTIGGNKIEG KYVRRAMQRE NSIRLVQPDA QNSSR
|
| |