Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_3609 |
Symbol | |
ID | 9157788 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 3723433 |
End bp | 3724650 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | DNA-directed DNA polymerase |
Protein accession | YP_003648526 |
Protein GI | 296141283 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.448364 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACATCG AGCACCAGGA GCGGACGGAG GCCACGATCC TCCATGCCGA CCTGGACTCG TTCTACGCGT CGGTCGAACA GCGCGACCAC CCGTGGTTGC GCCGCCGTCC CGTCCTCGTC GGTTCCGGGG TGGTGCTTGC CGCGAGCTAC GAAGCGAAGA CCCGCGGCGT GCGCTCACCC ATGCCGCACC GGGAGGCTCT CGCACTGTGC CCCACCGCCA CGACCATGCC GCCCCGGTTC GACGCCTACA TGGAGGCCAG CCGATGCGTC TTCGAGATCT TCCGGGACAT CTCACCGCTG GTCGAAGGCC TGTCGGTCGA CGAAGCCTTC CTCGACGTCG GCGGGCTGCA CCGGATCGAC GGAACGCCGG CCCAGATCGG ACAGCGACTG CGCGAGCGGG TGCGCCGTGA AGTCGGGCTG CCGATCACCG TCGGCGTGGC GCGCAGCAAG TTCCTGGCCA AGGTGGCCAG TGCAGTCGGC AAGCCCGACG GACTGCTGGT GGTGGAGCCC GGCAGCGAAC TGCAATTCCT CCACCCCCTC CCGGTGCGCC GACTGTGGGG CGTCGGCCCG AAGACGGAGG CGCGGCTACA CGAGGCCGGC ATCACCACCG TCGGGCAGAT GGCGGCACTG GGCGAGCAGC GGCTCGCGAA ACTGCTCGGC CGCGGCACCG GCCGACACCT GTTCGCACTG TCCATGGCGC ACGACCCGCG GCGAATCGAC ACCGGTCGGC GCCGCAAGTC GATCGGCGCG CAGCGTGCCC TGGGCCGCAG GATCACGTCC GAGAACGAGA TCGAGGGCAC CGCTCTCGCC ATCATCGACC GACTCGGGGA GCGGCTGCGC GCCGCGGGCC GAGTCACCCG CACGGTGGAA CTGCGGCTGC GGTTCGACGA TTTCACCTCC ATCACCCGGT CGCGATCACT GCGCGAGCCC ACCGATCACA CCGAGGTCGT GCTGGCCACC GCGCGGGCGC TCCTCGACGA GGCGATGCCA CTGATCCGGG AGCGTGGATG CACGCTGATC GGCCTGTCGC TGGCGAACCT GGAGAATCAC GGCGCGGTCC AGCTCACTCT CCCTTTCGAC GGCGGCGAGC ACGATGCCGA CCTCGACCTC GCGATGGATG CGCTGCGCAC CAGATTCGGC CGCGCGGCCG TCACGCGCGG CTCGCTGCTG CACCGCCCGT TGGGCCCGGA CGCCCCGATG CTCACCGAGC AGGACTGA
|
Protein sequence | MHIEHQERTE ATILHADLDS FYASVEQRDH PWLRRRPVLV GSGVVLAASY EAKTRGVRSP MPHREALALC PTATTMPPRF DAYMEASRCV FEIFRDISPL VEGLSVDEAF LDVGGLHRID GTPAQIGQRL RERVRREVGL PITVGVARSK FLAKVASAVG KPDGLLVVEP GSELQFLHPL PVRRLWGVGP KTEARLHEAG ITTVGQMAAL GEQRLAKLLG RGTGRHLFAL SMAHDPRRID TGRRRKSIGA QRALGRRITS ENEIEGTALA IIDRLGERLR AAGRVTRTVE LRLRFDDFTS ITRSRSLREP TDHTEVVLAT ARALLDEAMP LIRERGCTLI GLSLANLENH GAVQLTLPFD GGEHDADLDL AMDALRTRFG RAAVTRGSLL HRPLGPDAPM LTEQD
|
| |