Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_1472 |
Symbol | |
ID | 9155622 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 1538394 |
End bp | 1541516 |
Gene Length | 3123 bp |
Protein Length | 1040 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | ribonuclease, Rne/Rng family |
Protein accession | YP_003646438 |
Protein GI | 296139195 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.800895 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCCGACA CACCGTCGCC GGAAGAACAG AACAACGAAG CTCGGGAGGA GTTCCCGCAG AAGTTGCGCG TGCACGCGCT CGCTCGTCTG TTGGGACTGA CCAGCAAGGA GGTGCTCGCG CACCTCGGTG ACCTCGGTTT CGTCGCGCGC AGCGCGCACT CGAGCATCGA TCGCAGCGCC GCCGAGCGGG TGCGCGACCG GATCGCCGAA CTCGCGGCCG CCCCCGATGG CGCCGCCACC CCCGAGGCGC CCGCGGAGAC CGAGGCCTCG CACGGGTCAC CAGCCGAGAC TCCGGCCCCC GCCGACGAGA CTCCGTCGTT GTTCTCCGCG CTCGCCGCCC CGGCCCCAGC CGCCGAGTCC GCGACCACGC AGGCAGTCGA CACCACGGTT CCGCTGTTCC TGCAGCCCGA GGCCGACGCC GCGCCGCGTC GCCGCACCCG CAGCCGGGCC AAGGCCGAGC CGAAGAACGA TGAGGTCACC GAGGCGTCCG ATCAGGCCGC TTCCGAGGCG CAGCCGGCCG CGACGGGCGA ACAGGACGCC GCGGATGCCG ACGCCGCTAA CGGGGCCGAC GCAACCGAGG CGCAGAGCGA TGCCCCGTCG GGCGAGAGCA CCGACGATGA CAACGGCGGC AACCGCCGAC GTCGCCGCGG CCGCCGCGGA CGCGGCCGGG GTCGGGGCGA GAACGCCGAT GAGCAGGACT CCGCGAACGA CGAGGATGCC GCCGACGCCA CGCCCGAGAA GGCGGATCAG CCCCCCGCGC CGGAGAAGGC CGACGGTGAG GACTCGACCG CCGAGAACAA GGACGACGAG TCCGAGGGTG ACGACGTCGA GGATGTCACC GACGGCAGCT CGCGTCGCCG TCGCCGTCGC CGTCGCCGCC GCGGTGGGGG AGACGACGCC GACAGCGCCG CCAGCGACGA TCCGCCGAAC ACCGTGGTGC ACGAGCGCGA ACCCCGGCAG AAGTCACGGC GCGACGAGGT GCGCGGCATC AGCGGTTCCA CGCGCCTGGA GGCCAAGCGA CAGCGCCGCC GCGACGGCCG CGATACCGTG CGCCGCCGCC CGCCGATCCT CACCGAATCC GAGTTCCTGG CTCGCCGCGA GGCCGTCGAC CGCGTGATGG TGGTGCGCGA ACGCACCAAG GTCGGGCCGT CGGAAGGCCA GGACGGCCAC ACCGTGCCGC ACCCGCAGGA CTACACGCAG GTCGCCGTGC TCGAAGACGG TGTGCTCGTC GAGCACTTTG TCACCTCGTC CAGTTCGGCG TCGATGGTGG GCAACATCTA CCTCGGCCGC GTGCAGAACG TGCTGCCCTC GATGGAGGCG GCCTTCGTCG ACATCGGCCG TGGCCGCAAC GGCGTCCTGT ACGCCGGCGA GGTGAACTGG GACGCTGCCG GACTCGATGG CAACGCCCGC AAGATCGAGC AGGCGCTCAA GCCCGGCGAC CAGGTTCTCG TTCAGGTCTC CAAGGATCCG GTGGGCCACA AGGGCGCCCG CCTGACCACG CAGATCTCGC TGGCCGGGCG CTTCCTGGTG TACGTGCCCG GTGGCGGCTC CGCGGGTATC TCCCGCAAGC TCCCCGACAC CGAGCGCAAG CGCCTGAAGG AGATCCTCAA GGAGATCGTC CCGGCCGATG CGGGCGTGAT CATTCGCACC GCGTCGGAGG GCGTGAGCGC CGAGGAGCTG GCGGGTGATG TCTCGCGGTT GCAGGCGCAG TGGGCCGAGA TCGAGGAGGC CTCCAAGGCG AAGGGGGTGC GCGCGCTCTA CGAGGAGCCC GACCTCCTGG TCAAGGTGGT GCGCGACCTG TTCAACGAGG ACTTCAGCAA GCTCGTCATC GAGGGCGGCA CCGCCTGGGG CACCGTGGAG AAGTACGTCT CCACGGTCGC TCCCGACCTG ATGCCCCGCG TCGAGCGGTT CGAGAAGCGG CACGCCGACG CCCCCGATGT CTTCGCGGCC TACCGGATCG ACGAGCAACT GGCCAAGGCC CTCGACCGCA AGGTGTGGCT GCCCTCGGGC GGCACCCTGG TGATCGACCG CACCGAGGCC ATGACCGTGG TCGACGTGAA CACCGGCAAG TTCACCGGCT CCGGCGGCAA CCTGGAGGAG ACGGTCACCC GTAACAACCT CGAGGCGGCC GAGGAGATCG TGCGGCAGAT GCGCCTGCGC GACATCGGCG GCATGATCGT CGTCGACTTC ATCGATATGG TCCTGGAGTC GAACCGCGAC CTGGTGTTGC GGCGCCTGAC CGAGGCCCTG GGCCGCGATC GCACTCGCCA TCAGGTCTCC GAGGTCACCT CGCTGGGCTT GGTGCAGATG ACCCGCAAGC GGATCGGCAC CGGCCTCGTC GAGGCCTTCT CCACGCCGTG CCAGGCCTGC TCGGGCCGCG GCATCATCAT CCACGCGGAT CCGGTCGAGA CCGCGGGTGG CGACGACTCC GGTCGTTCGG GCGAGAAGTC CGAGGGCAGC CGCAAGAAGC GCAAGCGTTC GAAGTCCGAC GGTGCGCAGC AGCCCGTCGT CGCGCCGAAG GACGACAAGG CGGCCCATAA GAGCGAGCAC CCGATGTTCA AGGCGATGGC GCAGCATCAC GACGACGAGG ACTCCACTCC CGTCGACGGT GCGCAGGACG GCGAGGCGGT GGCCGAGGCG CCCGCCGACG CGGGGAAGCC GGCGGAGCAG AGCACCGAGC CCACGCAGGA ACCCAAGCGC GAGCGTCGTC GTCGGCGTGA GCCGAAGCAG GACGCGCCGA GCCAGGCGGC GACGATCGAG TCGGCGCCGA CCGAGACCGC GCCGGCCCCG CAGGCCGCCG CCGAGTCGAC CCCGGCCGCA CCGGTGGCCG CTGAGCCGGC GGCTGCGGAA CCGGCCTCCG CCGTGGCGCC GTCGGCACCG CGTCGCCGCC GGGTCGCCCG CAAAGCGCCC ACCACCACGT CGGCGGCGGC ACAGACGATC GTCGTCGACC TGGCTCAGGA GGCCCCGGGT GCACCGGCGG TCACCGCTCC GGCGGCAGAC GGTGCGGGGG AGACCGCCGC CGAACCCGCC CGTAAGCGAG CGCGCCGCCG CGCGGCCGCC CGCCCGGCGG GCCCTGCGGC CGGTGGAAGT GACACTGCTG ACAAGACGGA CCAGCCGGTT TGA
|
Protein sequence | MADTPSPEEQ NNEAREEFPQ KLRVHALARL LGLTSKEVLA HLGDLGFVAR SAHSSIDRSA AERVRDRIAE LAAAPDGAAT PEAPAETEAS HGSPAETPAP ADETPSLFSA LAAPAPAAES ATTQAVDTTV PLFLQPEADA APRRRTRSRA KAEPKNDEVT EASDQAASEA QPAATGEQDA ADADAANGAD ATEAQSDAPS GESTDDDNGG NRRRRRGRRG RGRGRGENAD EQDSANDEDA ADATPEKADQ PPAPEKADGE DSTAENKDDE SEGDDVEDVT DGSSRRRRRR RRRRGGGDDA DSAASDDPPN TVVHEREPRQ KSRRDEVRGI SGSTRLEAKR QRRRDGRDTV RRRPPILTES EFLARREAVD RVMVVRERTK VGPSEGQDGH TVPHPQDYTQ VAVLEDGVLV EHFVTSSSSA SMVGNIYLGR VQNVLPSMEA AFVDIGRGRN GVLYAGEVNW DAAGLDGNAR KIEQALKPGD QVLVQVSKDP VGHKGARLTT QISLAGRFLV YVPGGGSAGI SRKLPDTERK RLKEILKEIV PADAGVIIRT ASEGVSAEEL AGDVSRLQAQ WAEIEEASKA KGVRALYEEP DLLVKVVRDL FNEDFSKLVI EGGTAWGTVE KYVSTVAPDL MPRVERFEKR HADAPDVFAA YRIDEQLAKA LDRKVWLPSG GTLVIDRTEA MTVVDVNTGK FTGSGGNLEE TVTRNNLEAA EEIVRQMRLR DIGGMIVVDF IDMVLESNRD LVLRRLTEAL GRDRTRHQVS EVTSLGLVQM TRKRIGTGLV EAFSTPCQAC SGRGIIIHAD PVETAGGDDS GRSGEKSEGS RKKRKRSKSD GAQQPVVAPK DDKAAHKSEH PMFKAMAQHH DDEDSTPVDG AQDGEAVAEA PADAGKPAEQ STEPTQEPKR ERRRRREPKQ DAPSQAATIE SAPTETAPAP QAAAESTPAA PVAAEPAAAE PASAVAPSAP RRRRVARKAP TTTSAAAQTI VVDLAQEAPG APAVTAPAAD GAGETAAEPA RKRARRRAAA RPAGPAAGGS DTADKTDQPV
|
| |