Gene Information |
Locus tag | Tpau_3881 |
Symbol | |
ID | 9158062 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 4005288 |
End bp | 4007675 |
Gene Length | 2388 bp |
Protein Length | 795 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | Protein of unknown function DUF1998 |
Protein accession | YP_003648793 |
Protein GI | 296141550 |
COG category | |
COG ID | |
TIGRFAM ID | |
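The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes (the record does not state this explicitly) that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4005288, 4007675)
print(length, protein_length_aa(length))  # 2388 795
```

Under these assumptions the computed values match the Gene Length (2388 bp) and Protein Length (795 aa) fields in the record.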
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.911773 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGATTCAAG TCCCGGAAAA CGAACATTCC GAGGCTATCG GCAGAGCTTC AGCGTCGGGG GCAAACGCAT TGGAACACAA CACCTTCGGG CGCGAACTGC TGCATCGGAT CCAAGCCGGA ACAGGCCCCG CAGGCGATGG ACTGACCCAT GTGGCCGATA TCCCCTCACG ACGAGCCGAA TTCGCGCAGT GGCCACAGTG GCTGCCGTCG AACATCCGGG ACGGGTTCAT CGAGTCGGGC GTCGAGCGGC CCTGGCGGCA CCAGATCGAG GCCGCCGAGC ACGCGCACGC CGGGCGACAC GTGGTGATCT CCACAGGTAC AGCGTCAGGA AAGTCGCTCG CCTATCAGCT TCCCGTACTC GCCGGCCTGG CTACGGACCC GCGCGCCACC GTGCTCTACC TCTCGCCCAC CAAAGCTCTG GGCACCGATC AGCACGCCGC CGCGCTACGC CTGACGTCGA TGTTCGACGG CCTGGGCGAC GTCTCGCCCG CGATGTACGA CGGTGACACC TCGCAGGAGA TGCGCCGCTG GGCACGTTCG GATAGTCGTT GGGTGTTCAC CAATCCGGAT ATGATCCACG TGGGAATGCT TCCGCGGCAT GCCAAGTGGG CGCGCTTCCT CCGTGGCTTG AGATACGTGG TGGTGGACGA GTGTCACCAC TATCGAGGTG TGTTCGGATC GCACACCGCG TTGGTGCTGC GCCGCCTGTT GCGGGTGGCG GCGAAATACG GCGCCGAACC GACGGTGATC TGTGCTTCGG CGACCACGTC CGATCCGGCT GGCGCAGCAT CGCGGCTCAT CGGCTCCGAC TGCGTTGCCG TGGAGACCGA TTCGTCGCCG CATGGACCGC GCACAGTGGT GTTGTGGGAG CCGCCACTGA TCCCGGATCT GGAAGGCGAG AACGGCGCCC CGGTACGCCG GCAAGCGACC ACCGAGGCGG CGCGGATGAT GGCAGATCTG GTCGTGGAGG GCGCTCGGAC GCTGGCTTTC GTCCGGTCCC GACGGTCCGC CGAGACCGTC GCACTCTCCA CCCGGCGCAT GCTCGCCGAG GCCACGCCGG AGCTGGCCTC TCGGGTGGCG GCGTATCGCG CGGGCTACCT CGCGGAGGAC CGTCGTGCGC TGGAACGAGG CCTGAACGAT GGTGAGCTGT TGGCGGTTGC GACCACGAAC GCGCTGGAGC TGGGTGTCGA TATCGCGGGC CTCGATGCCG TGCTCATGGC CGGATTCCCC GGTACGGTCG CCTCGTTCTG GCAGCAGGCC GGGCGCAGTG GACGACGCGG ACAGGGGTCG CTGATCCTGC TGATCGCCCG CGACGATCCC CTGGACACCT ACCTGGTGCA CCACCCCGAG TCGCTGCTCG GGCGACCGGT GGAGGCCACC ATCACCGATC CGTGGAACCC GTATGTGCTG GGGCCGCAGT TGCTCTGCGC CGCCGGCGAA CTCCCGCTGA CGCGCGAGGA GGTGACCGCG CTGGGTGCGG TCGACGTGGT CGGTCGCCTC ACCGCCGACG GACTGCTGCG CGAGCGCCCC GCCGGGTACT TCCTGGCCGC CGGTATCGAT CCCCATGCAC GCGTGAATAT CCGGGGCGGA GCGGGCAGCG AGGTGCTGAT CGTGGAGGAG CTCACGGGCC GGCTGCTGGG AACGGTGGAT TTCAATCGCG CGCTGTCGAC GGTGTACGAG GGCGCCGTGC ACGTGCACCA GGGTGAGTCG TACGTGGTGG ACGAGCTCGA CCTCGATGAG GGGTTGGCGA TGGTGCACGC GGAGGAGCCC GAGTGGACGA CCTCAGCACG CGAAGACTCC 
GATGTCCGGG TCACCGGCGT GCACCAGACC GAGGAGCTGG GCGCGGTGAC GGCGCGGTTC GTCTCGGTCG AGGTGACCAG TCAGGTGGTC GGATACCTCC GCACGCTGCG CACCGGTGAG GTGCTCGATG CGGTGGAACT GGACCTGCCG GAGACGTCAC TCGCCACCCA GGCGGCGTTG ATCACGATCG AGCCGGATGC ATTGCTCGCG GCGGGGCTCG CACCCGAGCA CTGGCCGGGC GCGCTGCACG CTGCAGAGCA CGCGGCGATC GGCCTGCTGC CGCTTGTCGC CTCGTGTGAC CGGTGGGACA TCGGCGGGCT CTCGACCGCG CAGCACGAGG ACACCGGCCT GCCGTCGATC TTCGTCTACG ACGGTTACCC CGGCGGGGCC GGATTCGCCG AACGCGGCTA CGAGGCGATC GCCACCTGGC TCCTGGCCAC CCGTGACGCG GTCGCGGCCT GCGAGTGCCC GGCAGGCTGT CCCTCCTGCG TGCAGTCTCC CAAGTGCGGC AACGGGAACG ACCCGCTCGA CAAGGCCGGC GCGATCGTCG TTCTCGACCT CGTCTTGGCC GCCCTCGCAC AGGGCTGA
|
Protein sequence | MIQVPENEHS EAIGRASASG ANALEHNTFG RELLHRIQAG TGPAGDGLTH VADIPSRRAE FAQWPQWLPS NIRDGFIESG VERPWRHQIE AAEHAHAGRH VVISTGTASG KSLAYQLPVL AGLATDPRAT VLYLSPTKAL GTDQHAAALR LTSMFDGLGD VSPAMYDGDT SQEMRRWARS DSRWVFTNPD MIHVGMLPRH AKWARFLRGL RYVVVDECHH YRGVFGSHTA LVLRRLLRVA AKYGAEPTVI CASATTSDPA GAASRLIGSD CVAVETDSSP HGPRTVVLWE PPLIPDLEGE NGAPVRRQAT TEAARMMADL VVEGARTLAF VRSRRSAETV ALSTRRMLAE ATPELASRVA AYRAGYLAED RRALERGLND GELLAVATTN ALELGVDIAG LDAVLMAGFP GTVASFWQQA GRSGRRGQGS LILLIARDDP LDTYLVHHPE SLLGRPVEAT ITDPWNPYVL GPQLLCAAGE LPLTREEVTA LGAVDVVGRL TADGLLRERP AGYFLAAGID PHARVNIRGG AGSEVLIVEE LTGRLLGTVD FNRALSTVYE GAVHVHQGES YVVDELDLDE GLAMVHAEEP EWTTSAREDS DVRVTGVHQT EELGAVTARF VSVEVTSQVV GYLRTLRTGE VLDAVELDLP ETSLATQAAL ITIEPDALLA AGLAPEHWPG ALHAAEHAAI GLLPLVASCD RWDIGGLSTA QHEDTGLPSI FVYDGYPGGA GFAERGYEAI ATWLLATRDA VAACECPAGC PSCVQSPKCG NGNDPLDKAG AIVVLDLVLA ALAQG
|
| |
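The sequence fields are stored in space-separated blocks of ten characters, so they need cleaning before use. The following is a minimal sketch, not the database's own tooling, of how the gene sequence could be cleaned, its GC fraction computed, and its translation checked against the protein sequence. It relies on the fact that translation table 11 uses the standard codon assignments but accepts alternative start codons such as GTG, which are read as methionine at the first position. All function and constant names here are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); table 11 shares them.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS, reading an initial start codon as M and stopping at a stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        codon = s[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate("GTGATTCAAG TCCCGGAAAA CGAACATTCC"))  # MIQVPENEHS
```

The output matches the first ten residues of the protein sequence. Applying `gc_fraction` to the full gene sequence gives a value that can be compared with the GC content field.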