Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_1965 |
Symbol | |
ID | 9156120 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 2052531 |
End bp | 2054987 |
Gene Length | 2457 bp |
Protein Length | 818 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | protein of unknown function DUF404 |
Protein accession | YP_003646916 |
Protein GI | 296139673 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.206535 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACCGTCG ACGGACTACA GGATCTCGAC GCCGATGCCC TCGCCCGGTT GCAGGCACGG GTACGGGGCC TGATCGATGA CGAGGGCATC ACCTACAACG CACTCGACGC GTTACCGAGT GACCTCGCCG CACCGGCCAC CACCGGGCGG TGGCGACTCG ATCCGTTGCC GGTGCTGCTG AGTACCGACG AGTGGGAACC CCTGGCGCGC GGCGCCGCCC AGCGGTCGAC GCTGCTCGAC GCGCTGCTCC GCGACTTCTA CGGCGAACAG CGCACCATCC GCGACGGCCT GCTGCCCCCC GAGGTGCTGT TCGCCCACCC CGGTTTCATC CGCCGGGCCT TCGGTGTTCC CGCGCCCGGG ACCAAGGCGT TGTTCCTGCA CGCCGCTGAC GTCGGCCGGA TCGGAGCCGG CGGCGCGTAC GCGGTGGCCG CCGACCGCAC CCAGGCGCCG TCGGGCGTGG GCTACGCCCT CGCGGACCGG CGGGTCACCT CCCGCGCCCT GCCCCGCGAA TTCCGCTCCG AGACACCGCG ACCGGTGTCG TCGTTCGCCG CCGCGCTGCG GGGGCAACTG CTCGAGAGCG CTCCACCCGG GGTCGACGAT CCCACGGTGG TGGTGCTCAG TCCCGGATCG TTCTCCGAGA CGGCGTTCGA CCAGGCCTAC CTCGCCTCCG TGCTCGGATT CCCGCTGGTC GAGGCCTCCG ACCTCACCGT CCGCGACGGT GGTGTGTACA TGCGCGCACT CGGCCGAATG AAACGCGTCG ACGTGGTGCT GCGCCGTGTC GACTCCGAGT TCTCCGACCC GCTCGATCTG CGCACCGACT CCCGGCTCGG TGTGGTCGGC CTGGTCGAGA TGATGACCCG CGGGGCCGTC ACCGTGGTGA ACACACTCGG CTCCGGCGTT CTGGAGAACC CGGCCCTGCA CGCCTACCTG CCGCAGTTGT GCCGCGCCCT GCTCGATGAG GACCTGTTGC TCGACTCCAC CCCGACCGTG CACGCGGCCA CTCCCGCCGG TCGCGCCGTG ATCGACGGCG CACTCGACGA TCAGTTGCTG ATCGATTTCT CCACCGGGGA GCGGATCCTG GGCGCCGACC TCACCGCCGA GGCCGCCGCG GCACTGCGCG CCCGCATAGC CGAGCAGCCC GCGCTGTGGT GCGCGAAGGA ACTGGTGCCC TTCGACACCG AGCCCGCTCT GTCCGACGGT GCCGTGCTGG ACCGCGGATT CTCCCTGCGG GTCTTCTCGC TGGCGCAGGA GGCCGGCTAC ACCGTGCTCG GTGGCGGCCT CGGCCAGGTC CTCCTCGACG GTGCCGCCGG CGCGCAACTG CATACATCGG CGGCCCGTGA CGTCTGGGTG CCCGCGGGGG AGGACAGCCG CGGATCGATC CGCGTGGCGA CCTCTCCGCG CGTGGCCAGG ATGAACGGCG TCACGGCGAG CGGACCCGTC GCCACCCCGC GCGTCCTGTC CGACCTGTTC TGGATCGGCC GGTACGCAGA ACGCGCGGAG GCGATGGTGC GGCTGCTCAG TGTGGCCCGC GAACGCGATC AGGAGTTCCG GCACCGGCCC TGGCAGCCCG GGGCCGCCTC CCTGCAGCCC CTGCTCGACG CCGTCGTCGA GGTGTCCGAT ACCGGGCAAC TCGGACCGAT CGTGGCCGAG GGCGCCGACC AGTCCGATGT GCTCGCCCGA TTGCGTCGAC TCACCCTCGA CACCGACCTG CCCGGCACCG TCGCCTTCGC CGGCGTGCGC CTGCGCGCCT GCCTGCGCGC GGTCCGGGAT CAGATGTCGA CCGACACCTG GCTCGTGCTC AGCGGCGCCG AGCGCTCGCT GGGCCGGCTC GCCGCCGACC GGCACGACGG GGGCGAGCAA CTCGACCAGA CCCTCGGCGA GGTGCTGGTC TCGCTGCTCG CCTTCGCCGG ACTCGCCCGT GAATCGCTGG TCCAAGATCC AGGATGGCGC ATGATGGACG CCGGGCGGCG GATCGAGCGG GCCCTGCAGT TGGCCGACCT GACCACATCC ACCGTGGTCC CGGCCCGAGA ATCCGAGGTC GAGACAGGCC TGCTCGACGC GTACCTCGTG GCCTGTGAGT CGTCGGTCAC CTATCGGCGG CGGCACCGGT CGGTACTGCG TGCGGGCGCC GTGGTCGACC TGATGTTCCT CGACGCCGAC AATCCACGGT CGATGGTCTT CCAGCTCGAC TCGCTCTCGC GCGACCTGCA GAACCTGCCG GACGAGATGC GCAGCGTGGC CGCCGAACGC ACCGCCACCG AACTACTGGG CCGGCTGCGC CGGTTCGACC CCGAGGAGGC CGAGACCGTG ACCGACGGTG TCCGCACCGA ACTCGTCGCA CTGATCGATG CCATCACCGG TGGCCTGCGC GATATCTCAG ACGTGCTTGA GCGGACCCGT TTCGCGCTGC CCGCCGAGGC TCGTCCGATC TGGGTCGGGG TGCCGTCGTG GGCCTGA
|
Protein sequence | MTVDGLQDLD ADALARLQAR VRGLIDDEGI TYNALDALPS DLAAPATTGR WRLDPLPVLL STDEWEPLAR GAAQRSTLLD ALLRDFYGEQ RTIRDGLLPP EVLFAHPGFI RRAFGVPAPG TKALFLHAAD VGRIGAGGAY AVAADRTQAP SGVGYALADR RVTSRALPRE FRSETPRPVS SFAAALRGQL LESAPPGVDD PTVVVLSPGS FSETAFDQAY LASVLGFPLV EASDLTVRDG GVYMRALGRM KRVDVVLRRV DSEFSDPLDL RTDSRLGVVG LVEMMTRGAV TVVNTLGSGV LENPALHAYL PQLCRALLDE DLLLDSTPTV HAATPAGRAV IDGALDDQLL IDFSTGERIL GADLTAEAAA ALRARIAEQP ALWCAKELVP FDTEPALSDG AVLDRGFSLR VFSLAQEAGY TVLGGGLGQV LLDGAAGAQL HTSAARDVWV PAGEDSRGSI RVATSPRVAR MNGVTASGPV ATPRVLSDLF WIGRYAERAE AMVRLLSVAR ERDQEFRHRP WQPGAASLQP LLDAVVEVSD TGQLGPIVAE GADQSDVLAR LRRLTLDTDL PGTVAFAGVR LRACLRAVRD QMSTDTWLVL SGAERSLGRL AADRHDGGEQ LDQTLGEVLV SLLAFAGLAR ESLVQDPGWR MMDAGRRIER ALQLADLTTS TVVPARESEV ETGLLDAYLV ACESSVTYRR RHRSVLRAGA VVDLMFLDAD NPRSMVFQLD SLSRDLQNLP DEMRSVAAER TATELLGRLR RFDPEEAETV TDGVRTELVA LIDAITGGLR DISDVLERTR FALPAEARPI WVGVPSWA
|
| |