Gene Information |
Locus tag | Tpau_0324 |
Symbol | |
ID | 9154459 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 331543 |
End bp | 333093 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003645307 |
Protein GI | 296138064 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
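The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the gene length includes a terminal stop codon that is not counted in the protein length. Neither assumption is stated in the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(331543, 333093)
print(length)                  # 1551
print(protein_length(length))  # 516
```

Both values match the Gene Length and Protein Length rows, which is consistent with the assumptions above.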
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGTCGGCG TGTCGCCTGA CGCGGCCTGG GATTTCACTC GTGCGATGTC GCTGCGTTCG CATGTGCGTG GCCTTAGTGC TGATGGTTAC AGCGAGTTCA TTTCGTGCGA TCGCGCGGCG CCGGCCGGTG TGTGGGCGGT TTACCTGACT GATGCTCGGC TGCGGTATCG GCTGCTCGGG TTCGACTTCG ATGCCAAGGG CGATACTGCG GAGATCGCCG CGGTCGCTCG GACTGCGGCT CAGCACAGTG CCGACCAGCT GGTGTCGATG CTTCGGGCGT GCGGTATTGA GCACGTGGTG TGCGAGTCGG GCCCGTCGGG TGGCCGGCAT GTCTGGGTGG GCCTGGCTAC GCCGCTGTCG GTGGAGTATG TTTCGGCGCT CGCTGATCTT GCTCGCACGG TTTTCGGCCG CGTTCAGGCC AGCGGTGAGT GGGTGCTTGA TATCGGCATG CTCAGCAACC GCAAGACCGG GCTGCTGCGC CCACCGGGCT CCCCGCACGC TGCCGGTGGC GCGTCGACGG TGATCGCCGG TGATCCGACG GTGCTGCTGC CGGATGCGCG CTCGACGACC CGTCAGCAGG TCGACGCGCT TGTGCAGCTG CTTCAGGAAG CGAAGGCTGC GCTGCCCGCA GACGCAGGCG ACCCGATTGA TGGCGGTGGC CACCAGCGGT CCGTCACGGT CGCCGAAGCC CCGGATGGGC ACTTGTGGAT TCCGGGCACT TTCCGTGAGC TGTCGGGCCC GGCCCTGGCG AAGGTCCATG CCCCTATTAC CGCTAGTACG GACGCCTCGG ATGTGTTGTT CTCGATCCTG TGTAGCTGTG CCCGCGCGCA CTGGAGTTTC ACCGACGTTC ACACGCGTCT CGCCGGGCTC CCCGGTCTCG AGCACGCCCG CACCCGCCGC AGCAAACACC ACCCCGGCAC CCGCGAGCGC CGCCCCCGCA AGGGCAGCCA GTCCCCCAAA CAAGTCCTTG CCGCCGACTG GCGCCGCGCG ATCGCCTACC TCGCCGACCA CAGCACCACA TCAGAGGACC ACGACCTCCC CGACGATCCG GACTTCGCCC GCCGTACCGC GGCCATCTGC CAGGCCACCA CCGGAATCCG CCAGCGGGCC GATGCGAACA TGCGCCGTTG GAGCATCCGC GGCGGCGCCG CCGACCGCCT CGTCCTCGAC GCACTCTGCG AGATCGCCGA CAACGTCGTC CAACTCACCA TCGGCGCCGA CGTGCGCACC CTCGCCGAAA TGGCCGGCAT CACCCGCGAA CGCAGCCGCA ACGCCCTCCA CCGCCTCTCC CGCGACGGCT GGATCACCCT CGCCGCCGAA CACACCGGCC CCCACGCCGC CGTCTGGACC CTCAACAACA CAAGCCCGCA GCCTGCCGAG GTGCCTACTG CAACACCTGC TGAGACGGCA CCGGAGAGGG GACCACACGT GGATACCGAG GAGAGCACTA CTCGCGTGTC ACTCGGGGCC ATAGGCCCCG GAGAGCCCAC CCCACCTGTT GCCGAAGGGC CTCCCGCGGA GGATGGAGGG AGTGTCAGAT TTTCGGTGTA A
|
Protein sequence | MVGVSPDAAW DFTRAMSLRS HVRGLSADGY SEFISCDRAA PAGVWAVYLT DARLRYRLLG FDFDAKGDTA EIAAVARTAA QHSADQLVSM LRACGIEHVV CESGPSGGRH VWVGLATPLS VEYVSALADL ARTVFGRVQA SGEWVLDIGM LSNRKTGLLR PPGSPHAAGG ASTVIAGDPT VLLPDARSTT RQQVDALVQL LQEAKAALPA DAGDPIDGGG HQRSVTVAEA PDGHLWIPGT FRELSGPALA KVHAPITAST DASDVLFSIL CSCARAHWSF TDVHTRLAGL PGLEHARTRR SKHHPGTRER RPRKGSQSPK QVLAADWRRA IAYLADHSTT SEDHDLPDDP DFARRTAAIC QATTGIRQRA DANMRRWSIR GGAADRLVLD ALCEIADNVV QLTIGADVRT LAEMAGITRE RSRNALHRLS RDGWITLAAE HTGPHAAVWT LNNTSPQPAE VPTATPAETA PERGPHVDTE ESTTRVSLGA IGPGEPTPPV AEGPPAEDGG SVRFSV
|
| |
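The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to strip the grouping and compute a GC fraction. The helper names are hypothetical, and the example runs on only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace used for display grouping and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two display blocks of the gene sequence, copied from the record.
first_blocks = "GTGGTCGGCG TGTCGCCTGA"
seq = clean_sequence(first_blocks)
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.7
```

Applying `gc_fraction` to the full cleaned gene sequence should give a value comparable to the GC content row, depending on how the database rounds. Note that the gene begins with GTG, which is an accepted alternative start codon under translation table 11 and is read as methionine at the initiation position, in agreement with the leading M of the protein sequence.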