Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_4033 |
Symbol | |
ID | 9158217 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 4160973 |
End bp | 4162511 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003648943 |
Protein GI | 296141700 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.104579 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGATC GCCGCCTGAT CGTCACCCCG CTCGCCGCCG CACTGATCGG TGCCACACTG GTCATCGTCA GCGCTGGATG CTCTTCCGGT CGTGGCGACG ACGCCGCCAC GCTCGACCTG ACCACCGGCG TGAACACGCC GCTCACGATC TCCGCCGGAC AGCTCGCGAA GCTCGCTGAC GGCGGTGCGG TCGTCGGTGT CGAATCGTCG CCGAACGGCA CCGTCGAGGC AGGGAAGGGC GGTGCCCTGG TGTTCAAGCC GAACAGCGGC TACACGGGGA CGGTGGAGCT CAAGGCCACC GTGTCGCCTG CGGTCCAGCT GTTCTCCTCG GACATCCCGC CGCTCACCAC CGTCGGCGGG GTCAGCGTGG ACGCGAGCGG CTACGGCTCG TCGTGGGTGC CCGTTCCCGG CGCACCGGAC GAGTTCTACG GCCTCACCGA TCGCGGCCCC AATGTCGATG GTCCGCAGAA GGATCAGAAG ATCTCGGTGA CACCCGATTT CACGCCGCAG ATCGGCCGGT TCAAGCTCGA ATCCGGGGTC GCCAGGTTGT TGAGTGTGAT CACCCTCAAG GGCCCCGACG GCCGGCCGCT CAACGGCCGC ACTGACACAG CGGCGCCCAC GGGCGAGAAG ATCATCGACC TGGACGGCCG GGAGATCCCG CCCACGGATC ACGGCATCGA CTCCGAGGGC CTGGTCGCGA TGCCGGACGG ATCCTTCTGG GTGTCCGACG AATACGGCCC GTTCCTCATT CACTTCGACT CCAACGGCCA GGAGCTCGAG CGGCTCGCGC CGGGGCGCGG CCTACCCGAG GTGCTCAAGA ACCGCACCCC GAACCAAGGC ATGGAGGGTC TGACCCTCAC TCCCGACGGT TCCAAGCTGG TCGGAATCAT GCAGTCGGCG CTCAACCTGC CGGGGCTGAG CGGCAATGCC AAGGAGGTAC CGGCCACCCG GATCGTGACC GTCGACCTCA AGACCAAGGC GACACAGCAG TTCGCGTACC TGCTCGACAA CCCCAAGGAC ACCAAGAAGG CGGTTTCCGA GATCACCGCG ATCTCCAACA CCGAGTTCCT GGTCGACGAG CGCGACGGCA AACTCGCCCC CAAGGCCAAT AAGACGATCT ACACGATCAG TCTCGACGGT GCCACGCCGC TCACCGAGCA GCAGAATCTG GAAACGATCG TGGGGGTCAG CAATACCGCA GCGGCGGAGA GCGCCCTCAA AGCCGCGGGC ATCACGCCCG TCCGTAAGTC GGTGGCGCTC GACCTGAGCG GGCTCGTCGA CAAGCTCAAT CCTCGAGGTA CCTTCTTCGG CCATGACAAG GTCGAGGGTC TGACCACCGT CGATGGCGGA AAGACCCTGT ACATCGCCAA CGACAGCGAT TTCGGCCTGG CCGGTATCGC CGGCCCGAAG GTGCCCTTCC AGCTCAAGCC GAAGATGCTC GCGAACGGCC TGCAGGACAG CCTGGAAGTG CTCCGCGTCG ACACGGCTCG GTTGAACGAG GCGACCGCCA CCCGGACGAT CAAGGTCACC GTCAGCTAG
|
Protein sequence | MSDRRLIVTP LAAALIGATL VIVSAGCSSG RGDDAATLDL TTGVNTPLTI SAGQLAKLAD GGAVVGVESS PNGTVEAGKG GALVFKPNSG YTGTVELKAT VSPAVQLFSS DIPPLTTVGG VSVDASGYGS SWVPVPGAPD EFYGLTDRGP NVDGPQKDQK ISVTPDFTPQ IGRFKLESGV ARLLSVITLK GPDGRPLNGR TDTAAPTGEK IIDLDGREIP PTDHGIDSEG LVAMPDGSFW VSDEYGPFLI HFDSNGQELE RLAPGRGLPE VLKNRTPNQG MEGLTLTPDG SKLVGIMQSA LNLPGLSGNA KEVPATRIVT VDLKTKATQQ FAYLLDNPKD TKKAVSEITA ISNTEFLVDE RDGKLAPKAN KTIYTISLDG ATPLTEQQNL ETIVGVSNTA AAESALKAAG ITPVRKSVAL DLSGLVDKLN PRGTFFGHDK VEGLTTVDGG KTLYIANDSD FGLAGIAGPK VPFQLKPKML ANGLQDSLEV LRVDTARLNE ATATRTIKVT VS
|
| |