Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_2293 |
Symbol | |
ID | 9156449 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 2382882 |
End bp | 2384504 |
Gene Length | 1623 bp |
Protein Length | 540 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | protein of unknown function DUF222 |
Protein accession | YP_003647239 |
Protein GI | 296139996 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGAGA ACAGCAGCGG CGCAAGTGAA TATCTTGACG TACTGACCTC CTTCGAGCGG GCCGCCGAAC GTCTCGCCGC TGTCGATCCG GTGATGCTCA CTTCGCGAGA AGTACTCGAG AGCCTGCACC GCCTCGAAAC CGCAGCGCGA AAGGTCCCGT ACGCCCAAAA CTCTCTTGCC CGGATCGCAT TCGAACAGGA CCTGCCGGGG CAACTGAACT ACACCGGGCT CAAAGAAGTG CTCGTTGATC AGCTGCGTCT CGCAGGTTCG GAGGCCCGCG ACCGCATGCA CGGCGCGGTC GAGCGGGCGC CGCGCCACGA ACGCGGCGTC GCCCCCGAAC CAAAGCGCGC GCTGATCGCA GCCGCCCAAC GCGCGGGGCG GATCTCCGAA CGGCACGCCA CCGCGATCCA GCGGATCGTG AACAGCTGCC GCAGGCGCCT CACCACCACC GATGTCGAAC ACCTTGAAGA CATCTTGGTC ACCGCCGCCA CCGACGTCAC TCCCGACGAC CTCACCAAGA TCGGCAAGCG TGCCGTCGAT CTCCTCGACC CCGATGGCGC CGAGCCCAAA GAGGACATCA TCGCCCGCGC CCGCGGCGTC GAGATCGGAA GCCAGGACGA CGATCTCATG ACCGACTTCG GCGGCTCGCT CAGCCCGGAG GGCCGCGCCG TCATGAGCGC GATCTTGGAG AAGCTCGCCC GGCCCGGAGT AAACAACCCG GACGACGCCG ACGCGCCGAT CACCCCGGCC GATCGGGCCG CGATCGACCA AGCAGCCCAG CGGGACAAGC GCACGCAGGC CCAGCGTAAT CACGACGCCC TTATCACCGC TCTCCGCATT GCGCTCGGCA CCGGGGAACT CGGGCAACAC CGCGGCCTGC CCTGCATTCC GATCATCACC CTGACGATCG ACCAACTGGA GTCCGAGACC GGCATTGCCA CCACCGCCAC CGGCGGCCGG CTCCCCGTTC CCGACGCGCT CCGCATGATG GGCACCAACC CCAAGTACGC GCTCCTTCTC GACCTGGCGT CCCGCCCACT GTTCCTCGGC CGAGAGAAGC GCCTCGCCAC CCGGGACCAA CGCATCGCGC TCTACGGATC CGAGAAGGGC TGCACCGCAC CCGGCTGTGA CCAGCCCGCG ACGCGCACCC AGGTCCACCA CGTCACCGAC TGGGCCGACG GCGGAGGCAC CGACATCACC GCACTGACCC TGGCCTGCGA TAAGCACCAC GCGGCAGTGA CCCCGGCGAG CGGCGATCGC TCGCACGGCC TCGAGACCAT CACCATTGCG GATGGGCCGA ACGCCGGCCG CACCGGCTGG CGCCGCACAG CCGACCCCAC TCACCGGTAC CGCGTCAACC ACACCCACCA CACCGACGAA CTCCATCGCC ACGCCCTCGA ACACTGGCGG CGCCGGGCGA AGCAGTTCCG CGCGCGCTGG CTCGCCGAAG ACCTCCGCGT GCAGTACAAC GCCGTCATCG GCAGCACCTA TCGCGATATC TCCGCGACCC TCGACGGACC GAACGGCCCG CCCCTGCTCG AACAACTCCT GGCAGAGCAC GACGCCGACA ACGCCTGGCG CCCTGCTCCA CCCGGTGATG CCAGCCCGCC TCGCGCGGCT TGA
|
Protein sequence | MSENSSGASE YLDVLTSFER AAERLAAVDP VMLTSREVLE SLHRLETAAR KVPYAQNSLA RIAFEQDLPG QLNYTGLKEV LVDQLRLAGS EARDRMHGAV ERAPRHERGV APEPKRALIA AAQRAGRISE RHATAIQRIV NSCRRRLTTT DVEHLEDILV TAATDVTPDD LTKIGKRAVD LLDPDGAEPK EDIIARARGV EIGSQDDDLM TDFGGSLSPE GRAVMSAILE KLARPGVNNP DDADAPITPA DRAAIDQAAQ RDKRTQAQRN HDALITALRI ALGTGELGQH RGLPCIPIIT LTIDQLESET GIATTATGGR LPVPDALRMM GTNPKYALLL DLASRPLFLG REKRLATRDQ RIALYGSEKG CTAPGCDQPA TRTQVHHVTD WADGGGTDIT ALTLACDKHH AAVTPASGDR SHGLETITIA DGPNAGRTGW RRTADPTHRY RVNHTHHTDE LHRHALEHWR RRAKQFRARW LAEDLRVQYN AVIGSTYRDI SATLDGPNGP PLLEQLLAEH DADNAWRPAP PGDASPPRAA
|
| |