Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_2262 |
Symbol | |
ID | 9156418 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 2353274 |
End bp | 2354359 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | protein of unknown function DUF808 |
Protein accession | YP_003647210 |
Protein GI | 296139967 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.406953 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGGAG GACTCGCGGC GCTGCTGGAC GACATCGCCG CGCTCGCACG GCTGGCAGCC GCATCGGTGG ACGACGTGGC CGGCGCGGCC GGACGAGCCA GCGCGAAGGC CGTGGGAGTG GTGGTCGACG ACACGGCCGT CACCCCGCGC TATGTGCAGG GGATCAAGCC CGAACGCGAG CTGAGCATCA TCAAGCGCAT CGCGATCGGC TCGCTGCGCA ACAAGCTCCT GATCATCCTG CCCGCAGCAC TGATCCTCAG CCAGTTCGCG CCGTGGGCGC TCACGCCGAT CCTGATGCTC GGCGGCACCT ACCTCTGCTA CGAGGGCGCG GAGAAGATCT GGGAGAAGGT CAGTGGCCAC GGCGACGGCG GGGTCGAGGC ACCGTCGTCC TCGACCGGCC CGGTCGACGA GGACAAGGTG GTCAGCGGCG CGGTCCGCAC CGACTTCATC CTCTCCGCCG AGATCATGGT CATCGCGCTC AATGAAGTCG CCGGCGAGAT CCTGTGGCAG CGCGCGGTCA TCCTGATCGT CGTGGCGATC GCGATCACCG CGCTGGTCTA CGGAGTGGTC GCACTCATCG TGAAGATGGA CGACGTGGGC CTGGCTCTGT CGAAGAAGTC CTCGCCCACC GTCAGCCGCA TTGGATACGG CCTGGTCAAG GCCATGCCCG GCGTGCTCAG CACGCTCGCC ACCGTGGGCG TGGTCGCGAT GTGCTGGGTG GGCGGGCACA TCCTGCTGGT CGGTCTGGAT GAGTTGGGCT GGCACACCCC CTACTCGGTG GTCCACTCCT TCGAGGTGTG GGTGCACGAC CTGGTGCCCG CGATCGGCGC CGTGACGGGC TGGCTGGCCA ACACCCTCGC CTCGGCCGTA TTCGGTGTGA TCGTCGGCGG CCTGGTGGTG CTGATCATGC ACGCCATCCC CACGCGCACG AAGGGCACCC ACGAGGCCGG CCCCGATTCC CTGGTCGCGG CCGAGACCGC GGCCGTCGAG GACGACCTCG CGGACGCACA GGCCGACGGC GCCGCCGAGG GCCCGGACGA GCCTCAGACG GGGCCTCAGA CAGGGCCTCA GACGACGGGG ACGTAG
|
Protein sequence | MSGGLAALLD DIAALARLAA ASVDDVAGAA GRASAKAVGV VVDDTAVTPR YVQGIKPERE LSIIKRIAIG SLRNKLLIIL PAALILSQFA PWALTPILML GGTYLCYEGA EKIWEKVSGH GDGGVEAPSS STGPVDEDKV VSGAVRTDFI LSAEIMVIAL NEVAGEILWQ RAVILIVVAI AITALVYGVV ALIVKMDDVG LALSKKSSPT VSRIGYGLVK AMPGVLSTLA TVGVVAMCWV GGHILLVGLD ELGWHTPYSV VHSFEVWVHD LVPAIGAVTG WLANTLASAV FGVIVGGLVV LIMHAIPTRT KGTHEAGPDS LVAAETAAVE DDLADAQADG AAEGPDEPQT GPQTGPQTTG T
|
| |