Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_2799 |
Symbol | |
ID | 9156964 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 2898433 |
End bp | 2900094 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | protein of unknown function DUF404 |
Protein accession | YP_003647736 |
Protein GI | 296140493 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.991145 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACCCCGC AGAAACAGGC CAAGTCCCAT ACCCGGCGAC CGGCCGCCCC CCGACTGTCC GACGACGCGC GGCTGTTCGC GGGATACGAC GACGCCCCGT CGTTCGGCGC CGCCTTCGAT GAGATGTTCG CCGATGACGG CACGGTCCGC TCGCCCTACA AGCGAGTCTT CGAGGCACTC TCGTCGGCCG ACGAGTCCGA CCTCGCCGCC CGCGTCGATG CCCTCGGTGC CGCGTTCATC GACCAGGGTG TGACCTTCTC GCTCGAGGGC CGCGAGCGCC CCTTCCCGCT CGACCTGGTG CCCCGCGTGA TCGCCGCCGG CGAATGGAAC CGGCTGGAGA AGGGCATCAA ACAGCGCGTT CGGGCGCTGG AGATGTTCCT TGACGACATC TACTCCGAGC AGGAGATCCT GCGCGACCAG GTGGTGCCCA AACGCCTGGT CACCTCGTGC GCGCACTTCC ATCGCCAGGC AGCCGGCATC CGCCCGCCGA ACGGCGTGCG GATCCACGTC GCGGGTATCG ACCTCATCCG CGACGCCGAG GGCACTTTCC GCGTGCTGGA GGACAATCTG CGCTCCCCGT CGGGCGTGAG CTACGTGATG GAGAACCGCC GAACCATGGC GCAGGTCTTC CCCGACCTGT TCCTGCGCCA CCGGGTGCGT GCCGTCGGCG ACTACAGCTC CCACCTGTTG CGCGCGCTGC GCCGCTCCGC CGCCTCGAAC GAGGCCGACC CCACGGTCGT GGTGCTCACG CCGGGTATGG CGAATTCCGC CTACTTCGAG CACTCCCTGC TCGCCCGGCA GATGGGTGTC GAACTGGTCG AGGGCCGCGA CCTGTTCTGC CGCGACAACG TGGTCTACAT GCGCACCACC GGCGGTGAGC AACAGGTGGA CGTGATCTAC CGTCGCATCG ACGACGATTT CCTCGACCCC ATGCAGTTCC GGCCCGACTC GGTGCTCGGC GTGGCAGGCC TGCTCAATGC CGCCCGCGCG GGCAATGTGG TGATCAGCTC GGCGGTCGGC AACGGCGTGG GCGACGACAA ACTCACCTAC ACCTACGTGC CCGAGATCAT CGACTACTAC CTCGGCGAGA AGCCCCTGCT GCAGAACGTC GACACGCTGC GCTGTTGGCT CGACGAGGAA CGCGAAGAGG TGCTCGACCG GATCGACGAA CTGGTGATCA AACCGGTCGA GGGGTCCGGC GGCTACGGCA TCGTCTTCGG CCCCGACGCC AGCGACAAGG AGCTTGCGAC CATGCGGCGC AAGGTCGCCG CCGATCCGCG CGGCTGGATC GCGCAGCCGG TCGTCCAACT CTCGACGGTG CCCACCAAGA TCGGGGAGAG CGCCCGTCCG CGTCACGTGG ACCTGCGCCC GTTCGCGGTC AACGACGGAG ACGATGTGTG GGTGCTGCCC GGCGGCCTCA CCCGCGTCGC CCTCCCCGAG GGCTCGCTGG TGGTGAACTC CTCGCAGGGC GGTGGCTCCA AGGACACCTG GGTGCTCGCG GCGCGATCGT CCGTCGCGGA GGCCGAGCTG GAGGGCGAGG CGCTGGTGCC CTCCCGCGAT CTGGCCGAGC AGCCGCACGT GGAATTGGGG CCCGATCTGA GCCAGCAGGA TCAGCAACAG CAACAGGCGA CCTGGACGGA GGCACCGCAT GCTCGCGCGT AA
|
Protein sequence | MTPQKQAKSH TRRPAAPRLS DDARLFAGYD DAPSFGAAFD EMFADDGTVR SPYKRVFEAL SSADESDLAA RVDALGAAFI DQGVTFSLEG RERPFPLDLV PRVIAAGEWN RLEKGIKQRV RALEMFLDDI YSEQEILRDQ VVPKRLVTSC AHFHRQAAGI RPPNGVRIHV AGIDLIRDAE GTFRVLEDNL RSPSGVSYVM ENRRTMAQVF PDLFLRHRVR AVGDYSSHLL RALRRSAASN EADPTVVVLT PGMANSAYFE HSLLARQMGV ELVEGRDLFC RDNVVYMRTT GGEQQVDVIY RRIDDDFLDP MQFRPDSVLG VAGLLNAARA GNVVISSAVG NGVGDDKLTY TYVPEIIDYY LGEKPLLQNV DTLRCWLDEE REEVLDRIDE LVIKPVEGSG GYGIVFGPDA SDKELATMRR KVAADPRGWI AQPVVQLSTV PTKIGESARP RHVDLRPFAV NDGDDVWVLP GGLTRVALPE GSLVVNSSQG GGSKDTWVLA ARSSVAEAEL EGEALVPSRD LAEQPHVELG PDLSQQDQQQ QQATWTEAPH ARA
|
| |