Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_3057 |
Symbol | |
ID | 9157228 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 3168670 |
End bp | 3170037 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | Protein of unknown function DUF2029 |
Protein accession | YP_003647989 |
Protein GI | 296140746 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0888832 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGAACA CCGACGTCAC GGGCCCGCCG GTCGACGGCG CGACGCAGGT AGTCTGGTGC CCCGTGACGA CGTCCGACCA CTCCGGCCCG ACGTCCCGCG GCACCCTGCG CCGTTGGGCG CTCGCGGTGT TCCTGCTCTC GCTGGCGGCG AGATTCGTGT GGATCGCGGC AGGCCAGCAC AACTTCAACT TCGTCGACCT GCGGGTGTAC TACGAGGCGG CGCAGCGTGT CGCCGATGGG ACGCTGTACG ACTTCGCGCT CACCGATTAC ACCCCGCAGC AGCCGCTGCC GTTCACCTAT CCACCGTTCG CCGCGCTGTG CTTCTACCCG CTGAAGTTCC TGCCGTTCCC GGTGGTCGCG GTCGCCTGGG TGCTCCTCAC CGCGGGGCTG TTGTTCCTTG TGGTGCGGAT GAGCCTGGCG ATCCTGGCGG CCGGCCGCGG CGCCCCGTCG CGCCTGGGCG ACGTTCCCGT CGAACACGCC CTGTTCTGGA CCGCCGGTGC GCTGTGGATC GATCCCGTGC GCACCAATCT CGACTACGGC CAGATCAACG TGGTCCTCAT GGCGCTCGGC GTGTGGGCCG CGTACCTGAT CGCGCGCACC GAGGCGGGCC CGCCCGCGCT CGTGCTGCGG CCGCAGCGGG TCGACGGTGC CTCGGGCGCC CTGATCGGGA TCGCCGCCGG CATCAAGCTC ACCCCCGCCG TGGGCGGGCT GTTCCTCCTC TCTCGCGGCC GACCCTGGGC CGCCGTGTTC TCGGGCCTCA CCTTCGGCGC TACCGTCGGC TTCAGCTACC TGCTGCTGCC CTCGGAGACC CGGCGCTACT TCACCGTGCT TCTCGGCGAT ACCGGACCGA TCGGCGACCC CGCCAAACCC GACAACCAAT CGTTGCGCGG CGCGATCTCC CGGTTCGCCG GCCACGACGT GGGTACCGGC GCCGCGTGGA TGGTGGGCCT CGCCGTTGCG GCACTGCTGT TGTTCGCCGC CTGGTGGGTG GTGCGCCGGG GCGACGCCCT CATCGCTCTG GTGCTCGTGC AGTTGCTCGG GCTCCTGGGG TCGCCGATCT CGTGGATCCA CCACTGGGTG TGGATCGTGC CGCTGCTGAT CTGGGTGGTG CACGGCCCGC TCGGCCGACT TGGCGACCAC CGGTTCAGCG GCGCGGGCGC CGGATACACA CCGTGGTCGA CGGCACTCGC GGGCACATTC GTCGTGGTCG GACTCGTCGG GGTGCACTAC ACCGACGATG TGCTCGGCTG GCTCGGGCTG AGCGACGGTC CGGTGTGGGC CCTGTTCATG GCGCAGGGTC TGGGCGCGGC GATCGGTCTG ATCGCCCTGA TCCTCGCGCA GCGCCGACGC CTCCAGGCCG TAGTCTGA
|
Protein sequence | MVNTDVTGPP VDGATQVVWC PVTTSDHSGP TSRGTLRRWA LAVFLLSLAA RFVWIAAGQH NFNFVDLRVY YEAAQRVADG TLYDFALTDY TPQQPLPFTY PPFAALCFYP LKFLPFPVVA VAWVLLTAGL LFLVVRMSLA ILAAGRGAPS RLGDVPVEHA LFWTAGALWI DPVRTNLDYG QINVVLMALG VWAAYLIART EAGPPALVLR PQRVDGASGA LIGIAAGIKL TPAVGGLFLL SRGRPWAAVF SGLTFGATVG FSYLLLPSET RRYFTVLLGD TGPIGDPAKP DNQSLRGAIS RFAGHDVGTG AAWMVGLAVA ALLLFAAWWV VRRGDALIAL VLVQLLGLLG SPISWIHHWV WIVPLLIWVV HGPLGRLGDH RFSGAGAGYT PWSTALAGTF VVVGLVGVHY TDDVLGWLGL SDGPVWALFM AQGLGAAIGL IALILAQRRR LQAVV
|
| |