Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_2096 |
Symbol | |
ID | 5704675 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 2414090 |
End bp | 2415151 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641271581 |
Product | glucose-1-phosphate thymidyltransferase |
Protein accession | YP_001536952 |
Protein GI | 159037699 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1209] dTDP-glucose pyrophosphorylase |
TIGRFAM ID | [TIGR01208] glucose-1-phosphate thymidylylransferase, long form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00805861 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAGCCC TCGTGTTGTC CGGTGGCATG GGTACCCGCC TGCGTCCCTT CACACACTCG ATGCCCAAGC AGCTGTTCCC GGTGGCCAAC CAGCCGGTAC TGGCGCATGT GCTGGGCAAG ATCCGTACGC TCGGCGTAAC AGAGGTCGGC ATCGTCGTCG GCGGTGGTGG TGCGGCCCAG GTCGAGGAGG CGATCGGGGA CGGCGCGCGG TTTGGTCTAC AGGTGACCTA CGTCCACCAG CACCAGCCCC GCGGCCTTGC CCACGCCGTC CAGGTCGCGG CCGACTTCCT CGGTACCGAC GACTTCCTGG TGTACCTGGG CGACAATGTG CTCACCGAGG GGCTCGTCGA GTTCGTGGCC CGGTTTCGAG ACGAGCGTCC CGCGGCCCAC CTGCTGGTGC AGAAGGTGAG CGATCCCCGG TCGTACGGGG TGGTCGAGCT CGATGCCGGA AGAGTGCAGC GACTGGTGGA GAAGCCCGCC TCCCCCCGCA GTGACCTTGC CATCGTCGGC GTGTATCTCT TCACTCACGA GATCCACACC GCCATCCGGG AAATCCGTCC TGGCCGGCGC GGCGAGTTGG AGCTGACGGA CGCGGTGCAG TGGCTGGTGG ACAGTGGAGC GCGGGTCGAG GCGACGGAGT ACGGCGGCAA CTGGAGTGAC GTCGGCCAGG TTGATGACCT GCTCGAGTGC AATCGGCACC TGCTCACCAA GCTCGACGCC GACGTCGCCG GCGAGGTCGA CGAGGTGTCC GTGGTCGAAT CCGGGGTGCG AGTCGAGGCC GGTGCTCGGG TGGTCCGTTC CGTGCTTCGG GGCCCGGCGG TGATCGGCGC GGAGACGTTG GTCGAGGACA GCGTCATCGC CCCGAACACG TCCGTCGGGG CGGGCTGTGT GGTGCGGAGG ACCTGCCTGG CCGACTCGAT TGTGCTGGAC GGGGCTCAGG TCACCGGCGT ACCCCAGCTG CGTGGCTCGA TCGTCGGCCG GTCAGCCACC GTGACCGGTG TCGCCGAAGG CGTGCACCGG CTGCTCGTCG GCGACCACAG CCAGGTGGAG ATCGCCCGAT GA
|
Protein sequence | MKALVLSGGM GTRLRPFTHS MPKQLFPVAN QPVLAHVLGK IRTLGVTEVG IVVGGGGAAQ VEEAIGDGAR FGLQVTYVHQ HQPRGLAHAV QVAADFLGTD DFLVYLGDNV LTEGLVEFVA RFRDERPAAH LLVQKVSDPR SYGVVELDAG RVQRLVEKPA SPRSDLAIVG VYLFTHEIHT AIREIRPGRR GELELTDAVQ WLVDSGARVE ATEYGGNWSD VGQVDDLLEC NRHLLTKLDA DVAGEVDEVS VVESGVRVEA GARVVRSVLR GPAVIGAETL VEDSVIAPNT SVGAGCVVRR TCLADSIVLD GAQVTGVPQL RGSIVGRSAT VTGVAEGVHR LLVGDHSQVE IAR
|
| |