Gene Information |
Locus tag | Sare_2047 |
Symbol | |
ID | 5705492 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 2341224 |
End bp | 2342411 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641271534 |
Product | glycosyl transferase family protein |
Protein accession | YP_001536905 |
Protein GI | 159037652 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | [TIGR01426] glycosyltransferase, MGT family |
|
|
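The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(2341224, 2342411)
print(length_bp)                  # 1188
print(protein_length(length_bp))  # 395
```

Both results agree with the Gene Length (1188 bp) and Protein Length (395 aa) fields.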
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.555181 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0026549 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGATCGACG TGCGTGTGCT GTTCGCCAGT CTCGGAACAC ACGGCCACAC CTACCCACTC CTGCCGCTGG CCGCCGCCGC TCGGGACGCC GGCCACGAGG TAACCTTCGC CACCGGTGAG GGCTTCGCCG AGGTGTTGCG TGCGCAGGGC TTCGACCCGA TCGCCACCGG GATGCCGGTC TTCGACGGCT TCCTGGCGGC GCTACGGATC CGCTTCGATA CCGACAGCCC CGATGGGCTG ACACCCGAGC AGCTCAGCGA GCTTCCCCAG ATCGTGTTCG GGCAGGTGAT GCCGCAGCGC ATCTTCGACA GGCTCCAACC GGTGCTCGAC CGGGTGCGAC CCGACCTCGT GGTGCAGGAG ATCAGCAACT ACGGCGCAGG ACTTGCCGCC ACCAAGGCCG GCATCCCGAC CATCTGCCAC GGAGTCGGCC GTGACACCCC GGACGAGCTC ACCCGCTCCA TCGAGGACGA GGTGGGCAGG CTCGCCGCTC AGCTCGGCAT CGACCTGCCG CCCGGGCGTA TCGACGCCTT CGGCAACCCG TTCCTCGACA TCTTTCCGCC GTCGTTGCAG GAGCCGGCGT TTCGTTCCCG CCCCGAGCGG TACGAGTTGC GCCCGGTGCC GTTCACCGAA CGGCCGAAAG TGCCGGACTG GGTACTCGCG CGGACCAGGT CCCGGCCCCT GGTGTATCTG ACCCTGGGCA CCTCCAGCGG CGGCACCGTC GAGGTGCTGC GGGCCGCGAT CGACGGCCTG GCCACCCTGG ACGTCGACGT CCTCGTCGCG GGCGGCCCGT CGCTCGATCT CGCCCAGCTC GGCGAGGTGC CGACCAGCGT GCGGCTGGAG TCGTGGGTCT CGCAGGCGGC GCTGCTTCCC CACGTCGACC TCGTGGTCCA TCACGGTGGC AGCGGGACCA CCATCGGCGC GTTCGACGCT GGCGTGCCGC AGCTCTCCTT TCCGTGGGCG GGTGACTCGT TCGCGAACGC CCAAGCCGTG ACCCAGGCGG AGGCCGGTGA CCACCTGCCG CCCGGCGGTG TCAACGCCGA GGCGGTGGCG GACGCCGCGA AGCGGCTGAT CGCCGACGAG AGCTACCGGA CGGCGGCGAA GGCGGTCGCC GTCGAGATCG CCGCGATGCC GACCCCCGAC GAGGTCGCCC GCCGGCTGCC CGAGTTCGCC GGACGGCGGG CCGCCTGA
|
Protein sequence | MIDVRVLFAS LGTHGHTYPL LPLAAAARDA GHEVTFATGE GFAEVLRAQG FDPIATGMPV FDGFLAALRI RFDTDSPDGL TPEQLSELPQ IVFGQVMPQR IFDRLQPVLD RVRPDLVVQE ISNYGAGLAA TKAGIPTICH GVGRDTPDEL TRSIEDEVGR LAAQLGIDLP PGRIDAFGNP FLDIFPPSLQ EPAFRSRPER YELRPVPFTE RPKVPDWVLA RTRSRPLVYL TLGTSSGGTV EVLRAAIDGL ATLDVDVLVA GGPSLDLAQL GEVPTSVRLE SWVSQAALLP HVDLVVHHGG SGTTIGAFDA GVPQLSFPWA GDSFANAQAV TQAEAGDHLP PGGVNAEAVA DAAKRLIADE SYRTAAKAVA VEIAAMPTPD EVARRLPEFA GRRAA
|
| |
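The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, compute a GC fraction, and translate a CDS. It makes several assumptions: the helper names are hypothetical, the codon assignments are those of the standard genetic code (which translation table 11 shares), and only the common bacterial start codons ATG, GTG, and TTG are treated as initiators read as methionine. For brevity it uses only the first 30 bases of the gene sequence.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard codon assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: a subset of the start codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean_sequence(text: str) -> str:
    """Remove the whitespace used for block grouping and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS, reading an initiator codon as M and stopping at the first stop."""
    usable = len(seq) - len(seq) % 3
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"
    return "".join(residues).split("*")[0]


# First three blocks of the gene sequence from the record.
prefix = clean_sequence("GTGATCGACG TGCGTGTGCT GTTCGCCAGT")
print(translate(prefix))             # MIDVRVLFAS
print(f"{gc_fraction(prefix):.0%}")  # 60%
```

The translated prefix matches the first block of the protein sequence, including the GTG start codon being read as M. Running the same functions on the full gene sequence would allow a comparison with the Protein sequence and GC content fields.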