Gene Information |
Locus tag | Sare_1690 |
Symbol | |
ID | 5705225 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 1951032 |
End bp | 1952588 |
Gene Length | 1557 bp |
Protein Length | 518 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641271193 |
Product | glycosyl transferase family protein |
Protein accession | YP_001536568 |
Protein GI | 159037315 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
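The coordinates and lengths in the table above are mutually consistent, which a little arithmetic can confirm. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon not counted in the protein length. The function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of the record.
length_bp = gene_length(1951032, 1952588)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1557 518
```

Both results match the Gene Length and Protein Length rows of the record.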
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0342764 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00101578 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCGTCC CCGACGTCAC CGTCATCACG GCGGTCTACA ACACCATGCC GTACCTGACC CGCTGCCTGA CGTCGCTGGT GGAGCAGACC ATCGGGCGAG ACCGGTTGGA GGTCATCGCG GTCGACGACG GGTCCACCGA CGGCAGCGGT CCCGAGCTGG ACCGGTTTGC CCGGCTCTAC CCGGGCACGG TGAAGGTGGT GCACCAACCC AACTCGGGCG GCCCGGCTGC ACCGAGTAAT CGCGGGCTGG AGCTGGCGAC CGGCCGCTAC GTCTTCTTTG TCGGCTCCGA CGACTACCTG GGGCCACAGG CGCTTCAGCG GCTGGTCACC GCCGCCGACC GGTGGGAGTC GGACGTGGTG CTCGGCCGCC TGGTGGGGGT GAACAGTCGC TACATTCACC AGGCGATCTA CGCCGAAAGC TCCGCCGACG TCGACCTGTT CGGCTCGGCT CTGCCCTGGT CGCTGTCGAA CACGAAGCTG TTCCGGCGGG AACTCGTCGA GCGGCACGGG CTGCGCTACC CGGAGGACAT GCCGGTCGGC AGCGACCAGC CGTTCACCAT CGAGGCCTGC GTCCGGGCCC GCAGGGTCTC AGTGCTCGCC GACTACGACT ACTACTACGC GGTGCGTCGG TTGAACGCGC GTAACATCAC CTACCGCAGC CGGCACCTGG AGCGGCTGCG CTGCGCCGAG GAACTGGTCA CCTTCGTGGC CGGGCTGGTC GAGCCCGGCC CGAACCGCGA CGCGGTGCTG CTGCGACACT TCACCTGGGA GGTCGCCAAG CTGTTGGAAA ACGACTTCCT GCAGCTCGAT CGCACCGTGC AGGACCAAGT GGTGGCAGGG GTGCGGACGC TCACCGAGGC GCATCTGACC GACCGCATCC GGGATCGTCT GCCGATCGAG GCCCGGGTGC GGCTCGCCGC TGCCCGGTAC GGTGACACCG ACCACCTCCT CGCGGTGATC CGGCAGGACG CCGAGTTGGG TATCCCGCTC GCCGTGATCG AGGGTGAACG CTGGTATGCC GGCTACCCGG GTTTCCGAGA TCCGCGACTG CGCATTCCGG ACTGCTGGTA CGAGATCACC GATACCGCCG CCGACTGGGT GGCCCGGCTA GACACCGTCT CGGCGGCCTT CGAAGGATCA CGGGCGCTGC TGGTGACCGC CCGCAGCCCC CGCCCTGACC TGCCGGAGCT GGCGTCGTCG GTCCGGCTCG CGGCCGGTGA CGTGACCGGC GAGACGCTGT CGACGGTCGC GGACGCCACC GGCACGACCG TACGCGCCCG GATTCCGTTG GATCGGTTGC TGGAAGGCGC TGGCCCGGGT GGGGAACTGC GCACGGTCCA GGCGCTTGCG AACGCGTTCG GCACCACCGG CGCGGCGGCC CTGCGCGGCG CCCGGCGGCC GGTGCCCCAG CGGGCGGTGC TGCGCCGGGG CGCCCGACTC CATGTTCTGA CCATTACCAC CAATCACAAG GGCCAGCTTG TCATCGCCGT AGCACCTGTC ACCCCACGCC GGTTGATGGC CCGCCTGCGG CGCAGGCTTC CACTAGGAGG AAAGTAG
|
Protein sequence | MTVPDVTVIT AVYNTMPYLT RCLTSLVEQT IGRDRLEVIA VDDGSTDGSG PELDRFARLY PGTVKVVHQP NSGGPAAPSN RGLELATGRY VFFVGSDDYL GPQALQRLVT AADRWESDVV LGRLVGVNSR YIHQAIYAES SADVDLFGSA LPWSLSNTKL FRRELVERHG LRYPEDMPVG SDQPFTIEAC VRARRVSVLA DYDYYYAVRR LNARNITYRS RHLERLRCAE ELVTFVAGLV EPGPNRDAVL LRHFTWEVAK LLENDFLQLD RTVQDQVVAG VRTLTEAHLT DRIRDRLPIE ARVRLAAARY GDTDHLLAVI RQDAELGIPL AVIEGERWYA GYPGFRDPRL RIPDCWYEIT DTAADWVARL DTVSAAFEGS RALLVTARSP RPDLPELASS VRLAAGDVTG ETLSTVADAT GTTVRARIPL DRLLEGAGPG GELRTVQALA NAFGTTGAAA LRGARRPVPQ RAVLRRGARL HVLTITTNHK GQLVIAVAPV TPRRLMARLR RRLPLGGK
|
| |
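The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any calculation. The sketch below shows one way to strip the spacing, compute GC content, and translate the gene sequence. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits; this gene starts with ATG, so that does not matter here). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (third base varies fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
first_30 = "ATGACCGTCC CCGACGTCAC CGTCATCACG"
print(translate(first_30))  # prints: MTVPDVTVIT
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the same comparison against the GC content and Protein sequence rows.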