Gene Information |
Locus tag | Sare_1909 |
Symbol | |
ID | 5708118 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 2203365 |
End bp | 2204525 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641271413 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001536785 |
Protein GI | 159037532 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
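The length fields above are consistent with 1-based, inclusive coordinates: gene length is End bp minus Start bp plus one, and protein length is the codon count minus the terminal stop codon. A minimal Python sketch of that arithmetic follows. The function names gene_length and protein_length are hypothetical and not part of the database, and the inclusive-coordinate convention is an assumption inferred from the values in this record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Sare_1909.
length = gene_length(2203365, 2204525)
print(length)                  # 1161
print(protein_length(length))  # 386
```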
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.653406 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000425018 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGACGGAAC CGACCTCGCG GGCCCGCGGC ACGGTCGCCC TGCTGCTCGC CTCCAGCACC GGTGGTGTCG GGCAGCATGT CCGCTCACTG GCCGCCGGGC TGACCAGCAT CGGCGTCGCC GTGCTGGTCT GTGGCCCCGC TGCGACCGAG GAACAGTTCG ACTTCACCGG CGTCGGCGCC CGCTTCGAGG CGGTGGAGAT CCGAGCCAGC CCGACCCCCG CCGACATGCG GGCGGTGACT GCGCTTCGCC AGGCCCTCGC CGCCGAGCCG GTCGATGTGC TGCACGCGCA CGGGTTGCGC GCCGGGCTGG TCGCCGTCGC CGCCCGACCG GCTGTCCCGC TGGTGGTGAC CTGGCACAAC GCCGTCCTGG CCGGAGGACT GCGCGGCAGC GTGTCCCGCC TGGTCGAGCG GGTCGTCGCC CGCAATGCCC GAGTGAGCCT CGGTGCCTCG GCCGACCTGG TCCAGCGAGC CACCGAGTTG GGCGCAGGCG ACGCCCGGCT CGCCCCGGTC GCCGCGCCGC CGCTGTCTGA ACCGCACCGC CGCCGGGACG CGGTTCGCGC CGAGTTCGGC GTCGGCGGTG AGCAGCCGCT GGTGCTCTCA GTCGGCAGGC TGCACCCACA GAAGCGGTAC GACGTGTTGG TCGACGCCGC CGCCCGGTGG CGAAACCGTG CCCCCGTTCC GGCCGTCGTG ATCGCCGGCA GCGGCCCCGC CTACCTGCAA CTGGCTGCCC GGATCTCCGC CGCGCGGGCG CCGGTCACCC TGCTGGGACA CCGCACCGAT GTGGCTGACC TGCTGGCCGG CGCCGACGTC GCGGTGGTGA CCAGTGACTG GGAGGCCCGG CAACTGTTCG CGCAGGAGGC GATGCGGGTC GGCGTGCCGC TGGTGGCGAC CGCGGTCGGC GGTCTGCCGG AGCTGGTCGG CGACGCGGCC ATACTGGTGC CGCCGGGCGA TGTCGATGCG GTCGACGCGG CCGTTGGCCG CCTGTTGGAC GACCCGGCCC TGCGGGCCGG GCTGGGTCGG CAGGCCCGGG AGCGGGCGGC GACGTGGCCG ACCGAGGCGG ACACCTGCGC CCAACTCGCC GCGCTCTACG CCGAGCTGGC TCCGGGCGCC ACCGGCCCCA CCACCCCGGA GGCCGGCGCA CCCTCGACGG GGCCGCGGTG A
|
Protein sequence | MTEPTSRARG TVALLLASST GGVGQHVRSL AAGLTSIGVA VLVCGPAATE EQFDFTGVGA RFEAVEIRAS PTPADMRAVT ALRQALAAEP VDVLHAHGLR AGLVAVAARP AVPLVVTWHN AVLAGGLRGS VSRLVERVVA RNARVSLGAS ADLVQRATEL GAGDARLAPV AAPPLSEPHR RRDAVRAEFG VGGEQPLVLS VGRLHPQKRY DVLVDAAARW RNRAPVPAVV IAGSGPAYLQ LAARISAARA PVTLLGHRTD VADLLAGADV AVVTSDWEAR QLFAQEAMRV GVPLVATAVG GLPELVGDAA ILVPPGDVDA VDAAVGRLLD DPALRAGLGR QARERAATWP TEADTCAQLA ALYAELAPGA TGPTTPEAGA PSTGPR
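The gene and protein sequences above are related through translation table 11, which uses the standard codon assignments but lets some non-ATG codons (here GTG) initiate translation as methionine. The Python sketch below illustrates this on the first 30 bases of the gene sequence and also shows one way to compute a GC fraction. The names clean, gc_fraction, translate and START_CODONS are hypothetical helpers, and START_CODONS lists only three common initiators rather than the full set that table 11 permits.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a reduced set of initiator codons, not the complete table 11 list.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces used to group bases in tens and uppercase the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS; an initiator codon in first position is read as M."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        codon = s[i:i + 3]
        aa = "M" if i == 0 and codon in START_CODONS else CODON_TABLE[codon]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
prefix = "GTGACGGAAC CGACCTCGCG GGCCCGCGGC"
print(translate(prefix))  # MTEPTSRARG
```

The output matches the first block of the protein sequence, including the leading M produced by the GTG start codon.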