Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_1095 |
Symbol | |
ID | 5707016 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 1232875 |
End bp | 1234005 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641270610 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001535994 |
Protein GI | 159036741 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.174275 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0671242 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCATAGCT ACTACAGCGA CGGGAAGCCC AGCGGCGAGA ACGTGATGGT GGACGCCGCC GCCGAAGCGC TGCGCCGCGC GGGCCATGCG GTGGACCTGG TCGGGCGGCG AACAGATCAC CTCAGGGGTG GTGCGGGCTA CACCGCCATG GCCGCGTTCA ACACCGCCAC GGGTGTCGGC CCGTCGCCGG TCGGAGAGTT GTGGGCCACG GAGGTCGACG TGGTTCACGT GCACAACCTG TTTCCGAACT TCGGCCGTGC CTGGCTGCGT ACGCTGTCCA AGCCCCTCGT CGTCACCCTG CACAACTACC GGCCGTTGTG CGCGGCCGCG ACCCTGTTCC GCGCCGGTGC CACCTGCACC GCCTGCCTGC GCGGCCCACT GCCCGGACTG CGTCACGGCT GCTACCGCGG CAGTCGGATG GCCACCCTGC CGCTGACGCT GGGCCAGCGG ACGCTGCGCC GCGACCTGCT GGGCCGGGCG GACCGGTTGA TCGTATTGTC CGACGTGCAG CGGGACTTCT ACGTGCGGGC CGGTGTGGAC GAGCGCAAGC TGGTCGTCGT CCCGAACTTC GTGCCGGACG AGCTGGACCG AGGGCCGGGG GCTGGCGGAG CGGGCTGGGT CTTTGCCGGC CGCCTGGACG ATGCCAAGGG CATCATCGAA CTCGTCGCCC GCTGGCCGCG CGAGCTGCGA CTGACAGTCT TCGGCGCCGG GCCGCTGCTG GCGAACGCCA AGGAACTCGC ACAGGGCAGG GACGTCACCT TCGCCGGGCA CCTGCCGCGC GAGGCCGTGA CCGCCGCCGT CCGGAACGCC CGCGGACTGG TGTTCCCGAG CCGGTGGCCG GACCCGTTCG GGCTGGCCTA CGCGGAGGCC ATGGCAGCCG GGACCCCGGT GCTGGCGCGC CGTCCCGCAG CGGCGGCCCA GTTCGTGCAC CAGCACCGTA CCGGGATGGC CGTCGACGAG GTCACCAGCG ACGCGATACG CGCCGCGCAC GAGCTGTTCC CCTCCTTGCG AACCGCCTGC CGCGCCGCGT ACGTCGCGCA CTATCGGGAG CCCGACCACG TTGCCGCGCT CACCGGGGTG TACCTGCAGG CCTGCGCGGC CCGCCGGGCC AAACGGGCAG CGACGGCATG A
|
Protein sequence | MHSYYSDGKP SGENVMVDAA AEALRRAGHA VDLVGRRTDH LRGGAGYTAM AAFNTATGVG PSPVGELWAT EVDVVHVHNL FPNFGRAWLR TLSKPLVVTL HNYRPLCAAA TLFRAGATCT ACLRGPLPGL RHGCYRGSRM ATLPLTLGQR TLRRDLLGRA DRLIVLSDVQ RDFYVRAGVD ERKLVVVPNF VPDELDRGPG AGGAGWVFAG RLDDAKGIIE LVARWPRELR LTVFGAGPLL ANAKELAQGR DVTFAGHLPR EAVTAAVRNA RGLVFPSRWP DPFGLAYAEA MAAGTPVLAR RPAAAAQFVH QHRTGMAVDE VTSDAIRAAH ELFPSLRTAC RAAYVAHYRE PDHVAALTGV YLQACAARRA KRAATA
|
| |