Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_2778 |
Symbol | |
ID | 5706170 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 3159586 |
End bp | 3160587 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641272234 |
Product | glycosyl transferase family protein |
Protein accession | YP_001537604 |
Protein GI | 159038351 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.2611 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00237783 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCTCCTGT CGGTAGTGGT GCCCTGCTTC AACGAGGAGG CCTCGGTCGA GCAGCTGCAC ACCGCGGTCA CCGCCGCGGT CGCCGAGCTC TCCGATGTGG AGATCGAGGT GGTCTATGTC GACGACGGCA GTGTCGACGG CACCCTCGCG GCACTGCGGC GACTCGCCGC CATCGACCCG GCGGTGCGAT ACACCTCACT GAGCCGCAAC TTCGGCAAGG AGGCGGCGAT GCTGGCCGGC CTGAAGCGGG CCACCGGGGA CGCCGTCGTG ATCATGGATG CGGACCTGCA ACACCCACCA CGGCTGCTAC CGGACATGGT GGCGTTGTTC CGGCAGGGTT TCGACCAGGT GATCGCCCGC CGCGACCGAC GCGGGGACCG GTTCCTGCGC ATGGTGGCCT CGCGGTCCTT CTACCGGATG GTGAACTGGT GGATCGACGT GCGGCTGTTG GATGGGGCCG GCGACTTCCG GTTGCTGTCC CGACTCGCTG TGGACGCGGT GCTGGCCATG CCGGAGTACA ACCGCTTTTC CAAGGGTTTG TTCTCCTGGA TCGGATTCCG GACCGTCGTG ATAACCCACC GCAACGAAAC CCGACGGACG GGCCGGAGCA GGTGGACGTT CGGCAACCTG TTCAACTACG CGTTCGACGG GCTGCTGTCG TTCAACAACC GGCCCCTCCG GCTGGCCATC TACGGCGGCC TGTTGCTCAC CCTGATCGCG CTGGGGTACA TGATCTGGGT GGTCGGGGAT GCCCTCAGCA AGGGGATCGA CGTACCCGGT TACACCACCA TCATCGTCAG TGTCATCGGT CTGGGCGGTA TCCAGATGGT GCTCCTCGGA GTGATCGGGG AGTACATCGG CCGGATCTAC TACGAGACCA AACGCCGGCC GCACTATCTG GTGCAGGAGA CGGATGACCC GGCCCCGGAC CCCCGGACGC CCCGCCCACG ACCGACCCCG CCGCCGGTCG ACGGCCGAGC CCGTCACCAC CGAGACCGAT AG
|
Protein sequence | MLLSVVVPCF NEEASVEQLH TAVTAAVAEL SDVEIEVVYV DDGSVDGTLA ALRRLAAIDP AVRYTSLSRN FGKEAAMLAG LKRATGDAVV IMDADLQHPP RLLPDMVALF RQGFDQVIAR RDRRGDRFLR MVASRSFYRM VNWWIDVRLL DGAGDFRLLS RLAVDAVLAM PEYNRFSKGL FSWIGFRTVV ITHRNETRRT GRSRWTFGNL FNYAFDGLLS FNNRPLRLAI YGGLLLTLIA LGYMIWVVGD ALSKGIDVPG YTTIIVSVIG LGGIQMVLLG VIGEYIGRIY YETKRRPHYL VQETDDPAPD PRTPRPRPTP PPVDGRARHH RDR
|
| |