Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3514 |
Symbol | |
ID | 5704642 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4054204 |
End bp | 4055334 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641272941 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001538307 |
Protein GI | 159039054 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00713418 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCGAGA GCCGAACGTT GCTCGTCACG AACGACTTCC CGCCCCGGCC CGGGGGGATT CAGTCCTTCG TGCACGGCCT CGCGGTGCGC CAGCCCCCGG GGTCGGTGGT GGTCTACGCC TCGCGCTGGC GCGGCGCCGA GGAATTCGAC GCCGCCCAGC CCTTCACGGT GGTGCGGGAG AACACCCGGG TGCTGCTGCC CACCCCGCTG GTGGCCCGGC GGGCCGCCCG GCTGGCCCGG GAGTACGACT GCGACACGGT GTGGTTCGGC GCGGCCGCGC CGCTCGGGCT GCTCGCCGCG GGGCTGCGGC GACGGGCCGG GATCCGTTGG ATGGTGGCAC AGACCCACGG GCATGAGGCC GGCTGGGCGG CCCTGCCCGG TGCCCGGACC GCGTTGCATC GCATCGGCCG GGCCGTCGAC GTGACGACCT ACCTGGGGGA GTACACGCGG CTCCGGCTGG ACCGGGCGTT GCGCGGGGCG ACCGAGCTGC GCCGGCTCGC ACCCGGCGTC GACCTCGACA CCTACCACCC GGCGGTCGAC GGGGAGTCGG TGCGGGTGCG GCTCGGGCTG GCCGACCGGC CGGTGGTGGT CTGCGTGTCC CGGCTGGTGC CCCGGAAGGG ACAGGACATG CTGATCCGGG CGTTGCCGGG GATCCGGCAC CGGGTTCCGG ACGCGGCGTT GCTGATCGTC GGGGGTGGAC CATACCGAAG CGCGCTGGGG AAGCTGGCCC GGCAGGTCGG CGTCGAGCGT GATGTGGTGT TCACCGGCAC CGTGCCGGCA GCCGAGCTGC CCGCGCACTA CGCGGCCGGT GATGTGTACG CGATGCCCTG TCGCACCCGC AACCGGGGCT TGGACGTGGA GGGGCTGGGC ATCGTCTACC TGGAGGCATC CGCGACTGGC CTGCCCGTGG TGGCCGGTGA CTCCGGTGGC GCGCCGGACG CCGTGCGCGA CGGTGAGACC GGTTTCGTGG TGCGCGGGCG CGACGTGGCC CAGCTCGTCG ACCGGGTGGC GACGCTGCTG GCCGACCGGG ACCTTGCCCG CCAGTTCGGT GCCACCGGCC GGGCCTGGGT CGAACGTGAG TGGCGGTGGG AGACCCAGGC CACCCGCATG GCCGACCTCC TTCGCCCCTG A
|
Protein sequence | MSESRTLLVT NDFPPRPGGI QSFVHGLAVR QPPGSVVVYA SRWRGAEEFD AAQPFTVVRE NTRVLLPTPL VARRAARLAR EYDCDTVWFG AAAPLGLLAA GLRRRAGIRW MVAQTHGHEA GWAALPGART ALHRIGRAVD VTTYLGEYTR LRLDRALRGA TELRRLAPGV DLDTYHPAVD GESVRVRLGL ADRPVVVCVS RLVPRKGQDM LIRALPGIRH RVPDAALLIV GGGPYRSALG KLARQVGVER DVVFTGTVPA AELPAHYAAG DVYAMPCRTR NRGLDVEGLG IVYLEASATG LPVVAGDSGG APDAVRDGET GFVVRGRDVA QLVDRVATLL ADRDLARQFG ATGRAWVERE WRWETQATRM ADLLRP
|
| |