Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_0398 |
Symbol | |
ID | 5705657 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 458617 |
End bp | 459963 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641269923 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001535318 |
Protein GI | 159036065 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | [TIGR03449] UDP-N-acetylglucosamine: 1L-myo-inositol-1-phosphate 1-alpha-D-N-acetylglucosaminyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.348364 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00367672 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGCGGAAC AGCACACCGG TGTCGGTCGT CAGCGAGGTG CCCGTCCGTG GCCCCGACCC CGCCGCGTCG CGACACTCTC CGTGCACACG TCGCCGCTGC ACCAGCCCGG CACCGGTGAT GCCGGGGGGA TGAACGTCTA CATCCTCGAA GTCGCACGGC GGCTGGCCGA AGCGGACGTG GAGGTCGAGA TCTTCACCCG GGCGACCTCA GCCGATGTGC CACCGGTGGT CGAGATGATG CCGGGCGTGC ACGTCCGGAA CATCATCTCC GGCCCGCTGG GTGGGTTGAC CAAGGAGGAA CTGCCCGGCC AGCTCTGCGC GTTCACGGCG GGGGTGCTGC GGGCCGAGGC GTCCCGAGCC GCCGGGCACT ACGACCTCAT CCACTCGCAC TACTGGCTGT CCGGGCAGGT TGGCTGGCTG GCCAAGGAGC GTTGGGGAGT TCCGCTGGTG CACACCGCGC ACACCCTCGC CAAGGTCAAG AACGCACAGC TCGCCGCCGG CGACCGGCCG GAGCCCAAGG CTCGGGTGAT CGGCGAGGAG CAGGTGGTGG CCGAGGCCGA CCGGCTGGTC GCCAACACCA AGACCGAGGC CGGCGACCTG ATCGACCGGT ACGACGCCGA CCCGACCCGC GTCGAGGTGG TCGAGCCGGG TGTGGACCTG GCCCGGTTCA CCCCCGCGGC TGGCGACCGA TCCCGGGCGC AGGCCCTCGC CCGCCGTCGG TTGGGCCTGC CCGAGCGTGG GTACGTGGTG GCGTTCGTCG GCCGGGTCCA GCCGCTCAAG GCACCGGATG TGCTGATCCG CGCGGCGGCG GCACTGCGTC AGCGGGATCC GGCGCTCGCC GAGGAACTGA CCGTGGTCGT CTGCGGTGGC CCCAGCGGTA GCGGGCTCGA CCGGCCGACC CACCTGATCG AGCTGGCCGC CTCGTTGGGT GTCACCGACA GCGTACGGTT TCTGCCGCCG CAGACCGGCG ACGACCTGCC CGCCCTGTAC CGCGCGGCCG ACCTGGTAGC GGTCCCGTCC TACAACGAGA GCTTCGGGCT GGTCGCGTTG GAGGCGCAGG CATGCGGCAC GCCGGTGGTG GCGGCCGCTG TCGGCGGCTT GGTCACCGCG GTACGTGACC AGGTCAGCGG TGTCCTCGTC GACGGCCATG ACCCGGCCGT GTGGGCCCGT ACGCTGAGCC GTCTGCTGCC GGACACCGGA CTGCGTGCGA CGTTGGCCCA GGGCGCGCGA CGCCATGCCT GCAACTTCTC CTGGGACCGG ACGGTCAGCG GCCTGTTGGA GGTCTACGGC GAGGCGGTCG CCGCGTACGG ACCCCAATCG TCCGAGCTCG CCACCTGTTC TTGTTGA
|
Protein sequence | MAEQHTGVGR QRGARPWPRP RRVATLSVHT SPLHQPGTGD AGGMNVYILE VARRLAEADV EVEIFTRATS ADVPPVVEMM PGVHVRNIIS GPLGGLTKEE LPGQLCAFTA GVLRAEASRA AGHYDLIHSH YWLSGQVGWL AKERWGVPLV HTAHTLAKVK NAQLAAGDRP EPKARVIGEE QVVAEADRLV ANTKTEAGDL IDRYDADPTR VEVVEPGVDL ARFTPAAGDR SRAQALARRR LGLPERGYVV AFVGRVQPLK APDVLIRAAA ALRQRDPALA EELTVVVCGG PSGSGLDRPT HLIELAASLG VTDSVRFLPP QTGDDLPALY RAADLVAVPS YNESFGLVAL EAQACGTPVV AAAVGGLVTA VRDQVSGVLV DGHDPAVWAR TLSRLLPDTG LRATLAQGAR RHACNFSWDR TVSGLLEVYG EAVAAYGPQS SELATCSC
|
| |