Gene Information |
Locus tag | Sare_5004 |
Symbol | |
ID | 5705744 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 5672190 |
End bp | 5673497 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641274397 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001539738 |
Protein GI | 159040485 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000372843 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAATCG TGGTGGCACA CAACCGGTAC CGGGAGGCTC AGCCGTCCGG CGAGAACACC ATCGTCGACG GGGAGATCAC CCAGCTGACC GCGGCCGGGG TCGAAGTGCT GCCGTTCCTG CGTAGCTCCG ACGAGATCGG GTCAATGTCC ACGCCCGCGA AGGCGCTGTT GCCGATCTCC CCGCTCTACG CTCCCCGGGC CCAGCGGGAG CTGGGTCGCC TGCTCGACGA GCACCGGCCG GACGTGCTGC ACCTGCACAA CCCGTACCCG CTGCTCTCCC CCTGGGTGGT GCGGACCGCG CACCGTCATG GCGTGCCGGT GGTCCAGACG GTGCACAACT ACCGGCAGGT CTGCTCGTCC GGGCTGTACT TTCGGGATGG GATGATCTGC CAGGACTGCC GGGGGCGGGT ACTGGGCGTA CCAGCGATCG TGCACCGCTG CTATCGGGGG TCGCGGGCGC AGAGCGCACT GATGGCGACG ACGCTTGCCG CACACCGAGG CACCTGGCGC TCGGTGGACC GATTCATCGC GCTCACCACG GCGATCGCCG AGCACCTCGG GGACTACGGC ATCCCGCAGC AGCGGGTCGT GGTCAAGCCG AACGCGGTCC CCGACCCAGG TGCTCCGGCG CCGCTGGGCA CCGGCTTTCT CTTCCTCGGC CGGCTCACCC CGGAGAAGGG GCTGGACCTG CTGCTCGACG CCTGGCGCCG GCATCCGGAC GGTGCGCTCG GCCCGCTCCG CATCGCCGGT GACGGCGAGT TACGACCACT GGTGCAGCAG GCCGCGGAGC AGCGGGCTGA TGTGACCTTC CTCGGCCCGC TGGACCGGGA CGGGGTCCAC GCCGCACTGG TGGCCAGCGC GGTGGTGCTG GCCACCTCCA CCTGGCACGA CGTGCTGCCG ACCGTGATCA TTGAGGCGTT GGCCGCCGGC CGGCCGGTGC TCGGCACCGC CCTCGGGGGT ATCCCGTACC TGGTGGGCGC CGATACCCCC CGTGAACCCG CCGGTACCGG ACCGGCCGAT GTTGCATCGG CTGCCGCCGC GGCGACCGGC CCAGGGTCAC CGCCCCCGGT GGCGGCCACC GCCGTGCCGG TCGGGATCCT GGTCGGGGAG GCGGGCTGGG TGGTGCCGCC GGATCCGGCC GCACTGGCCG CCGCGCTGCC GGTGGCCGCC GCTGGTGCCG CCACCCGGGC ACCGGCCGCG CGGGCCCGCT ACGAGCGCAC CTTCCATCCG GACGTGGTCA CGCGGCGCCT GATCGAGATC TACACCGACA CTGCCCGCAC GGCGGGACCG GGCCGGTCAA CGCAGTGA
|
Protein sequence | MKIVVAHNRY REAQPSGENT IVDGEITQLT AAGVEVLPFL RSSDEIGSMS TPAKALLPIS PLYAPRAQRE LGRLLDEHRP DVLHLHNPYP LLSPWVVRTA HRHGVPVVQT VHNYRQVCSS GLYFRDGMIC QDCRGRVLGV PAIVHRCYRG SRAQSALMAT TLAAHRGTWR SVDRFIALTT AIAEHLGDYG IPQQRVVVKP NAVPDPGAPA PLGTGFLFLG RLTPEKGLDL LLDAWRRHPD GALGPLRIAG DGELRPLVQQ AAEQRADVTF LGPLDRDGVH AALVASAVVL ATSTWHDVLP TVIIEALAAG RPVLGTALGG IPYLVGADTP REPAGTGPAD VASAAAAATG PGSPPPVAAT AVPVGILVGE AGWVVPPDPA ALAAALPVAA AGAATRAPAA RARYERTFHP DVVTRRLIEI YTDTARTAGP GRSTQ
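The reported lengths can be cross-checked against the coordinates in the Gene Information table. The sketch below is a minimal consistency check, not part of the original record. It assumes that the start and end coordinates are 1-based and inclusive, and that the gene sequence is listed in coding orientation (already reverse-complemented for the minus strand). The variable names are illustrative only.

```python
# Values copied from the Gene Information table in this record.
start_bp = 5672190
end_bp = 5673497
reported_gene_length = 1308      # bp
reported_protein_length = 435    # aa

# Assumption: coordinates are 1-based and inclusive, hence the +1.
gene_length = end_bp - start_bp + 1

# A CDS is a whole number of codons; the final codon is the stop codon
# and does not contribute a residue to the protein.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

# First and last 10-mer/8-mer blocks of the gene sequence as listed above.
# Assumption: the sequence is shown in coding orientation.
first_codon = "ATGAAAATCG"[:3]
last_codon = "CGCAGTGA"[-3:]

print("gene length:", gene_length, "matches:", gene_length == reported_gene_length)
print("protein length:", protein_length, "matches:", protein_length == reported_protein_length)
print("start codon:", first_codon, "stop codon:", last_codon)
```

Under these assumptions the coordinates, the gene length, and the protein length agree, and the listed sequence begins with ATG and ends with TGA, which is a stop codon in translation table 11.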