Gene Information |
Locus tag | Sare_1689 |
Symbol | |
ID | 5705224 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 1949413 |
End bp | 1951035 |
Gene Length | 1623 bp |
Protein Length | 540 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641271192 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001536567 |
Protein GI | 159037314 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
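The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and an intact CDS whose final codon is a stop codon (the gene sequence in this record ends in TGA). The function names `cds_length_bp` and `protein_length_aa` are hypothetical helpers introduced for illustration, not part of any database API.

```python
def cds_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_bp: int) -> int:
    """Expected protein length for an unspliced CDS that ends in a stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon (assumption).
    return cds_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
gene_length = cds_length_bp(1949413, 1951035)
protein_length = protein_length_aa(gene_length)
print(gene_length, protein_length)
```

Under these assumptions the results agree with the Gene Length (1623 bp) and Protein Length (540 aa) fields.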
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.224017 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00115774 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGGTTCA ACGGCGATGT TCAGGTCGCG GGCACCCGGG GGCGGATCGT CATGCTCGTC GACAATGGGG TGGCCGGAGA CTCCCGGGTG CAAAAGGCCG CCCGGTCCGC CGCTCGGTCC GGCTGGGACG TGACACTACT CGGTCGGGCA CCGGGAAACG ATCCACGATC CTGGCAGCTC GGCCCTGCCG AGGTGCGGCT GCTCGCCATG CCGGACCCGC TGGCCCGCCG TCGGCACGAG TTTCGCAGGG CCTGGTTGCG CTGGCCACTG GCCTACCCGC CCAGCGGGAT CGCGGCGCAC CGACGACAGG CGGTGAAGGC GTGGCAAGCG GACTTGGCCG TGCGCCGGGC GCAGCTCGCG GTGGCGGACC CAGCAACGCC CCGGCTCGGC CTGCGTTCGC GCGCACTACA CGCCGAGGAA CTCGCCGCCC GGGTAGCGGG GACGTGGGTC TCGGCGCGGT ACTGGATGCT CACCCGGGCC CGTACGCGGC GCCGGTTCCG CAACCCCTGG GATCGGGCCT ACACCCTGTT CTGGCAGGCG GTCAAGGGCG ATGGCGCCTG GCGTAGGCTC GAGCCGAGTC TGTGGGACTA CGAGCTGGCC TACGGCCCGG TGCTCGACGC GTTGCAGCCG GACCTGATTC ACGCGAACGA CTTTCGGATG CTCGGCGTTG GTGCTCGCGC CAAGATCCGT GCCGCCGCCC GGGGGCGTGA GATCAAACTC GTGTGGGACG CGCACGAGTT CCTGCCGGGG GTTCGGCCCT GGCGGGACGA CGCCCGATGG CTGCCCGCGC ACCGTGCGCA CGAGCGGGAG TACGCGCCGT ACGCGGATGC GGTGGTGACG GTATCGCCGG CGTTGGCGGA GCTGCTTCGC GACGAGCACA AGCTGGCTGG GACGCCGGCG GTGGTGCTCA ATGCGCCGGA CATCGGGACC GTCCCGCCGG CCGACGAGGA GGCGGCACCG GATCTCCGGG CCCGCTGCGG CGTCGACCCG GACACCCCCC TGCTCGTCTA CAGTGGCCTC GCATCGCCGC AGCGCGGCCT GGACATCATG GTCGAGGCGT TGCCGCGCCT GCCCGGTGTG CACGCGGCAC TGGTCGTCAA TAAGCCGGAC GGCCCGCATG CAGCCGAGGT ACGGGCACGG GCCGCCCGAC TCGGTGTCGC CGACCGCGTG CACGTCCTGC CGTACGTGGC GCACTGGCAG GTGGTGCCGT TCCTGGCTGG TGCGGACGCC GGGATCATCC CCATCCACCA CTGGCCCAAC CATGAGATCG CGCTGATCAC CAAGTTTTTC GAGTATTCGC ACGCGCGGTT GCCGTTGGTG GTCAGCGACG TGAAGACGAT GGCCGACACG GTCCGTTCTA CCGGCCAGGG TGAGGTGTTC CGGGCTGAGG ACGTGGCCGA CTACGTCCGC GCCGTTAGCG CGGTGCTGGC CGAACCGGAT CGTTACCGGG CCGCGTACGA CCGGCCGGGC CTGCTGGAGA GCTGGACCTG GGAGGCGCAG GCCGAGGTGC TGGACGCCGT CTACCGCCGG CTGTTGTCCG GGAGTGCGTC CGCGACGGGC CGAGCCGGCG AGCCGGTGGC GGCGCCGGAA GGCCCGCTGC CCGCAGTCGA GGTCGTCCCA TGA
|
Protein sequence | MGFNGDVQVA GTRGRIVMLV DNGVAGDSRV QKAARSAARS GWDVTLLGRA PGNDPRSWQL GPAEVRLLAM PDPLARRRHE FRRAWLRWPL AYPPSGIAAH RRQAVKAWQA DLAVRRAQLA VADPATPRLG LRSRALHAEE LAARVAGTWV SARYWMLTRA RTRRRFRNPW DRAYTLFWQA VKGDGAWRRL EPSLWDYELA YGPVLDALQP DLIHANDFRM LGVGARAKIR AAARGREIKL VWDAHEFLPG VRPWRDDARW LPAHRAHERE YAPYADAVVT VSPALAELLR DEHKLAGTPA VVLNAPDIGT VPPADEEAAP DLRARCGVDP DTPLLVYSGL ASPQRGLDIM VEALPRLPGV HAALVVNKPD GPHAAEVRAR AARLGVADRV HVLPYVAHWQ VVPFLAGADA GIIPIHHWPN HEIALITKFF EYSHARLPLV VSDVKTMADT VRSTGQGEVF RAEDVADYVR AVSAVLAEPD RYRAAYDRPG LLESWTWEAQ AEVLDAVYRR LLSGSASATG RAGEPVAAPE GPLPAVEVVP