Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_2933 |
Symbol | |
ID | 5705238 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 3321892 |
End bp | 3323310 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641272382 |
Product | hypothetical protein |
Protein accession | YP_001537750 |
Protein GI | 159038497 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.697027 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.639426 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCATCC TGTTCACAGT TTTTCCCTCG TACGTGCACC TGTATCCGCT GGCACCAGTG GCGTGGGCGC TGCAGAGCGT CGGACACGAG GTCCGGGTCG CCTCCTCCGG AAACTTCGCG AGGGCAATCT CCAGCGTCGG CCTCACGCCG CTGTCGCTGG GTGATCCGGA TGCTGTCGAA GCACGCCTGC GGCCGGGGGC CAAGCAGCCA CCGAATCCAC AGGAGGTCCT CGCGTACGCC GACCTCATGG GGCTGGATTC CGCGGAACGC GAGAACTGGA TCGTCTTCTA CCAGTGGCTG TTGAACCCAG TTTCGGACTA TGTGCGCGTG GATCAGCCCG AGGCCGCACA CCTTGTTCGG TTCGCGCAAC GGTGGCAGCC AGACCTCGTG TTGTGGGATC CTATCTTTCC CGCCGGTGCG GTGGCGGCTC GCGCGTGCGG TGCGGCCCAC GGTCGCTTCC TCGGGGCGGC CCTCGACTAC TTCATGTACG GCACGGAGCG ACTCGAAGCG GCCCGTGACA AAGTACGCAA TGCCGGTCTA TCGGACAATC CGCTCGCGGA CCTCATCCGC CCCTTGGCTG ATCACCACGA CGTTGACGTT GACGATGAGC TCCTCAGGGG GCAGTGGACC GTGGATCCCA TGCCGGAAGG CGTCAGCCTC TCCACGGGCG GTCACAAGGT TCCGGTTCGT TGGGTGCCCT ACGTGGGTGG TGAACCGTGT CAGGAGTGGG TGCTCGATGG GCCCACGAGC CGACCGCGGG TCGTGCTGTC CCTTGGTGAG TCGGCCCGAC GGTATGTTGC CGGGGACTGG GGGCGCACGC CCAAACTGCT GGACGCTCTG GCGGGGATGG ATGTCGACGT CATCGCAACC CTGAATGAGC GCCAGCTACA GGGCATCTCG ACTGTTCCGG ACAACGTCCG CGTCATCGAA TGGGTGCCGC TGACGCAGCT TATGCCTACC AGCTCGTTGC TGATACATCA TGGCGGTACC GGCACGACGA TGTCGGCCCT GGCCAACCGC GTGCCTCAGC TGGTCTGCGA CACAGATGAG TCGTTCCTGA TGGGTCCGGC CGACGTGGTG CCGCGACTCG GTGATGCCGG GGTCTACCGC GCTGGTCGTG AGTTCGGAGT GACCGATGAC GACGCCGACG GCGACGCCGA GCAGGAGGGC TGGGTGATCC CTGGTCGTCA CCTCATGGCG CCACCGTGGT CGGGCGTGAT GACTCAGTAC GGTGCTGGTG AGCGACTCAA TCATCAGGTG ATGTCCGGCA CCGAGATCCG TGACCGGATC ACGCACGTGC TCTCCGAGCC ATCGTTCGCA GCTGGTGCGC GCGAGCTGTA TGAGGCGTGG ATGGCCAGGC CAAGCCCGAG CGACATCGTT TCGACTCTGG AGTCCTTGAC CGCCGAGCAC CGCCGTTGA
|
Protein sequence | MRILFTVFPS YVHLYPLAPV AWALQSVGHE VRVASSGNFA RAISSVGLTP LSLGDPDAVE ARLRPGAKQP PNPQEVLAYA DLMGLDSAER ENWIVFYQWL LNPVSDYVRV DQPEAAHLVR FAQRWQPDLV LWDPIFPAGA VAARACGAAH GRFLGAALDY FMYGTERLEA ARDKVRNAGL SDNPLADLIR PLADHHDVDV DDELLRGQWT VDPMPEGVSL STGGHKVPVR WVPYVGGEPC QEWVLDGPTS RPRVVLSLGE SARRYVAGDW GRTPKLLDAL AGMDVDVIAT LNERQLQGIS TVPDNVRVIE WVPLTQLMPT SSLLIHHGGT GTTMSALANR VPQLVCDTDE SFLMGPADVV PRLGDAGVYR AGREFGVTDD DADGDAEQEG WVIPGRHLMA PPWSGVMTQY GAGERLNHQV MSGTEIRDRI THVLSEPSFA AGARELYEAW MARPSPSDIV STLESLTAEH RR
|
| |