Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3558 |
Symbol | |
ID | 5705051 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 4106067 |
End bp | 4106978 |
Gene Length | 912 bp |
Protein Length | 303 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641272985 |
Product | hypothetical protein |
Protein accession | YP_001538351 |
Protein GI | 159039098 |
COG category | [R] General function prediction only |
COG ID | [COG1090] Predicted nucleoside-diphosphate sugar epimerase |
TIGRFAM ID | [TIGR01777] conserved hypothetical protein TIGR01777 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.69545 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0104917 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGATTC TGATGGCCGG CGCGTCCGGC TTCCTCGGCA CCCGGCTGGC TGACCGGCTC ACCGCGGACG GGCACCACGT GACCAGGCTG GTCCGCCGGC CGCCGCGCGG CGCCGACGAG CGACAGTGGA ACCCGACCGC GGCGCAGCTC GACCCGGCGG TGGTGGCGGC GGCGGACGCC GTGGTCAACC TGGCCGGAGC CGGCGTGGGC GACCGCCGGT GGACCGACGA GTACCGGCGG ATCATCCGGT CCAGCCGAGT GGACACCACC ACCACGCTGG CGATCACGAT CGCCGGCCTA CCGGCCGCGG ACCGCCCGCA GGCACTGATC AACTCGTCGG CGGTCGGGTG GTACGGCAAC ACCGGCGACC GGGCCGTCGA GGAGGACGCG CCGGCCGGCG AGGGATTCCT GGCCGATGTC TGCCGGGTGT GGGAAGCCTC GACCCGGCCG GCCGAGGACG CCGGGGTACG CGTGGTACGG CTGCGAACCG GGTTGCCGCT GCACCGCGAC GGCGGGCTAC TCAAGCCACA GTTGCTGCCG TTCCGGCTCG GCATCGCCGG CCGGTTGGGC AGCGGCCGGC AGTGGCTGCC GTGGATCGCG ATGGTGGACT GGCTCGACGC CACCGCCCTC ACGATCGACC GGGCCGACCT CGCCGGCCCG GTCAACGCGG TCGGGCCGGC ACCGGTGACC AACGCCGAGT TCACCAGGGA GTTGGCCCGG CAGCTGCACC GCCCGGCGAT CATCCCGATC CCGGCGCTGG CGCTGAAGGT GGCTCTTGGC GGCTTCGCGC AGGAGGCACT GACCAGCGCC CGGGTCCTAC CCGGGGTGCT CACCCGGGCC GGCTTCGACT ACCGCCACCC GGATCTGTCA GGCGCCCTGA GAGCAGCACT CGTGGCACAG CCGGACCGCT GA
|
Protein sequence | MRILMAGASG FLGTRLADRL TADGHHVTRL VRRPPRGADE RQWNPTAAQL DPAVVAAADA VVNLAGAGVG DRRWTDEYRR IIRSSRVDTT TTLAITIAGL PAADRPQALI NSSAVGWYGN TGDRAVEEDA PAGEGFLADV CRVWEASTRP AEDAGVRVVR LRTGLPLHRD GGLLKPQLLP FRLGIAGRLG SGRQWLPWIA MVDWLDATAL TIDRADLAGP VNAVGPAPVT NAEFTRELAR QLHRPAIIPI PALALKVALG GFAQEALTSA RVLPGVLTRA GFDYRHPDLS GALRAALVAQ PDR
|
| |