Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_2302 |
Symbol | |
ID | 5703831 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 2645071 |
End bp | 2646369 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641271780 |
Product | cupin 4 family protein |
Protein accession | YP_001537151 |
Protein GI | 159037898 |
COG category | [S] Function unknown |
COG ID | [COG2850] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0977306 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00217301 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGTCCCATC TCGACTCGCC GGGCGGTCAC GGCCGCCCGG CGTCCTCGAC CGTCACCACC GCGCCCCGGG CACCCTCCGG GCGTACCACG TCGGCGCTGG CCCGGTGTGT CTCGGTTGAG CCGGCGAAGT TCGCCGCCGC GCACTGGGGA CGGACACCGC TGCTGTCCCG CGCCGACGAG TTGTCCAACC CGGACGGATT CCTCGACCTG CTGAGCCCGG CTGATGCCGA CGAACTGCTC AGTCGGCGCG GATTGCGTAC CCCCTTTCTC AGGGTGGCGC AGGACGGCGC ACTGGTGCCG GCAGCCCGCT ACACCGGCGG CGGGGGTGCG GGCGCCGAGA TCGCCGACCA GGTCCTCGAC GAGAGGGTCC TGGACCTGTA CGCCGGCGGC GCCACCCTGG TGTTGCAGGG CCTGCACCGC ACCTGGCCCG CGCTTGTCGA CTTCGCCCGG GACCTCGGCC TGGACGTCGG GCAGCCGATG CAGATCAACG CCTACCTGAC CCCCGCCGGC AGTCAGGGCT TCGCCACCCA CTACGACACC CACGACGTGT TCGTTCTCCA GGTCGATGGG CGGAAGCACT GGCGGATCCA TCCCCCGGTG CTACCCGATC CACTGGAGCG GCAGCCGTGG GGCGGTCGGG CCGACGAGGT CTCCGCGACC GCGACGGGTG CGCCGGCCCT CGACGTGACG CTCGCTCCTG GCGACGCGCT GTACCTTCCC CGTGGCTGGC TGCACAGCGC GCAGGCCCAG GAGCACAGCT CCCTGCATCT GACCGTGGGA GTCCGGGCGC TGACCCGGTA CACGCTGGTC GAGGAACTGC TCGCCCTCGC CGCTGAGGAC CAGCGGCTGC GGGCCACCCT CCCGTTCGGC ATCGACGTGT CCGACCCGGA CGCGGTCGAG CCCGAGCTCA CCGAAACGGT GGAGGTGCTG CGGGACTGGC TCCTGCACGT GGATCCGACA ACGCTGGCCG CTCGGCTGAG ACAGCGCGCA TGGCCGGCGG CCCGACCGGC TCCGCTGCAT CCCCTCGCCC AGGCAGCTGC GTTGGACACG CTCGGCCCGG ACAGTCGCGT CATTCCCCGA CCGGGTCTAC GGTGGCAACT CGCCCCAGCG GGGCAGCGGG TGGCGCTGCG GGTCTTCGAC CGTACGATCA CCCTGCCGGA GGAGTGTGCC CCGGCCCTAC ACGCGTTGCT GTCCGGTGAG GTCACCCGGG TCGGTGACCT GCCCGGTCTG GCCGATGACA CCGACCGGGT CACCCTCGTG CGCCGCCTGC TCCGTGAGGC GGTCGCCGTA CCCGCCTGA
|
Protein sequence | MSHLDSPGGH GRPASSTVTT APRAPSGRTT SALARCVSVE PAKFAAAHWG RTPLLSRADE LSNPDGFLDL LSPADADELL SRRGLRTPFL RVAQDGALVP AARYTGGGGA GAEIADQVLD ERVLDLYAGG ATLVLQGLHR TWPALVDFAR DLGLDVGQPM QINAYLTPAG SQGFATHYDT HDVFVLQVDG RKHWRIHPPV LPDPLERQPW GGRADEVSAT ATGAPALDVT LAPGDALYLP RGWLHSAQAQ EHSSLHLTVG VRALTRYTLV EELLALAAED QRLRATLPFG IDVSDPDAVE PELTETVEVL RDWLLHVDPT TLAARLRQRA WPAARPAPLH PLAQAAALDT LGPDSRVIPR PGLRWQLAPA GQRVALRVFD RTITLPEECA PALHALLSGE VTRVGDLPGL ADDTDRVTLV RRLLREAVAV PA
|
| |