Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_1179 |
Symbol | |
ID | 5703530 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 1326711 |
End bp | 1328108 |
Gene Length | 1398 bp |
Protein Length | 465 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641270697 |
Product | SCP-like extracellular |
Protein accession | YP_001536078 |
Protein GI | 159036825 |
COG category | [S] Function unknown |
COG ID | [COG2340] Uncharacterized protein with SCP/PR1 domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.249096 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000057053 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGTACGGCT GGAACGACCC GAGGAACCCG GACGGTACCC ACCGGCAGCC CGAACCGACA GCCAACCAAC CCGCCTGGCT GACCGACCGA CCAGAGCCAC GCTCCTCGTA CCTGTTCGGC GACGAACCGG AGCAGCCGGC CGACGAACCG AAACGACCCA CCGACAGCTG GGGGCAACCG ACCGACGGGT CGACCCGGCA CACCGACAGC TGGCACCGGC CCACCGACGG GTGGAGCCAA CAGCAGCCAG CCACACACCG GCAGCCGGAG GCCGGAACAC AGAGCTGGGG CAGCGCGGAC GACCCGTACC GCGGACCCGC GGGTGGTAGG TACCCCGAGC AGCCCACGCC GAGCTGGGGA GCCAACCCCA CCTGGCAGCA GGGCGGGCCC ACCGACCCCT GGCGGCAGGA ACACCCGAGC GACGGGCACC GCCAGCAGCC CGGGGGCTGG CAAAACGAAC CCACCGGTGA GTGGCACCGC GGTGCCACTC CCAGCGGTGA GTGGCACCGC GCGGCCACCC CCACCGGCGG ACCCGAGCAC ACCGACCGGT GGCGGCCCCA TCAGACGACC GGCGGATACG CCGACACGCC GACGACCCGT ATGCCACCCG TCGTGGGCCC ATCGCCGACT GCCGACGTGC CGACCGGGGA TGGACCGCCC GCCGGGCGCC GGAACCGGCG GCCCCTGTTC ATCGGGGGCG CGGCGGCGGC GGCCACACTG GTGGTGAGCC TCGGGGTCGG TGCCGTCACC CTCATCGGTG GCGACGACAC CAGGCCCACC TCGGCCGCCG AGGACATCGT GGCAACGAAC CCGACATTCG GCGAAAGCTC AGCGCCGGCC ACCCCCACCA CCACGGCCAG CCCCAGCGCG ACCCCGGCGT CACCGTCACC GTCGGCCACA CCGAGTCGGA AGCCGGCACC GGCACCGTCG CGCTCCACCG CCGCGTCCCG TCCCACTCCG GCCCGGACCA CCACGTCCCC CGCCGGCAGC AACTCCACGC CCCCCAGCGG CAACATCAGT GCCGACGCCG CCAAGGTGGT CAGCCTGGTC AACGCCGAAC GCGCGAAGGC CGGCTGCAAG GCTCTGAGCG TCGACGACAA ACTGATGACC GCCGCCCAGC GGCACAGCCA GGACCAGGCC GACAACAAGA AGATGACGCA CACGGGCAGC AACGGCAGCA CCCTCGGCGA CCGGGTCAAG GCCGTCGGCT ACCGGTTCCG CGCCGCCGGG GAGAACGTCG CCTGGAACCA GCAGTCACCC GAGGCCGTGA TGAACGCGTG GATGAACAGC TCCGGCCACC GGGCGAACAT CCTGAACTGC TCCTTCACCG AAATCGGGGT CGGCATCGCG AGCAGCAACG GACCGTACTG GACACAGGTC TTCGCCGCGC CGCTCTGA
|
Protein sequence | MYGWNDPRNP DGTHRQPEPT ANQPAWLTDR PEPRSSYLFG DEPEQPADEP KRPTDSWGQP TDGSTRHTDS WHRPTDGWSQ QQPATHRQPE AGTQSWGSAD DPYRGPAGGR YPEQPTPSWG ANPTWQQGGP TDPWRQEHPS DGHRQQPGGW QNEPTGEWHR GATPSGEWHR AATPTGGPEH TDRWRPHQTT GGYADTPTTR MPPVVGPSPT ADVPTGDGPP AGRRNRRPLF IGGAAAAATL VVSLGVGAVT LIGGDDTRPT SAAEDIVATN PTFGESSAPA TPTTTASPSA TPASPSPSAT PSRKPAPAPS RSTAASRPTP ARTTTSPAGS NSTPPSGNIS ADAAKVVSLV NAERAKAGCK ALSVDDKLMT AAQRHSQDQA DNKKMTHTGS NGSTLGDRVK AVGYRFRAAG ENVAWNQQSP EAVMNAWMNS SGHRANILNC SFTEIGVGIA SSNGPYWTQV FAAPL
|
| |