Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_1924 |
Symbol | |
ID | 5708277 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 2220394 |
End bp | 2222244 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641271429 |
Product | hypothetical protein |
Protein accession | YP_001536800 |
Protein GI | 159037547 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.348588 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00176923 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGCCGTC ATGGCTGGCC GAAATCAACG GGCAGCAAAG CCCGCTACGG CCCCGCGGCG ATGGAGAACA TGCGTCAAGC CGTCGCCGCT GGCGGGTGGC GCCCGACACC GGAACTGATC GAGCAGCTCA TGGGCTTTCC ACCCGGCTGG ACGTCGATCG ACTCCAAGCC CTCGGCAACG CCGTCGTCCC CCACGTCGCC GAACACATCG GCCGCATCAT CCTTGACCAC CACGAAAGCC AGTGACCTCA TGACACCAGC ACGACTGCAC CTGGTCACCG ACCAGCCGCA ACCCACCACC AGCAAGGAAA CGACAGGTGG CCCACTCTGG GATGTTCCAG TGCCCCTAAC CGGGCCGTCC ACCGCGCCGC CGCCGTTCCC GGCCGACGTG TTTCCGACCT GGCTGCGCGA CATGGTGACC GGCGTCGCCC GGTTCACCCA AACCGACCCG GCCATGGCCG GCACCCTCGC CGTTGCCGTG CTCTCTGCCT GTGCGGGCGG CCGGTTGGAG GTCGAACCGG TGCCCGGATG GCGGGAGCCG GTCAACGTGT TCGCCGCCGT CATCGCCGGA CCCGGTGAAC GCAAATCCCC GGTGCACCGC ACCATGACCG CACCCCTGTT CAGCGCCCAA TCAACCCTCG CCGAAGCGGT ACGCCCGAGG ATCGCCGAAG CCTCCGCGCT GCGCGACATT GCCGACCGGC AGGCCGAACA GGCCAAAGCC CAAGCGGCCA AGGCCACCGA CCCGGGCAAG CGCGATGAGG CCGCCGCCGA GGCGGTAGCC GCCGCTATCT CCGCTGAGGC CATTACCGTG CCCGGCCTGC CCCGGCTGAT CGTGGACGAC GCCACCCCCG AGGCCCTGAT TGGGTTGATG GCCGCCAATG GCGGACGCAT GGCGATCATC TCCGATGAGG GCGGCATCTT CGACACCCTC GCCGGCCGCT ACTCCGGCGC ACCCAACCTC GACCCCTACC TGAAAGGCCA CGCCGGACAA CCGATGAGCA ACGAACGGCA AACCCGCGAA GGAGCCACCG TCGACAAACC CGCCCTCACC GTCTGCGTCA TGGCGCAGCC CTCGGTGCTG CGCAAGTTCG GCGGGAACAC CGAGCTCGCC GGACGCGGAC TGCCCGCCCG GTTCCTGTTC GCCCTGCCCC GCTCCCTCGC TGGCTACCGG GCGGTCGACA CGCCGCCGAT CCTGGAGACG GTGACCGCCG GCTACCGGCG CCGGGTGCAC GATCTGGCCG CCACCCTCGC CGATCGGGAA GACCCCGCCG TCGTGGTCCT CACCGAGGAA GCCGGCAGGG TGCGGCGGGC CGCCGCCGAA CAGGTGGAGG CCGAGCTACG GCCCGGCGGG AGCCTCTACG ACATGCGGGA GTGGGGCAAC AAGCTCTCCG GGGCAACCCT CCGGCTGGCC GGGCTGCTGC ACGTCGCCCA CCACCCCGCC GACGCCTGGC GATGCCCCAT CGACGCCGAC CGCATGGCCG ACGCCGTACG CCTCGCCGAG TTCTTCGCCG CCCACTACCG GGCCGCGCTC ACCACGATCG GCAGTGACAC CGCAATCGAA CACGCCCGGT ACGTGCTCGG CGTACTCACC ACCAAGGGCA TGAGCACCTT TACCCGCCGG GAGCTGCACC GCAAGGTGTC CCGCCGGCTC CCGAAGTCCG ACGAGGTGTC GGCAGTGCTC GCCGAGTTGG CCGCCCTCGG GTGGGTCCGC AACGGACCGG ACGGCCGATA CGAACTGCAC CCCCGCGCCG TCGCGGAGGA CCCCGAAAGC GTTGACACGC TGACACCCGT CCCGACCGGC GACGTTTCCG CAGCTCACAG CACGTCCGAG GGTGTCAACG CCCCCCGTTG A
|
Protein sequence | MGRHGWPKST GSKARYGPAA MENMRQAVAA GGWRPTPELI EQLMGFPPGW TSIDSKPSAT PSSPTSPNTS AASSLTTTKA SDLMTPARLH LVTDQPQPTT SKETTGGPLW DVPVPLTGPS TAPPPFPADV FPTWLRDMVT GVARFTQTDP AMAGTLAVAV LSACAGGRLE VEPVPGWREP VNVFAAVIAG PGERKSPVHR TMTAPLFSAQ STLAEAVRPR IAEASALRDI ADRQAEQAKA QAAKATDPGK RDEAAAEAVA AAISAEAITV PGLPRLIVDD ATPEALIGLM AANGGRMAII SDEGGIFDTL AGRYSGAPNL DPYLKGHAGQ PMSNERQTRE GATVDKPALT VCVMAQPSVL RKFGGNTELA GRGLPARFLF ALPRSLAGYR AVDTPPILET VTAGYRRRVH DLAATLADRE DPAVVVLTEE AGRVRRAAAE QVEAELRPGG SLYDMREWGN KLSGATLRLA GLLHVAHHPA DAWRCPIDAD RMADAVRLAE FFAAHYRAAL TTIGSDTAIE HARYVLGVLT TKGMSTFTRR ELHRKVSRRL PKSDEVSAVL AELAALGWVR NGPDGRYELH PRAVAEDPES VDTLTPVPTG DVSAAHSTSE GVNAPR
|
| |