Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_2950 |
Symbol | |
ID | 5707804 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 3346085 |
End bp | 3347305 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641272399 |
Product | hypothetical protein |
Protein accession | YP_001537767 |
Protein GI | 159038514 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.291 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0162357 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGACGC AGGCGAACCC AGCGACCACC AGCTGCACCG CTACTGCCGA GTCGGTGATG ATCCGTGACT CGATGAGTGC CGTCCCGCGG ATCTTCGCGA TGCCGCTGAC CTTCCTCACC GGCAAACCGG CAACGGGTCA GCGCCCACTG CGGCTGACCC CCGGCATCCA CCTGGCGGCC GCGACCGTGT CCGTGCTGAT CGGCCTGAGC CTGAGCTGGG TGGCGATGGC CGGTGGCGGG TGGTGGCTGC TGCTGATCGT GGGCTGGGCG ATGACGCTGC ACGGTGCCCG TAACGCGCGG ATGATGGTGT ACCACCAGGC GGCGCACCGG AACATGTGGG CCCGGCCGCG CCGGGACCAG GTCGTCGGTC GGATCGTGGC CGGGGTGCTG CTGGTGCAGG ACTTCAGCCG GTACAGCACC GAGCACGTCC TCGACCACCA CGCCGTCCAC CACATGACCG TGCGGGATCC CACGGTGCAG GCATTTCTCA TCGGGCTGGG GCTCCGTCCC GGGATGACCC GACGGGAGAT GTGGCGGCGC CTGATCGTCC ACAAGCTGCT CTCACCCACG TTCCACCTGA GCTTCCTGAT CGGCCGAATC CAGTCGTACT TCGCGCCGGC CAGCTGGCGG CAGCGACTGC TCACCCTGAC GGTCTACGGT GCCGTGATCG CGCTCGCCGT ACGCTTCGAC GCCTGGGTCT TCCTACTGGT CGCCTGGGTG TTGCCGATGA CCTTCTTCTA CCAGGTCAGC AACACGCTGC GGCTGTGCGT CAAGCACACC TTCCCGTCCC CCGCGGCCAC CGAGCGGCGC GGGCGCGGGT ACTTCGCCAG CCTCACCAAC GCGATCCTGA TCGGCGAGCG GGCCCCGGAC CGCGAGGTCA GCGGGCGGCT GCGTCGCCTG CGCGGCTGGG CCCGGTGGTG GCTGCGCATG CTGACCGTCC ACCTGCCGGT CCGCTATCTG GTCCTGACGG GCGACACCGT GGTGCACGAC TTCCACCACC GCCACCCGAT GAGCCGGGAG TGGGCGGACT ACATCTTCGC CCGGCAGGCG GACATCGACG CCGGGCACCG CGGCTGGCCG CCGTACCGGG AGATCTGGGG CCTGGTGCCC GCGATCAACC TGGTGTTCGA GTCGCTGTCG CGGGCCGACC CGCAGGAGTA CGACCGGGCA CGCATCCCAG AGGTCAGCGG ACGCAGCGTC TTCTCCGCCT TCGACGACTG A
|
Protein sequence | MVTQANPATT SCTATAESVM IRDSMSAVPR IFAMPLTFLT GKPATGQRPL RLTPGIHLAA ATVSVLIGLS LSWVAMAGGG WWLLLIVGWA MTLHGARNAR MMVYHQAAHR NMWARPRRDQ VVGRIVAGVL LVQDFSRYST EHVLDHHAVH HMTVRDPTVQ AFLIGLGLRP GMTRREMWRR LIVHKLLSPT FHLSFLIGRI QSYFAPASWR QRLLTLTVYG AVIALAVRFD AWVFLLVAWV LPMTFFYQVS NTLRLCVKHT FPSPAATERR GRGYFASLTN AILIGERAPD REVSGRLRRL RGWARWWLRM LTVHLPVRYL VLTGDTVVHD FHHRHPMSRE WADYIFARQA DIDAGHRGWP PYREIWGLVP AINLVFESLS RADPQEYDRA RIPEVSGRSV FSAFDD
|
| |