Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_0764 |
Symbol | |
ID | 5708283 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 851683 |
End bp | 853980 |
Gene Length | 2298 bp |
Protein Length | 765 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641270283 |
Product | hypothetical protein |
Protein accession | YP_001535674 |
Protein GI | 159036421 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.902908 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0151558 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCCGGG AGGAGGACGA CGCCGTGGCG CTCGTGCGCG TGTACTGCGG TCTGGCCTCG GCGGATCCGG CCGACCGACC GGCTTCGGCT GGTTCGGCGC TGACGTCCGC TGTAGTCGAC GATGCAGGTC GCCTGCTACA TATCAGCGAG ATCAGCGATG ACCCCGCCGG CTATGCCCAG TTGGTCACGC TGCTCGTGGA CCGCTCGGGC GGGCCGAGTG GCGCGGCGAT CGCCGCGGAC AGCGACGACC ACACGGTCAC CTCACTACTC AGTGCCGCCG GTCGGCCGCT CGCGATCGCC GACGACGACT CGGTGGACGA CTTCGCCGAA CGATTCGCCG ACGACGACTC CCCTGAGGAG GTGCAGGCGC CACCGGTCGA GCGGCGCGCC GTCGGCCTGG CCCGCGCCTT GCAGGCCGGT GCGCTCTCCG CTGTCACACT CCCGGCCCCT CGAGACCTCG CCGGCTACAA ACAGGTACTC GCCGCGCACG CCGCGCTCGC CAACGGCCGG CATTCGGCTG CGGTCGCGCT ACGCGAAGTG TTGCGCGAGC TGTTTCCGGC AGCCCTGCAG GCATACCCCG ATCCGGCCGC TCCGGTCGCG CTGGCGGTGC TGGACGCCCT TCCCGAACCC GGGATGCTGA CCGGCCCTGG GCGCGACCAA TCGGCCACTC GGGTCAGCGC GGACCTCACC GCCGAAGGCG TCGCCGCGTC CGAGGTGATC GAGGCCGCTA TCACCGCCCT CCGAGTCGCC ATCTCCGAGA CCCCACGGCG AGCCTCGGTC AACCCCGCCC TGACGGCAGC GGTGTCCACC ACGGTCCGGC AGGCTGTCGC CTCGGTCCGG GCCTGCGATG TCGGGTGCGC GGCGCTGGTC GGCGCGCTCG ACGACCGAGC GGGCTCCCCC GCGCCCGGCC GCCGTGCCGC CACCCGACAC GGCGCGCCGC TGGACAAGCC ACCGTCCACC GACGCCGCCC GAACCAGCGG ACGGCCAGGC GTGCCCCCTC GGCCGGCGAC GACCGGTGCT CGCCGTCCCC AGCCCGAGCC GGTGTCGGGC CAGACCCCGC CGCCGTCCCC GCGCCCCCTC GGCCCGCCAC CGGTCGCGCC GGCACCCGTG GCACCCCCGC CGGTGGCGCC GCAGCCGATC GCACCCGCAG CCGTGGCCGG GGCACCGGCC CCTGCGGCAC CCGGTCAGCC GGAGCCGATG CCGAGCCGTG TCGACGCTCC GACGAACCGC CCGATCTCAC CGCCACCGCC CCCACCTCCT GGCATCACGC CGATCCCGCC GTCGCAGCGA GGTTCGATCC CACCGGCCGA GGCCGGCGAG CCGTTCCGGG CCACCTTGAC CACTGCGGCG ATCCAGGAGG CCCGTGCCGA GCGACAGCGC ACCGCCACTC CCCCGCGACC CCACACGGCG TCGGAACCGC CGACGCCAAG CGGTGGCTTC AGCGTGACCG ACCTCAGCGT GCCGGTGCCC ACCCCGCGAC CGGCTCAGGA GCCCGCCCCC CCGGGCTCCC GGGCGAACTG GCCGCTGGTC AACAACGCGG ATCCGGCTGA CAGTTCGGCG CGCGTTCCCG GCGCAGACAC ATACGGCGGG CAGGGGGTGG ACGCCCCGAC CGATCCGAGT GCGGAGCGTC GAGTGCCGCC GCCGTGGCTC GCCGACGACA TGCCGCAGGA GCCACCGGTG CTGCGCCTGG TCGAGCCGCC CCCGCTGGCC GATCGCGCGC TACGCGACGG GGCTGATCCA CGGCTCGAAA CGCCACCGCT ACGCCTCGTC GACGAGGGTG AGCCCAACGA TCACCCAGCC GCCAAACGCC CTGCCGGGCA GCAGATGCCG CCGATGGAGC ACCGGCCCCC GCCGGTCACC GACGACGGGG ATGGCGACCT GCTCATCTTT GCCCAGGCGA AGTCGGCCTG GTTCGTCGGT CAGAGCGACG AGGACGACGA GGTGGACTGG TCGTCGCTGA ACGACACCGG TTGGCAGGCG GCCGAGCAGG CCGCGAAGCC GGCCACCGGC TCCGAGACCC CTTCGGGCCT ACCCAAGCGG GTACCCCAGG CCAATCTGGT CCCCGGCTCA CCGAAGCAGG ACGACCGTCC CCTGCGGATC GTTCGGGATC CGGCCAGCCT CGCGGAGAAC ACAACTGGCT ACTTCCGTGG CTGGCGTCGG GGTCAGGAGA TCGGAGGCTT CGCGGTTGGT GGCCGCCCGG GTCGCGAGGC CGCTGGCGGC TGGGACTTCA CCCGCGACAC TGGCGAACGC GACGAGGACC GGAAGTACGA GTACCGCTCA GCCGGTTACC GGTCCTGA
|
Protein sequence | MSREEDDAVA LVRVYCGLAS ADPADRPASA GSALTSAVVD DAGRLLHISE ISDDPAGYAQ LVTLLVDRSG GPSGAAIAAD SDDHTVTSLL SAAGRPLAIA DDDSVDDFAE RFADDDSPEE VQAPPVERRA VGLARALQAG ALSAVTLPAP RDLAGYKQVL AAHAALANGR HSAAVALREV LRELFPAALQ AYPDPAAPVA LAVLDALPEP GMLTGPGRDQ SATRVSADLT AEGVAASEVI EAAITALRVA ISETPRRASV NPALTAAVST TVRQAVASVR ACDVGCAALV GALDDRAGSP APGRRAATRH GAPLDKPPST DAARTSGRPG VPPRPATTGA RRPQPEPVSG QTPPPSPRPL GPPPVAPAPV APPPVAPQPI APAAVAGAPA PAAPGQPEPM PSRVDAPTNR PISPPPPPPP GITPIPPSQR GSIPPAEAGE PFRATLTTAA IQEARAERQR TATPPRPHTA SEPPTPSGGF SVTDLSVPVP TPRPAQEPAP PGSRANWPLV NNADPADSSA RVPGADTYGG QGVDAPTDPS AERRVPPPWL ADDMPQEPPV LRLVEPPPLA DRALRDGADP RLETPPLRLV DEGEPNDHPA AKRPAGQQMP PMEHRPPPVT DDGDGDLLIF AQAKSAWFVG QSDEDDEVDW SSLNDTGWQA AEQAAKPATG SETPSGLPKR VPQANLVPGS PKQDDRPLRI VRDPASLAEN TTGYFRGWRR GQEIGGFAVG GRPGREAAGG WDFTRDTGER DEDRKYEYRS AGYRS
|
| |