Gene Information |
Locus tag | Sare_3765 |
Symbol | |
ID | 5705668 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4301368 |
End bp | 4302864 |
Gene Length | 1497 bp |
Protein Length | 498 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641273185 |
Product | phage-related major capsid protein |
Protein accession | YP_001538549 |
Protein GI | 159039296 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
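The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length; the function names are hypothetical and are not part of the record.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
gene_len = span_length(4301368, 4302864)
print(gene_len, protein_length(gene_len))  # prints: 1497 498
```

Both results agree with the Gene Length (1497 bp) and Protein Length (498 aa) fields, which is consistent with the inclusive-coordinate assumption.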
Plasmid Coverage information |
Num covering plasmid clones | 134 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1596 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGGAGT ACCTTCGCGC GCAGCTTGTC CAGCTGCGCG AGCGGCGGGC CGCGCTGCGT GCCGAACTCG ACGCCGTCAT CAGCGGCGCC CGGTCGGCAA GCCGCGACCT GACCGACGAC GAGCAGGCTC GCCTCGTCGC GGGAACCGAG CAGCTGCGCA AGCTCAACGG TGACGAGGAC GGCCTGAATA GCCAGATCCG CGAGGCGGAG GAGACCGAAC ACCGCGAGCA GGTCGCGGCG GGCGCCCGCG CCGAGTCCGG CCAGGTGGGC GAGCACCGCA CCGGCGGCGC CGTGGTCACC AGTGAGCCGC AGGTGTACGG CAACGGCTCC GGAAACTCCT ACTTCCACGA CCTGGCCAAG GCGCAACTGC GCGCCGACTC CAGCGCCGCG GAACGCCTGC AGCGGCACGC CGCCGAACTA CGGGTCGAAC TACCCGCCCG GGAGCGGCGC CGCGAGGAGC GCGCCCAGCG GGAGATGGAC GGCATGGGCA CGGCCGAGCG CTGGCACGAG GAGCAGCGCA GCCGCGTGTT CGAGAAGCGG GTCAACCCGA ACCGGACCGA CGGCCAGGGC GGCTACTTCG TGCCGCCGCT GTGGCTGATC GACCAGTACA TCGACCTGCC GCGCTTCGGT CGGCCGATCG CCAACGCCGT GCGCAACATG GCACTGCCGG GCGGCACCGA CTCCGTGAAC CTGCCGAAGG TCGCCACCGG CACGTCAACC GCCGCGCAGA CCGCCGATGG TGCCCCGGTG ACCAGCACCG ACATGACCGA TACCAGCGTG TCCGCATCGG TCTACACGGT CGCCGGCCAG CAGGACGCGT CCATGCAGCT ACTCGACCAG TCCCCGGCGC CGGGCTTCGA CGAAATCATC TTCGCCGACC TACTCGCCGA CCTCGCGGTT CGGCAGGACG TGTACGTGAT CAACGGCTCC GGGACTGCCG GGCAGCCGAC CGGCATCCTC AACGTCAGCT CGCCGAACGC GATCACCTAC ACGGACGCGT CGCCGACGCT GCCGGAGATG TACGTGCCGT GGGTCCAGTC GGTCTCCCAG ATCTTCACCA ACCGCAAGCG GCCCGCCACA GCCACCTTTG CGTTGCCGAA GATCTGGTTC TGGGCGACTG CCGGTCTCGA TACCACAAAC CGGCCGCTGA TCCAGCCGTC ACAGGAGGCG CCGTTCAACC CCATGGCCTT ACAGACCGGC GAAATCGCTG AGGGCCCGGT CGGCAAACTG ACCGTCGGCA CACCGGTGAT CCTCGACGGC AACATCCCGG AGAACCTCGG CGCCGGCACC GACGAAACGC GGATCATCAC GCTACGCACC TCCGACCTGT ACCTGTGGGA GGGCGCGATC CAGACCCGTG TCCTCACCGA GGTGCTGTCG GGGACGCTGC AGGTCCGCTT CCAGGTGTAC CGGTACGCGG CGTTCATGGC CACCCGGCTA CCGAAGGCGA TTTCGATCGT CTCGGGCACC GGCATGATCC CGACCTCCGG CTACTGA
|
Protein sequence | MLEYLRAQLV QLRERRAALR AELDAVISGA RSASRDLTDD EQARLVAGTE QLRKLNGDED GLNSQIREAE ETEHREQVAA GARAESGQVG EHRTGGAVVT SEPQVYGNGS GNSYFHDLAK AQLRADSSAA ERLQRHAAEL RVELPARERR REERAQREMD GMGTAERWHE EQRSRVFEKR VNPNRTDGQG GYFVPPLWLI DQYIDLPRFG RPIANAVRNM ALPGGTDSVN LPKVATGTST AAQTADGAPV TSTDMTDTSV SASVYTVAGQ QDASMQLLDQ SPAPGFDEII FADLLADLAV RQDVYVINGS GTAGQPTGIL NVSSPNAITY TDASPTLPEM YVPWVQSVSQ IFTNRKRPAT ATFALPKIWF WATAGLDTTN RPLIQPSQEA PFNPMALQTG EIAEGPVGKL TVGTPVILDG NIPENLGAGT DETRIITLRT SDLYLWEGAI QTRVLTEVLS GTLQVRFQVY RYAAFMATRL PKAISIVSGT GMIPTSGY
|
| |
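The sequences above are printed in space-separated blocks of 10 characters, so the spaces have to be stripped before any computation. The sketch below shows one way to compute GC content and to translate the gene sequence. It assumes the gene sequence is given in coding orientation (it starts with ATG even though the gene lies on the minus strand) and uses the standard codon assignments, which translation table 11 shares with table 1 for all 64 codons (the two tables differ in which start codons they allow). The helper names are hypothetical, and ambiguous bases such as N are not handled.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence in this record.
prefix = "ATGTTGGAGT ACCTTCGCGC"
print(translate(prefix))    # prints: MLEYLR
print(gc_fraction(prefix))  # prints: 0.55
```

The translated prefix matches the first six residues of the protein sequence (MLEYLR). Passing the full gene sequence to `gc_fraction` and `translate` gives values that can be compared with the GC content and Protein sequence fields.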