Gene Information |
Locus tag | Sare_0161 |
Symbol | |
ID | 5706568 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 173268 |
End bp | 175280 |
Gene Length | 2013 bp |
Protein Length | 670 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641269687 |
Product | hypothetical protein |
Protein accession | YP_001535087 |
Protein GI | 159035834 |
COG category | |
COG ID | |
TIGRFAM ID | |
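The Start bp, End bp, Gene Length, and Protein Length fields above are linked by simple arithmetic. A minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are illustrative and are not part of the source database:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(173268, 175280)
print(length, protein_length_aa(length))  # prints: 2013 670
```

Under these assumptions the computed values agree with the 2013 bp and 670 aa listed in the record.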
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000283972 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATTCTGG CCCGGCGACC GCCGGCCGCC GAGGGAATCC AGTGGAAGGC GGTGGACGTG TCAACACACC GACGTGCCTG GAAGCAGCGG GCCGGTGTGG TCGCGGCGCT CGTGGTCGGC GCCCTGCTCA CCGTCCCCGC GTCCCCGGCC TCCGCCGCTC CGGGCGTCAA CAATCTGTCG GCGTCGCCGA GCCGGATCGA GGCCGGCGGC ACCATGACGG TCAACTACTC GATCAACATC TCGGACGGCA ATCCCGCCAA CGTGACGGTG GCCTCCAGCA ACGGAGAGCT CACCTGTGTC AGCGGCTGCA GCCACAGCGA TGTCACGAGG AACCAGGGCT TCCAGGCCAG CTTCCGGCTG GCGAGCGACG CGGCCGACGG GCGGGCCACG ATCACGGTTA CGGCGGTCGA CTCCGAGGGG GAGGACACCG CCGACACCAC CGTCACCCTG GTGGGCAAGC CGGAGCCGCA ACCCACCCCG CCGCCCACCC AGGCGCAGAC GGTGAAGTCG GTATCCGGCG AGGTGGTCAA CCAGAGCACC GGCAGCGCGG TGCCGAGCGC CCTGGTGGTC CTCAAGGACT CCAACGGCAA GAAATTCGAC ACCATGACCG ACAACGGGGG CAACTTCCGG TTCACCGGCA GCACCCAGAA CCCGATCGCC CCCGGCCGAA TCGACATCGG CGCGAGCTTC GAGAACGTCT ACGCCACCAA GAGCTTCAGC GCCTCCGCGG GGCAGAGCGT CTCCGGTCAG CGAATCTCCC TCGCGATCAA GAACGAGACG CCGAGTCCTA CCGCGTCGTC GAGTATCGCG CCGGCACCGA CCGTAGGGGC CGAGGAAGAG GTCGAGCAAT CGATCGAAGC GGTCGTGGAC ACCCCGACGG AGGCCGCCGG TAATGAGGAC TCCGGCGGCA TCGGCTCACT GCTGATCATC CTGCTCGGCG GCCTCCTCGT CGCCGCCGGC GTCGGCACCA TCGTGCTGCT GTGGCTACGC CGCAAGGAGA ACGACGACGA AGCCGACGAG GCGGATGGTC CGCCCAGCAC TGCCACCGGC GCGGTCCCGG TGGCCCGCGG CGGCTACGGC GGCCTTGACG ATCACACGCG GATCGTGAAC CCGACCGCGG CTGCTCCGAC GATGGTCGGC GGTGACCCCT CCCTCGCCGA CGCTCCCACG ATGATGCACC GCCCGGTGGT CGACGACGTC CCACCGGATC CCTACGGTGC ACCGCCGAAC CCGTACGGCA CCCCCGCCGG CCAGTCCGGC TGGGCCGGCA GCGGCTACGG TGACCAGCCG GACGCCTACG GCACCGGGGG CTTCGGCGCC ACCGGGCCCG CCGAAGGCGG CTACGGCGGC ACCGCGACGC CCTCATCCGG CGGCGGCTAC GGCAGCACCG CAACGCCCTC ATCCGGCAGC GGCTACGGCG GCGACTACGG CACGCCCACG TCGGCCGGGA GCGGCTATCC GCCGGCCTCC GGCGGGTCGT ACGGTTACGG CGAGCGGTAC AACGAGGCGA CCGGCCACTT CCACCAGGAG GCCGCCACGC AGTATCCGGC CCCGGCCGAC CCGTACCCCA CCGGGCGGTA CCAGCAGGAC GCCGGGTACA GCCACCCGGA TCCCACGTAC GGCCAGGGTG GCGAGCCGGC CGGCGGCTAC GACCAGCGGG GCAGCTATGA CGACCCCGGC TACGGCCAGC ACGGTGGCTA TGGGGGCGGC TACGGTGGCT ACGGCCAGGA GACACCCACC CCGTACGGCG GCTACGGCCA GCAACCGCCA ACCGCGTACG GCGGTTACAG CCAGCAACCC 
CCCACCCAGT ACGGCGGCTA CGGCCAGGAC CTACCCACCC AGCGCGGTGG CTACGACGGC TACGACCAGG AAGCACCCGG CCAACACGGC GGCTACGACG ACCGCGGGTA CGGCCAGGGC GGCTACGGTG ACCCGAACCA GACCGACCGG CCCCGGTCCG ACGCTCCCCC GCCGGACCGG GGCGGTCGCC GACTGGACTG GCTGGACGAC TGA
|
Protein sequence | MILARRPPAA EGIQWKAVDV STHRRAWKQR AGVVAALVVG ALLTVPASPA SAAPGVNNLS ASPSRIEAGG TMTVNYSINI SDGNPANVTV ASSNGELTCV SGCSHSDVTR NQGFQASFRL ASDAADGRAT ITVTAVDSEG EDTADTTVTL VGKPEPQPTP PPTQAQTVKS VSGEVVNQST GSAVPSALVV LKDSNGKKFD TMTDNGGNFR FTGSTQNPIA PGRIDIGASF ENVYATKSFS ASAGQSVSGQ RISLAIKNET PSPTASSSIA PAPTVGAEEE VEQSIEAVVD TPTEAAGNED SGGIGSLLII LLGGLLVAAG VGTIVLLWLR RKENDDEADE ADGPPSTATG AVPVARGGYG GLDDHTRIVN PTAAAPTMVG GDPSLADAPT MMHRPVVDDV PPDPYGAPPN PYGTPAGQSG WAGSGYGDQP DAYGTGGFGA TGPAEGGYGG TATPSSGGGY GSTATPSSGS GYGGDYGTPT SAGSGYPPAS GGSYGYGERY NEATGHFHQE AATQYPAPAD PYPTGRYQQD AGYSHPDPTY GQGGEPAGGY DQRGSYDDPG YGQHGGYGGG YGGYGQETPT PYGGYGQQPP TAYGGYSQQP PTQYGGYGQD LPTQRGGYDG YDQEAPGQHG GYDDRGYGQG GYGDPNQTDR PRSDAPPPDR GGRRLDWLDD
|
| |
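The gene and protein sequences above can be cross-checked against each other and against the GC content field. The following is a rough sketch, assuming the sequences are pasted in as strings with their display spacing intact. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. The codon table is built from the amino-acid string that NCBI lists for translation table 11. The sketch does not handle alternative start codons, which is not an issue here because the gene starts with ATG, and it would raise a `KeyError` on ambiguous bases:

```python
BASES = "TCAG"
# Amino acids for NCBI translation table 11, in the order TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used for display and upper-case the rest."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed in this record.
print(translate("ATGATTCTGG CCCGGCGACC GCCGGCCGCC"))  # prints: MILARRPPAA
```

Running `translate` on the full gene sequence and comparing the result with the protein sequence (after removing its spaces), and comparing `gc_content` with the listed 71%, is one way to check that a copy of the record is intact.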