Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3097 |
Symbol | |
ID | 5706571 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 3518973 |
End bp | 3520616 |
Gene Length | 1644 bp |
Protein Length | 547 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 641272532 |
Product | hypothetical protein |
Protein accession | YP_001537900 |
Protein GI | 159038647 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0360838 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAGC TATTAGCGCT CACCAAAGTT GGCGCCGGAT GCGCGCTGGT CGCAATGGCC GTGGTGACTG GTGTCCAGCC GGCATCGGCG GAGCCCCAGC CCGCCCCTGC CGTATCACAG CTGGTGAACT ATGAGACGGT CGCTTGCGAC CCAACTGGGA CGACCGCCGC AGACGCCGCG CTGGCCAGTC AACTCAATGC CGTACTCACC GCAGATATGC GTGGCTACAT GACAGCGTAC AGAACGTCGT GTGCACGCAT GGTAGTAGCG GCGGTCAAGG CCCGAGGACT CTCACCCCGA GCGGCGGTGA TCGCGGTCAC CACGGTTATT GTGGAAACCC ACCTCCGAAA CATTAGCGAA GAGGTAGACC ATACAAGTCT CGGGTTGTTC CAGCAGCAGG AATGGTGGGG TTCTCGGGCG GAGCGGCTAA ACGCAACCTG GGCTACCAAC AAGTTCATCA GCGTTATGCA TAATAAATAC CCGGACGACT CATGGATGAC CGCCCCGATC GGCGAAGTCT GCCAGGCAGT GCAGGTGTCT GACTTTCCGG ACCGCTACCA GGATCAGGCT GATGACGCAC AAACCATCGT GGACGCTTTA TGGGTGCCTA CGGTCGGTGC GCCGGATGCG GTGTCGCGGG ATGGGGTGGT GGTGTCGTCG TCGGGGCGGA TTTCGGTGTA TGCGGTGCGT GCTGATGGTG ATGTGTGGGG TCGTAGTCAG GAATCGCCGG GTGGTTCGTT CAATGCGTGG CAGCGTTTGT CGACCGGTGG TGGTTTTGCT GGTCAGGTAG CGGTGTTGCG GGATGATCGT GACCGGGTGG CGTTGTATGC GCGGCGGAGT GGGACGATAT TCGGGGCGAG TCAGCAGGAA GTTGGTGGAT CGTTTGGTGT GTGGGGTCCG ATCGGTACGA ACGGTGCGGG GGTGACGGGG GATCCGCGGG CGGTGTATGC GTCTGAGGGG CGGATCGCTA TCTATGCGAC GACGAGTAGT GGGAATGTGT CGGGAGTGAC GCAGACGCAG GCTGGTGGTG GGTTCGGTTC ATGGCAGCAG TTGACCAGTG GTGGTGGCTA CATGGGTAAG CCAGCGGCGG TGGTGGATTC TCAGCAACGG GTGGCGTTGT ATGTGCGTCG GAACGGCATG GTCTATGGGG CCAGTCAGTC GCAGGCTAAC GGTTCATTTG GGACGTGGGC TGCCCGGGGT GTTGATGGTG CGGGTGTGGC CAGTGATCCG GTGGCGGTGT ATGGGGTCGG GGGTAGGATT GCTATTTATG TCACCAGCAC TGCGGGGAAC GTTGCTGGGG TCAATCAGGT AGCCGCTGGT GGTGAGTTCG GTGCTTGGCA GGTGTTGACC AGCACGGGTG GGTATGAGGG CCGGCCGGCG GTGTTGGTTG ACGAGCAGGG TCGGGTAGCG GTCTACGTGC GTCGAAGTGG CGCGATCTAC GGCGCTAGTC AGCCCGAGGC CGGTGGTCCG TTCGGTGCCT GGGCTGCTCG TGGCACCGGT AGTCCCCAAC TCATCGGTGA TCCCACTGCT GTGTATGGCG TTGGTGACCG AATCGCCCTG TATGCCGCCG CTACCAACGA CAGTATCGGC GGTGTTAGCC AGGGCGAAGC CGGCGGCACC TTCGGCAACT GGATCGTCCT TTGA
|
Protein sequence | MKKLLALTKV GAGCALVAMA VVTGVQPASA EPQPAPAVSQ LVNYETVACD PTGTTAADAA LASQLNAVLT ADMRGYMTAY RTSCARMVVA AVKARGLSPR AAVIAVTTVI VETHLRNISE EVDHTSLGLF QQQEWWGSRA ERLNATWATN KFISVMHNKY PDDSWMTAPI GEVCQAVQVS DFPDRYQDQA DDAQTIVDAL WVPTVGAPDA VSRDGVVVSS SGRISVYAVR ADGDVWGRSQ ESPGGSFNAW QRLSTGGGFA GQVAVLRDDR DRVALYARRS GTIFGASQQE VGGSFGVWGP IGTNGAGVTG DPRAVYASEG RIAIYATTSS GNVSGVTQTQ AGGGFGSWQQ LTSGGGYMGK PAAVVDSQQR VALYVRRNGM VYGASQSQAN GSFGTWAARG VDGAGVASDP VAVYGVGGRI AIYVTSTAGN VAGVNQVAAG GEFGAWQVLT STGGYEGRPA VLVDEQGRVA VYVRRSGAIY GASQPEAGGP FGAWAARGTG SPQLIGDPTA VYGVGDRIAL YAAATNDSIG GVSQGEAGGT FGNWIVL
|
| |