Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3334 |
Symbol | |
ID | 5708289 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 3845329 |
End bp | 3846903 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 641272761 |
Product | hypothetical protein |
Protein accession | YP_001538128 |
Protein GI | 159038875 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.176126 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000765121 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATCGGTC AGCGCGGTGA GCAGCGGTGC CGCCGTTGCA GAACACGGCT GGCGCGCGAC AACAGAACCG GACACTGCGC GCCGTGCCAG CTAGCCGGCC GCGACCGGTT CGCACATCCA CTCAGCGTGC CAGCCGAGTT CTGGGATCAT CCGGCGATTA GGGAGGCGAT AGCCGCACGT CACATGGGAC GCCTCATAAG AGCGTACCGC TGCCATCCAA TTCACGGACT ACATCCGCTT CCACAAGCGG TCGTGGCGGG CTGGCTCGGC GTCACCCAAG CGCAACTAAG CCGAATCGAA AAACACTCAC CGGTCGTCCA CCTCGACCGC TTGATACTCT GCGCGAGAGC ACTTCGAATC CCTGCGGACA GGCTATGGTT CGCTCTTCCC GAAAGCAACG GCGCATTCGG TGAAAAGGCA CCGGCGGCGA ACGGTATAAT ATCGGATGGC GAATTCAGCC CGCCCCTACT TTGGGCGTCG ACAAATACGG CTGAAATCGT TAGCCAATTT ACGAGAAGGG ATCTTACCGT GGACCGACGC GAGGCAGCGA AAAATGTTGT CGGCGTCGTG TTCGGCGCGG CACTTCTTGA ACCTATGGAG CGCTGGCTCG GTGATCCCGC ATCTGATCAT GGCGACGGTC GACCGAGTGG TGTGGGATAT CAAGAGGTTG GCCAGATTGA ACTTGTGGCA CGAATGTTCC GGGAATGGGA CGATCAGTTC GGGGGCGGAT TGCGGCGGAA AGCGGTTATC GGCCAGCTGA ACGAAGTTTC CGAACTTCTC CGGGACTCCC ATCCAGCCGA AATCCGTCGC CGACTGTTCG GCACGGTAGC CCACCTCGCC GAAACTGCGG CCGTCATGTC CTGGGATTCT GGACAGCAGG CACTCGCACA ACGGTACTAC ATCCTTGCCC TGCATGCAGC GAAACCGGCC GGCGATTTCG CTTTCGCGGC GAACATTATG GCCGGCATGG CTCGACAGCT TCTCTATCTC GGCCAGACAG GCGACGCCCT TGAGCTGATA AGAGTCGCTC AGGACAGCGC CAAAGATGCG ACGTCAACCG TTCGGTCCAT GCTCTACACA CGCGAGGCAT GGGCCTACTC AAAGCAAGGG CGCATCTCCG CCTTTCGACG TGCGACCGAT AATGCCCAAG AAATGTTCGC TGCCGCTACG CCGGATGAAG ACCCGTACTG GATCACTTAC TTCGATGCGG CTGAGTTGGC CGGCACAACC GGCGGCCGGT TCCTTGATTT GGCTCATACC AACCGAGAGA TGGCGGACGA GGCTGCAGCC GAAATTGAGA GCGCGATCGA CTTGCGCCGT CCGGGGCGTC TCCGAAGTTC CGCGTTGGAC CATATCGGAC TTGCGGAAGC GCGATTGATT CAGGGCGAAT TGGACGAAGC GGTAAGGCTA GGGCACAGTG CCGCCGATGT TGTCGAGCAG ACTTGTTCTG ACCGGGCTCG CGTAAAATTC GCCGAATTCC ACCAACACGT AGCCACCTTC GCCGAAGTGG CGGCTGTCGC GGAACTGCGA GAGCGAATCG GCACCCTGCT GGCCAAGCCT CCGACGACAC TATGA
|
Protein sequence | MIGQRGEQRC RRCRTRLARD NRTGHCAPCQ LAGRDRFAHP LSVPAEFWDH PAIREAIAAR HMGRLIRAYR CHPIHGLHPL PQAVVAGWLG VTQAQLSRIE KHSPVVHLDR LILCARALRI PADRLWFALP ESNGAFGEKA PAANGIISDG EFSPPLLWAS TNTAEIVSQF TRRDLTVDRR EAAKNVVGVV FGAALLEPME RWLGDPASDH GDGRPSGVGY QEVGQIELVA RMFREWDDQF GGGLRRKAVI GQLNEVSELL RDSHPAEIRR RLFGTVAHLA ETAAVMSWDS GQQALAQRYY ILALHAAKPA GDFAFAANIM AGMARQLLYL GQTGDALELI RVAQDSAKDA TSTVRSMLYT REAWAYSKQG RISAFRRATD NAQEMFAAAT PDEDPYWITY FDAAELAGTT GGRFLDLAHT NREMADEAAA EIESAIDLRR PGRLRSSALD HIGLAEARLI QGELDEAVRL GHSAADVVEQ TCSDRARVKF AEFHQHVATF AEVAAVAELR ERIGTLLAKP PTTL
|
| |