Gene Information |
Locus tag | Sare_3103 |
Symbol | |
ID | 5706577 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 3525823 |
End bp | 3527433 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 641272537 |
Product | hypothetical protein |
Protein accession | YP_001537905 |
Protein GI | 159038652 |
COG category | |
COG ID | |
TIGRFAM ID | |
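
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TAG). The function names are hypothetical and used only for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 3525823
END_BP = 3527433


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS whose last codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1611, matches "Gene Length"
print(protein_length(length_bp))  # 536, matches "Protein Length"
```

The strand field does not affect this calculation; it only indicates which strand of the replicon encodes the gene.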
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.186273 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0036434 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGATATA AATCCAACGG CTTGGGTCGG AAGGCCCTGT CCATATTTCT CGCTTTTACT GTAGCGTCCC TAACGTTCAT CATCGCTGAT ATCCGGATGT CGCTGCCGGT TCAAGCGTCG TCCACGGTCG GCGGAAGCAT TTCCCGTACC GAAGTGCTGA CCCGGGCGGA GTGGTGGGTC AATACGTACG GCGTCATCTA TAGCCAAAAT CAGAATGACC AGAAGCCCGA CCCGGATGGA CACCCCTACC GACCGGACTG CTCGGGATTC ATCTCGATGG CCTGGCACCT GCCAAAGAAG AGTGACGGCT GGGATCGCAA TACTGGTGAC CTTGATGCCT TCGGTGACAC CACCTACCTC AGCAACCTTG GGGAACTCCT TCCAGGCGAT GCGATCCTTG GCAAGAGTTA CGGGCACGTG GCGCTCTTTG ACCGGTGGGC CAACCCGTCC CGTACTGAAA TGTGGATCTA TGATGAATAC AAATCTGGAC GAGAGGGGAG GCACATCATC CAGTCGAGGA GTTGGTACGA GAGCGAGGGC TTTCGTGGAC TGCGATACAA CAAAATCACC TCGATGATGC CGGATGCGCC GGATGCGGTG TCGCGGGATG GGGTGGTGGT GTCGTCGTCG GGGCGGATTT CGGTGTATGC GGTGCGTGCT GATGGTGATG TGTGGGGTCG TAGTCAGGAA TCGCCGGGTG GTTCGTTCAA TGCGTGGCAG CGTTTGTCGA CCGGTGGTGG TTTTGCTGGT CAGGTAGCGG TGTTGCGGGA TGATCGTGAC CGGGTGGCGT TGTATGCGCG GCGGAGTGGG ACGATATTCG GGGCGAGTCA GCAGGAAGTT GGTGGATCGT TTGGTGTGTG GGGTCCGATC GGTACGAACG GTGCGGGGGT GACGGGGGAT CCGCGGGCGG TGTATGCGTC TGAGGGGCGG ATCGCTATCT ATGCGACGAC GAGTAGTGGG AATGTGTCGG GAGTGACGCA GACGCAGGCT GGTGGTGGGT TCGGTTCATG GCAGCAGTTG ACCAGTGGTG GTGGCTACAT GGGTAAGCCA GCGGCGGTGG TGGATTCTCA GCAACGGGTG GCGTTGTATG TGCGTCGGAA CGGCATGGTC TATGGGGCCA GTCAGTCGCA GGCTAACGGT TCATTTGGGA CGTGGGCTGC CCGGGGTGTT GATGGTGCGG GTGTGGCCAG TGATCCGGTG GCGGTGTATG GGGTCGGGGG TAGGATTGCT ATTTATGTCA CCAGCACTGC GGGGAACGTT GCTGGGGTCA ATCAGGTAGC CGCTGGTGGT GAGTTCGGTG CTTGGCAGGT GTTGACCAGC ACGGGTGGGT ATGAGGGCCG GCCGGCGGTG TTGGTTGACG AGCAGGGTCG GGTAGCGGTC TACGTGCGTC GAAGTGGCGC GATCTACGGC GCTAGTCAGC CCGAGGCCGG TGGTCCGTTC GGTGCCTGGG CTGCTCGTGG CACCGGTAGT CCCCAACTCA TCGGTGATCC CACTGCTGTG TATGGCGTTG GTGACCGAAT CGCCCTGTAT GCCGCCGCTA CCAACGACAG TATCGGCGGT GTTAGCCAGG GCGAAGCCGG CGGCACCTTC GGCAACTGGA TCGTCCTGTA G
|
Protein sequence | MRYKSNGLGR KALSIFLAFT VASLTFIIAD IRMSLPVQAS STVGGSISRT EVLTRAEWWV NTYGVIYSQN QNDQKPDPDG HPYRPDCSGF ISMAWHLPKK SDGWDRNTGD LDAFGDTTYL SNLGELLPGD AILGKSYGHV ALFDRWANPS RTEMWIYDEY KSGREGRHII QSRSWYESEG FRGLRYNKIT SMMPDAPDAV SRDGVVVSSS GRISVYAVRA DGDVWGRSQE SPGGSFNAWQ RLSTGGGFAG QVAVLRDDRD RVALYARRSG TIFGASQQEV GGSFGVWGPI GTNGAGVTGD PRAVYASEGR IAIYATTSSG NVSGVTQTQA GGGFGSWQQL TSGGGYMGKP AAVVDSQQRV ALYVRRNGMV YGASQSQANG SFGTWAARGV DGAGVASDPV AVYGVGGRIA IYVTSTAGNV AGVNQVAAGG EFGAWQVLTS TGGYEGRPAV LVDEQGRVAV YVRRSGAIYG ASQPEAGGPF GAWAARGTGS PQLIGDPTAV YGVGDRIALY AAATNDSIGG VSQGEAGGTF GNWIVL
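
The GC content reported in the Gene Information table can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as a string in the space-separated 10-base blocks shown above; `gc_percent` is a hypothetical helper name, and the example call uses only the first two blocks of the gene, not the full sequence.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two 10-base blocks of the gene sequence (8 of 20 bases are G or C).
print(gc_percent("ATGCGATATA AATCCAACGG"))  # 40.0
```

Passing the complete gene sequence to the same function should give a value close to the 60% listed in the record, which is presumably rounded.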