Gene Information |
Locus tag | Sare_4086 |
Symbol | |
ID | 5704739 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 4644959 |
End bp | 4646443 |
Gene Length | 1485 bp |
Protein Length | 494 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641273512 |
Product | hypothetical protein |
Protein accession | YP_001538867 |
Protein GI | 159039614 |
COG category | [S] Function unknown |
COG ID | [COG5305] Predicted membrane protein |
TIGRFAM ID | |
|
|
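The coordinate and length fields above are mutually consistent, and the relationship can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS whose final codon is a stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start/end coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Sare_4086.
length = gene_length_bp(4644959, 4646443)
print(length)                     # 1485
print(protein_length_aa(length))  # 494
```

Both results match the Gene Length (1485 bp) and Protein Length (494 aa) fields reported in the table.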
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00423288 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0380271 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCGCGA TGTCCATGCC ACGGCAGCTG CCGGTGTGGC TGCCGCCAGC CCTGCTGGCG CTCGCGGTAA CCCTCACCGG AGTGACCGGG GCACAGCTCT GGCGGGACGA GTTGGCGACC TGGAGTGCCG CCACCCGCCC GGTCGGCGAC CTGGTCCGGC TCGCCGGCAC GATCGACGCC GCGACCGGAC CGTACTACCT GTTCATGCAC GCCTGGGTGA CCGGAGTGGG CGATTCGGTG GTCGCCCTGC GGCTACCGGC CGTGCTCGCC ATGACCGCGA CCGCGGCGCT GACCGCGGTG CTCGGCGCAC GGCTGTACAG CCGGTCGGCC GGGCTGCTGG CCGGGCTGCT CTTCGCCCTG CTTCCCAGCA CTTCCCGGTT TGGGCAGGAG GCCCGCCCGT ACGCGCTGGC CACCGCCCTC GCTGTCCTGT CGACGCTGCT GCTGGTCACC GCGCTGGACC CGCCGCCCGG CAGCCGGCCG GCACGACGTT GGGCCCGCTG GGCCGGTTAC GCCACCGCGC TGGCCGCGCT GGGCCTGACC CACCTGGTCG CCCTCACCCT GCTTCCGGCG CACGCGGTGG TGGTCCTCGC CACCGCTCGG GGGCAAGCTG GGCGACAGCA GGCCCTGGGC GGGAGCGGCG CGCACCGGAG CATCGTCCGA CCCTGGCTGC TCTCGCTCGT TCCGGTGGTG CTGCTGGTCG GCCCGCTGGT AGTGGTCGCC CACGGTCAGC GCGCCCGCCA GCTCGACTGG GTTGACGCCG CCCGTCCCGC CGACCTCGCC GCCCTGGCCG GCGGGGTGAC GCAGAGCGGG GTTGTCGGTG GCCTGTTGGT CGGGCTCGCC GCGCTCGCTG TCGCGGCGCT GGGACGGGCG GCGCTGCTGC CCGGGACGGC CGTGCTCCTA CCGGTGCTGC TGGTCTTCAC CGTCGGCGCG CTGGTTCCCC TCTGGGTACC CCGATACCTG GTCTTCGTCG TGCCGTTCGG GTGTCTGCTG GCCGGTGTCG CGCTGGCCGG GGTGCCGTTC CTTCCAGCGC TGACCATCGT GGCCCTGGCT GGGGCGCTCG GCCTACCGGC CCAGGCCGCG TTGCGGCGTA CCCACGAGTG GCCCCGCTCG GCACTGGTTG ACTACGCCGG GGCGGCCCGG ATCGTGGCGG ACGGGCAGCG ACCCGTCGAC GCGATCGTCT ACTCGCCCCG AGACAGCTGG CTCTTTCTCG ACCTGGGGAT GGCGTACCAC CTGGACGACC ACCGGCCCCG GGACGTCCTG CTCACCGCCA GCCCGGCACG CCGGGGCGAC CTGTGGGCCA CCGAATGTGC CCGCCCGGCG CAGTGCCTGG CCGGCGTGGA CCGAGTCTGG CTGGTGATGG CCGGCAGGCA CGGCGACCCG CTCGCCGCCG TATCCGGCGC GAAGGGGGAC GCGCTACGGG CCGGGCGCAC GGTCGAGCAG GTCTGGCACC CGCCCGGACT GACTGTCGCC CTGATCCGCC GGTAG
|
Protein sequence | MGAMSMPRQL PVWLPPALLA LAVTLTGVTG AQLWRDELAT WSAATRPVGD LVRLAGTIDA ATGPYYLFMH AWVTGVGDSV VALRLPAVLA MTATAALTAV LGARLYSRSA GLLAGLLFAL LPSTSRFGQE ARPYALATAL AVLSTLLLVT ALDPPPGSRP ARRWARWAGY ATALAALGLT HLVALTLLPA HAVVVLATAR GQAGRQQALG GSGAHRSIVR PWLLSLVPVV LLVGPLVVVA HGQRARQLDW VDAARPADLA ALAGGVTQSG VVGGLLVGLA ALAVAALGRA ALLPGTAVLL PVLLVFTVGA LVPLWVPRYL VFVVPFGCLL AGVALAGVPF LPALTIVALA GALGLPAQAA LRRTHEWPRS ALVDYAGAAR IVADGQRPVD AIVYSPRDSW LFLDLGMAYH LDDHRPRDVL LTASPARRGD LWATECARPA QCLAGVDRVW LVMAGRHGDP LAAVSGAKGD ALRAGRTVEQ VWHPPGLTVA LIRR
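The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any analysis. The following is a minimal sketch of how the GC content field and the start/stop codons might be checked; the helper names are hypothetical, the start codon set lists only the most common starts for translation table 11 (an assumption, since the table allows others), and the example runs on short fragments copied from the record rather than the full gene.

```python
# Common start codons in translation table 11 (assumed subset) and its stop codons.
START_CODONS = {"ATG", "GTG", "TTG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def starts_with_start_codon(seq: str) -> bool:
    """True if the first three bases form one of the assumed start codons."""
    return clean(seq)[:3] in START_CODONS


def ends_with_stop_codon(seq: str) -> bool:
    """True if the last three bases form a table 11 stop codon."""
    return clean(seq)[-3:] in STOP_CODONS


# Fragments copied from the start and end of the gene sequence in the record.
head = "ATGGGCGCGA TGTCCATGCC"
tail = "CTGATCCGCC GGTAG"

print(gc_fraction(head))              # 0.65 for this 20-base fragment only
print(starts_with_start_codon(head))  # True (ATG)
print(ends_with_stop_codon(tail))     # True (TAG)
```

Passing the full Gene sequence value to `gc_fraction` would give the figure to compare against the GC content field (75%) reported in the record.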