Gene Information |
Locus tag | Sare_1067 |
Symbol | |
ID | 5705680 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 1194478 |
End bp | 1195899 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641270583 |
Product | hypothetical protein |
Protein accession | YP_001535967 |
Protein GI | 159036714 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
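The coordinate fields above agree with one another if the start and end positions are read as 1-based and inclusive: 1195899 - 1194478 + 1 = 1422 bp, and 1422 / 3 = 474 codons, which is 473 residues plus one stop codon. A minimal sketch of that check follows. The function names are illustrative only and are not part of the record or of any database API, and the inclusive-coordinate convention is an assumption that the listed values happen to satisfy.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(1194478, 1195899)
print(length_bp)                  # 1422
print(protein_length(length_bp))  # 473
```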
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.119168 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00309173 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGCGAAA GCGTTCGACG CACCGTAGTC ACGGCACCAC GGCGCCACCC CCTCGCTGTA CTACGCGCCA AGGCCGGCCA CACCCATGGC GAGTACGCCC GCCTGATCGC CGAGACGCAC GCCACGCTGG GGTTCGGCCA CATGGCGGCA CGTCGAGAAA AGGTCTCCCG TTGGGAAGCC GGCCGAGCCA TCCCGGAACG GACCGCACAG CTCGCCATCG CCCACATCCA TGGCGTCGCC CAGGAGCACG TCGACAATCG GGCCTGGCCC GAATGGCTCC ACCTGGCATA CGGCGACGCA CGTCAGCTGG AGCTCCCCTG GACACCCGCG GCTGCCCCAG AGGCCATCCT CGACGCCGTA GCGGAGCGAC AACAGGCGCA ACAGGGATAC CTGCTGGCAA CCGGACCGGC AGCCAAGTCA CTCGCGGAGA ACTGGCAGGA CGCCATGACC GAGGCCCTGA CGAAGGTGCC GCAGCACCTG CCACGACCCG GCCGGGTGCC CCTCATGTGG CAACGCGGAA CGGATGCCGA GCTGGGATCG GTACTCCAGG CGTGCACCCG ACTGCGAACG CTACTCACGT TCGCGGGCTG GTTCACCGCC GGATGGCTGG TGCCCGCCAG CGAGCAGGAA CTGCGGCATG TGGCCAACCA CTTCGCCACC ACGACAGATG TGATCGCGGA GAAGAGCCGT GGGTTGCTGA CGCTCGCCGC AGAGGGACTG TCCCTGTGCG GTTTCATCGC CCGCCTCGAG GGGGAACATG TCAGCGCCCA GCGGTACTAC GTGGCGGGTC TGCGCTGCGC CACGGCCGCC GGCGCGGCGG AGCTCGCCGC GGCGATCATG ACGATCCACG CTGCCCAGTA CCTGGACCTC GGGCTCCACG AGGAGGCCAC CGAGCTCCTG ACGTCCGCAC AGACGTTCCT GCGTCGATCC CGGATACCGG TGCGGGATCC GGCGCTGCCC ACGTTGATGC ACGCGCAGAT CGCCCGGGTG CACGCCCAAC TCGGTGACGA CCTTGGTCGG CGCCGATCAC TCTCGGCGGG GCGCAACGCG TTGGAAAGCG TGCCGTACGG GGATCCCATG GCGATCCTGC CGGCTCGCGG CAGCTGCTGG CTGCAGCTGA TGGACGGAGT GTCTCTCCTG GAACTGGGTC GGCCGGACCA GGCGGTCAAG GCCTTCGATC CGCTGTTCTC CAAACACGTG CCGGAGCTGA ACCTGCCGCC GTCGGTGCGC TCGCTGTACC TGCTGCGAGC CGCGGAGGCC CAGGCGGCGG TCGGGGACAC GGTGGGGAGC GTGGAGTCAG TGGCCCAGGC CACGACGCTA CTCGGCGGTG TTCGGGTGGC GGTCTCCAAA CACGTGCGAC TCGCGCTGCG CGCCTACCAG CACCTGCCAG AGGTGAAGGC GCTGCTGTCG GCGTCCGACT GA
|
Protein sequence | MRESVRRTVV TAPRRHPLAV LRAKAGHTHG EYARLIAETH ATLGFGHMAA RREKVSRWEA GRAIPERTAQ LAIAHIHGVA QEHVDNRAWP EWLHLAYGDA RQLELPWTPA AAPEAILDAV AERQQAQQGY LLATGPAAKS LAENWQDAMT EALTKVPQHL PRPGRVPLMW QRGTDAELGS VLQACTRLRT LLTFAGWFTA GWLVPASEQE LRHVANHFAT TTDVIAEKSR GLLTLAAEGL SLCGFIARLE GEHVSAQRYY VAGLRCATAA GAAELAAAIM TIHAAQYLDL GLHEEATELL TSAQTFLRRS RIPVRDPALP TLMHAQIARV HAQLGDDLGR RRSLSAGRNA LESVPYGDPM AILPARGSCW LQLMDGVSLL ELGRPDQAVK AFDPLFSKHV PELNLPPSVR SLYLLRAAEA QAAVGDTVGS VESVAQATTL LGGVRVAVSK HVRLALRAYQ HLPEVKALLS ASD
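As a spot check on the two sequence fields, the 10-character groups can be joined and the DNA translated codon by codon. The sketch below uses the standard codon assignments, which translation table 11 shares (table 11 differs in which start codons it permits, not in what each codon encodes). The helper names are hypothetical, and only the first 30 bases of the gene sequence are used here.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character groups and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons from position 0; '*' marks a stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


first_30 = "ATGCGCGAAA GCGTTCGACG CACCGTAGTC"
print(translate(first_30))  # MRESVRRTVV
```

The output matches the first ten residues of the protein sequence field. Passing the full gene sequence through the same function should give the listed protein followed by a single `*` for the terminal TGA stop codon.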