Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3494 |
Symbol | |
ID | 5704765 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4029680 |
End bp | 4030924 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641272921 |
Product | extracellular solute-binding protein |
Protein accession | YP_001538287 |
Protein GI | 159039034 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0194042 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTAGGG CGGCCGTCGC CGCGGTCGGG GCGCTCGCGT TGCTGTCGCC CGCCGCTTGC GGTGGTGCCG ACAGCGAGGC CGACCAGAAG GATGTGGAGG TCTTCACCTG GTGGGCCGAC GGGGGTGAGA AGGCCGGACT GGACGGCCTG GTGGCGACCT TCGACGAGCA GTGTGACTAC TCCTTCGTGA ACGGAGCGGT CGCCGGTGGC GCCGGATCGA ACGCCAAGCA GGTGCTGGCC TCCCGGCTGC AACAGGGCGA TGCGCCGGAC ACCTTCCAGG CCCACGCCGG TGCCGAACTG TCGGACTACA TCGCGGCCGG CCAGGTGGAG GATCTCAGCG CCCTGTACGA GGAATGGGGC CTCACCGAGG CCCTTCCGTC AGGTCTGATC GACAACCTGA GCGTGGACGG CAAGATCTAC TCGGTGCCGG CGAACATCCA CCGGGCCAAC GTCCTCTGGA CGAACAAGTC GGTGCTCAGC GATGCCGGCG TCACGGCCGA ACCGACCACG ATGGCGGACT TCCTTGCCGC CCTCGAGACC CTGAAGGCCA AGGGCGTCAG CGCGCCGCTC GCGATCGGCA AGGACTGGTC CCAGCTGATG TTGCTGGAGG CGGTCCTGAT CAGTGACCTG GGCCCAGAGG GTTTCAACGG CCTGTGGAAC GGCGGAACCG ACTGGAACAG CCCCGATGTC GCCAAGGGGC TGGAGAACTA CAAGCAGCTG CTCAGCTACA CCAACGCGGA TCGGGACACC TACGACTGGA CCGACGCCGG GAAGCTCCTC ATGGAGGGCA AGGCCGGCTT CTTCCTGATG GGGGACTGGG CTCCGAGTGA CTTCGAGGCC AAGGGGTTCG CTGACTTCGG CCATGTCGCC TTTCCGGGTA ACGGGGACAC CTTCCAGTGG CTCGCCGACT CCTTCGTGTT GCCCCAGGGC GCCAAGAACC CCGAGGGCAC CAAGTGCTGG CTGAAGACCG TCGGCAGCGC CGAGGGACAG CAGGCGTTCA ACATCAAGAA GGGCTCCATC CCCGCGCGTA CCGACGTCAC CGCGACCGAC TACCCCGCCT ACCAGCAGTC GGCCATCGAG GCGTGGAAGA CTGCCACGCA GGTGCCGTCC TGCGCACACG GTGCCGCCTG CTCGCAGGGC GCCGTTGAGG CGGCGAATTC CGCGATCGGC AAGTTCTCCA GCGACCAGGA CGCGGCAGGA CTGCAGAAGG CGATGGCCGC CGCCGCTGCG CTCGGCAGGA ACTAG
|
Protein sequence | MRRAAVAAVG ALALLSPAAC GGADSEADQK DVEVFTWWAD GGEKAGLDGL VATFDEQCDY SFVNGAVAGG AGSNAKQVLA SRLQQGDAPD TFQAHAGAEL SDYIAAGQVE DLSALYEEWG LTEALPSGLI DNLSVDGKIY SVPANIHRAN VLWTNKSVLS DAGVTAEPTT MADFLAALET LKAKGVSAPL AIGKDWSQLM LLEAVLISDL GPEGFNGLWN GGTDWNSPDV AKGLENYKQL LSYTNADRDT YDWTDAGKLL MEGKAGFFLM GDWAPSDFEA KGFADFGHVA FPGNGDTFQW LADSFVLPQG AKNPEGTKCW LKTVGSAEGQ QAFNIKKGSI PARTDVTATD YPAYQQSAIE AWKTATQVPS CAHGAACSQG AVEAANSAIG KFSSDQDAAG LQKAMAAAAA LGRN
|
| |