Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_2410 |
Symbol | |
ID | 5703694 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 2775135 |
End bp | 2776514 |
Gene Length | 1380 bp |
Protein Length | 459 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641271887 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001537258 |
Protein GI | 159038005 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.840511 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.905848 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCTGG GCGTGAGAAT TGCGGCCCTG GTCGGCGTCG TCGCGGTGAC AGCTGCGGGG GCCATGGTGG CCACCGCGAC CGCGACCGCA GCGGCCTCGC CGACCGCGAC GTTCGTCAAG GTTTCCGACT GGGGCACCGG CTGGGAGGGT CGGTACACCA TCACCAACGG GGGAAGCAGC ACCCTGAACT CCTGGCAGGT CGAGTTTGAC CTACCGACGG GCACCACCGT CGGTTCGTAC TGGAACGCGC TGATGAACCA CGATGGGCAG CACTACCGCT TCACCAACCA GCACTGGAAC GGCACGATCA CGCCGGGCGC CTCGGTGACG TTCGGCTTCC TCGGTGCCGG CTCCGGCAGT CCGAGCGGTT GCCGGCTCAA CGGACAGCCG TGCACACCAA CAGCTCCTCC GACGACGAGT CCGCCACCGA CCACCGCGCC CCCCAGCACC ACGCCGGTTG CAGCGAACGG GCAGTTGCGG GTCTGCGGAC AGCGCCTCTG TAACGAGGAC GGCAAGCAGA TCCAACTCCG GGGCATGAGC ACGCACGGAC TCCAGTGGTA CGCCAACTGC GCGAACACGG CCTCGCTCGA CGTACTCGCC CAGGAGTGGG GTGCGGACGT TCTGCGAATC TCGATGTACA TCCAGGAAGG CGGCTACGAA ACTGACCCGC GCCGATTCAC CGACCTGGTC CACGACTACA TCGAACTGGC CACCGCCCGC GGCCTCTACG CGGTCGTCGA CTGGCACATG CTCACCCCTG GAGACCCGAA CTACAACCTC TCGCGAGCGC GAACCTTCTT CGCGGAGATC GCCGACCGCC ACCGGGACAA GGTCAACGTC CTGTACGAGA TCGCGAACGA ACCGAACGGT GTCAGTTGGG GAGCCATCAA GAGCTACGCC GACCAGGTAA TCCCGGTCAT CCGGGAACGG GATCCGGAAG CCGTGGTGCT TGTCGGCACA CCCGACTGGT CGTCGCTCGG TGTGTCTGGC AGTGGCGGCG GCGTCGATGC CATCCTCGCC GATCCGGTGG CAGCGAGCAA TCTCATGTAC GTCTTCCACT TCTACGCGGC ATCACACGGC GACCCGTACT ACAACACCTT GGCCGACGCG GCCGACCGGC TTCCGATCTT TGTGACCGAG TTCGGAACCC AGCAGTACAC CGGCGACGGC CCGAACAACT TCACCATGTC CCAGCGCTAC CTCGACCTCA TGGCGAACAA GAAGATCAGT TGGGTCAACT GGAACTACTC CGACGACTTC CGCTCTGGCG CGGTCTTCAC GACTGGCACG TGCGCCGCCG GCGAGTTCAG CGGTACGGGC CCCCTCAAAC CGGCCGGCGG TTGGATACGC GAACGCATGC GTACCGCGGA CGACTTCTGA
|
Protein sequence | MKLGVRIAAL VGVVAVTAAG AMVATATATA AASPTATFVK VSDWGTGWEG RYTITNGGSS TLNSWQVEFD LPTGTTVGSY WNALMNHDGQ HYRFTNQHWN GTITPGASVT FGFLGAGSGS PSGCRLNGQP CTPTAPPTTS PPPTTAPPST TPVAANGQLR VCGQRLCNED GKQIQLRGMS THGLQWYANC ANTASLDVLA QEWGADVLRI SMYIQEGGYE TDPRRFTDLV HDYIELATAR GLYAVVDWHM LTPGDPNYNL SRARTFFAEI ADRHRDKVNV LYEIANEPNG VSWGAIKSYA DQVIPVIRER DPEAVVLVGT PDWSSLGVSG SGGGVDAILA DPVAASNLMY VFHFYAASHG DPYYNTLADA ADRLPIFVTE FGTQQYTGDG PNNFTMSQRY LDLMANKKIS WVNWNYSDDF RSGAVFTTGT CAAGEFSGTG PLKPAGGWIR ERMRTADDF
|
| |