Gene Information |
Locus tag | Sare_4050 |
Symbol | |
ID | 5706313 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4606710 |
End bp | 4608149 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641273476 |
Product | carbohydrate-binding family 6 protein |
Protein accession | YP_001538831 |
Protein GI | 159039578 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2273] Beta-glucanase/Beta-glucan synthetase |
TIGRFAM ID | |
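
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the gene length counts the stop codon; neither convention is stated in the record, but both are consistent with its numbers. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 4606710
end_bp = 4608149
gene_length_bp = 1440
protein_length_aa = 479

# Assumption: coordinates are 1-based and inclusive,
# so the span covered by the gene is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, so an
# unspliced CDS encodes (number of codons - 1) amino acids.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```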
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.567483 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0446183 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCACACGC CACCCGGGGC GCTTGCCCCC CTACCCGCGA GACGCCGGAA GCGACTGGCG CTGGCGCTGT TCACTACCCT CACCACCACC GCGCTGGCCG GACTCGCGGT CACCGCGAAT GCCGCCGTAC CCGCCCCTGC GCCCGGCTGG AGTCTGGTCT GGAGTGACGA CTTCGACGGC GCGGCCGGTA CTCTGCCCTC CTCGGCCAAC TGGATCATCG ACGTCGGCAC CAGCTACCCT GGCGGGCCGC CCAACTGGGG TACCGGTGAG ATCCAGACGT ACACCGACAG CACCGCCAAC ATCAGTCACG ACGGTGCCGG GAACCTGCGA ATCACCCCAC TGCGAGACTC GTCGGGCGGG TGGACCTCGG CCCGGATCGA GACCGTGCGT ACCGACTTCA AACCGCCGTC CGGTGGCGTC CTCGCGATCG AGGGACGGCT CCAGGTGCCG AACGTGACCG GCGCCCAGGC GGCCGGCTAC TGGCCGGCGT TCTGGGCGCT CGGGTCACCG TACCGAGGCA ACTACCAGAA CTGGCCCAGC ATCGGCGAGT TCGATGTGAT GGAGAACGTC AACGGGATCA ACTCCGTGTG GGGTGTGCTG CACTGCGGCT ACGCGCCGGG CGGGCCGTGC GACGAGTTCA ATGGCATCGG TGCCTCCCGG ACCTGCCCGG GGGCGACCTG CCAGTCGGCG TTTCACACCT ACCGGTTCGA GTGGGACGCC TCGGTCAGTC CACAGGTGCT GCGCTGGTAC GTCGACGGCG AGCTCTACCA CACGGTGACC GAGACCCGGG TCGGTGAGCC GGCCTGGTCG CAGATGACCG GCCACGCCGG CTACTTCCTG CTGCTCAATG TGGCGATGGG AGGCGCGTTC CCGAACGGTG TCGCCGGGGG AACCACCCCG ACCGCCGCGA CGGTGCCGGG TCGACCAATG GTCGTCGACT ACGTCGCCGT CTACAGCCGT GGTGGGGGCA CCGCGCCGCC GACCACCGCA CCGCCGACCA CCGCGCCACC GACCACTGCG CCGCCCGGCG GGGTGCGGGA TGCCTACGGG AGGATCGAGG CCGAGTCGTT CAACGGTCAG AGTGGGGTCA GGGCGGAGGA CTGCTCCGAG GGCGGACAGA ACATCGGGTA CCTGCGTGAC GGTGACTGGG CCCGGTACGA CAACGTCGAG TTCGGAACAA CGCCACCACG GGACTTCGTC GCTCGGGCCG CCTCCGGCGC CGGAGACGGG GTGAGCGGCT TGGTCGAGGT ACGACTGGGA AGTCCGACCA GCCCGCCGAT CGGTAGCTTC GCGATCGGTG ACACCGGCGG CTGGCAGAGC TGGCGTTCGG TGCCCGGTAA CGTCGCCGGA CCCACCGGCC GCCACACGGT CTACCTGACC TTCACCAGCG GCCAGCCGAA CGACTTCGTC AACATCAACT GGTTCTCCTT CCGCCGCTGA
Protein sequence | MHTPPGALAP LPARRRKRLA LALFTTLTTT ALAGLAVTAN AAVPAPAPGW SLVWSDDFDG AAGTLPSSAN WIIDVGTSYP GGPPNWGTGE IQTYTDSTAN ISHDGAGNLR ITPLRDSSGG WTSARIETVR TDFKPPSGGV LAIEGRLQVP NVTGAQAAGY WPAFWALGSP YRGNYQNWPS IGEFDVMENV NGINSVWGVL HCGYAPGGPC DEFNGIGASR TCPGATCQSA FHTYRFEWDA SVSPQVLRWY VDGELYHTVT ETRVGEPAWS QMTGHAGYFL LLNVAMGGAF PNGVAGGTTP TAATVPGRPM VVDYVAVYSR GGGTAPPTTA PPTTAPPTTA PPGGVRDAYG RIEAESFNGQ SGVRAEDCSE GGQNIGYLRD GDWARYDNVE FGTTPPRDFV ARAASGAGDG VSGLVEVRLG SPTSPPIGSF AIGDTGGWQS WRSVPGNVAG PTGRHTVYLT FTSGQPNDFV NINWFSFRR
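
The sequences in this record are printed in space-separated groups of ten characters. A minimal sketch of how such a block might be cleaned up and its GC fraction computed is shown below. The helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tooling, and the sample uses only the first three groups of the gene sequence rather than the full gene.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace separating the 10-character groups."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First three groups of the gene sequence in this record, as a short example.
sample = "ATGCACACGC CACCCGGGGC GCTTGCCCCC"
seq = clean_sequence(sample)
print(len(seq), gc_fraction(seq))
```

Applying the same two functions to the full Gene sequence field would give a value to compare against the GC content reported in the Gene Information table.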