Gene Information |
Locus tag | Sare_4899 |
Symbol | |
ID | 5707415 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 5565476 |
End bp | 5566651 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641274294 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001539639 |
Protein GI | 159040386 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
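The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that Gene Length counts the terminal stop codon; neither convention is stated in the record, although the listed values are consistent with both. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 5565476
END_BP = 5566651
GENE_LENGTH_BP = 1176
PROTEIN_LENGTH_AA = 391


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, ASSUMING 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, ASSUMING the CDS ends in one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks compare the derived value with the value listed in the table.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 5566651 - 5565476 + 1 gives 1176 bp, and 1176 / 3 - 1 gives 391 aa, which match the listed Gene Length and Protein Length.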
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000537243 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAGACGCC TGGCCCAGGC GCTGACCGTA CTCGCGGTCC TCGCCACCGC CACCCTGGTC GCCGCAGCAC CGGCACAGGC CGTGACGATC TGCGAGCAGT ACGGCTCCAC CACCGTCCAG AACACGTACA TCGTCCAGAA CAACCGCTGG GGCAGCGCCG CCCAGCAGTG CATCGACACG ACCAACAGCG GCTTCCGGAT CACCTCGCAG CAGGGATCCA CCTCGCCCTC CGGGCCGCCG CTGTCCTACC CGTCGATGGT GCTCGGATGT CACTACCTGA ACTGCTCACC CGGGACCAAC CTGCCGAAGA AAGTCGGCCA GATCAGCAGC GTCCCATCCT CGATCAGCTA TTCGTACGCC GGCGGAACCT ACAACGCCGC GTACGACATC TGGCTGGACC CTGCTCCGAA GACCGACGGA GTGAACCGGA TGGAGATCAT GATCTGGTTC CACCGGCAGG GGCCGATCCA GCCGATCGGC AGTCCGGTCG GCAACACCTC GGTGGGCGGC CGTAGCTGGC AGGTCTGGCA GGGCAACAAC GGTGGCAACG ACGTGGTCTC CTACCTGGCA CCCGGGGCCA TCGGAAGCTG GTCGTTCGAC GTCAAGGACT TCATCAACGA CGTCGTAGCG CGCACCCAGG TCACCAACGA CTGGTACCTG ACCAGCCTCC AGGCGGGTTT CGAACCGTGG AGCGGCGGTG TCGGGCTGAG CGTCGACAGT TTCTCCGCCA CGGTGACCGT CGGGACGAAC CCACCGCCCC CACCGGGCAC CAGCGGCACG ATCGTCGGTC AGGGCAGCGG CCGCTGTCTG GACCTTTTGG ACCTCGGTAC CGCCGACGGT ACCCCGATCC AGCTGTGGGA CTGCACCGCC AACTGGAACC AGCTCTGGAC CCGCACCGGC AACACCTTCG TCAACCCACA GACCAGCAAG TGCCTCGATG TCGCCGGCGG TTCCACCGCC AACGGTGCCC AGGTGCAGCT GTATACCTGC AACGGCACCG GGGCCCAGAA CTGGCAGGTC AACGGCGATG GCACCATCAC CAACCCGCAG TCGGGCAAGT GCCTCGACGC GATGGAGAGG GGAACCGCCA ACGGCACCCG GATCCAGATC TGGGACTGCT ACGGCGGCGG CACCCAGGCC AACCAGGTCT GGACGGTCAA CGGCCGCACC CGTTGA
|
Protein sequence | MRRLAQALTV LAVLATATLV AAAPAQAVTI CEQYGSTTVQ NTYIVQNNRW GSAAQQCIDT TNSGFRITSQ QGSTSPSGPP LSYPSMVLGC HYLNCSPGTN LPKKVGQISS VPSSISYSYA GGTYNAAYDI WLDPAPKTDG VNRMEIMIWF HRQGPIQPIG SPVGNTSVGG RSWQVWQGNN GGNDVVSYLA PGAIGSWSFD VKDFINDVVA RTQVTNDWYL TSLQAGFEPW SGGVGLSVDS FSATVTVGTN PPPPPGTSGT IVGQGSGRCL DLLDLGTADG TPIQLWDCTA NWNQLWTRTG NTFVNPQTSK CLDVAGGSTA NGAQVQLYTC NGTGAQNWQV NGDGTITNPQ SGKCLDAMER GTANGTRIQI WDCYGGGTQA NQVWTVNGRT R
|
| |
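The sequences above are printed in space-separated blocks of ten characters, so the spaces need to be removed before any computation. The following is a minimal sketch of that cleanup, with a GC-fraction helper that could be compared against the GC content field. The helper names are hypothetical, and the record does not say how its 66% figure was rounded.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence, used here only as a short example.
    example = "ATGAGACGCC TGGCCCAGGC"
    seq = clean_sequence(example)
    print(len(seq), gc_fraction(seq))
```

Passing the full Gene sequence value through `clean_sequence` and `gc_fraction` should give a figure comparable to the listed GC content, and `len` of the cleaned string can be compared with the Gene Length field.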