Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_2637 |
Symbol | |
ID | 5706900 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 3002567 |
End bp | 3004405 |
Gene Length | 1839 bp |
Protein Length | 612 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641272097 |
Product | glycoside hydrolase 15-related |
Protein accession | YP_001537467 |
Protein GI | 159038214 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3387] Glucoamylase and related glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.872982 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.317086 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGGAGT ACCCGGCCAT CGAGGATCAC GGCCTCATCG GTGACCTGCA GACCGCGGCC CTGATCACCC GAAACGGCAC GATCGACTGG TTCTGCGCGC CCCGGTTCGA CTCCCCGAGT ATCTTCGCGG GTCTGCTCGA CCGCCGCCGC GGCGGCTACT TCTCCGTCGT CCCGGACGGC GTGCAGTACA TCAGCAAGCA GCTCTACCTG CCGAACACCC CGATCCTGAT CACCAGGTTC ATCAGCGCCG ACGGGGTCAG CGAAATCATC GACTTCATGC CGGTCACCGG CGAACACCCG ACCGACCAGC ACCGAATCGT CCGGATCATC AACATGGTGC GCGGCACCAT GCGGTTCCGC CTCGAGTGCT GCCCACGGTT CGACTACGGC CGGCAACGGC ACGTGTTGCG GGTGCACCGC GACGGCAGCG TCTTCCACGC CCCGTCAGGC ACCCTGACCA TCCACACCTC CCGCCGGGGC GGGCTGCTGG TTCCGGCGGA CAACCTGCGC GACGTCAACG GGGACCTTCT CGCCTTCGCG ACCCTGCACG AGGGCGACAC CGGCGCGCTG GTGGCGGAGA CGGCCGCTCC CGGCCACCCC CGACTCTTCC CCGTCGAGGA GGCCCAGGAG CTGTTCGAGG CAACGCGCGA ATACTGGCGC CGCTGGCTCG ACCGGTCCCG GTACACCGGC CGGTGGCGGG AGATGGTCGA ACGCTCGGCG ATCACGCTGA AACTGATGAC CTACGCGCCA ACCGGGGCCA TGATCGCCGC GCCGACCGCC GCGCTGCCCG AGGAGATCGG CGGCACCCGC AACTGGGACT ACCGCTACAC CTGGATCCGG GACACCTCGT TCTCGGTGCA CGCCCTGCTC GGCCTGGGCT TCACCGAGGA GGCCGCTCGG TACACGAACT GGCTCGACGA GCGCATCCGC GAAGCCGGCG ACCACGCGAC CCCCTTGAAG ATCATGTACC GGGTGGACGG CTCCTCCGAC CTACACGAGG AGATCCTCGG CCACCTGGAG GGCTACCTGG GGTCGGGGCC GGTCCGGATC GGCAACGGCG CCGCCGACCA GCTCCAACTC GACATCCACG GCGAGGCCCT GGACGCGATG CTCTTCGCCG ACCAGCAGGG CATCCGGGTC ACGCACCAGG TGTGGCGCAG CACCGTGCGG ATGGTCGACT GGCTCTGCGA CAACTGGAAC CAGCCCGACG CCGGCATCTG GGAGAGCCGC ACCCATCCGC GCGACTACAC GTTCGGCCGG CTCATGTCCT GGGTGGCGAT GGACCGGGCG ATAAGACTGT CCGCCCGCCA CGGCCGTCCC GGCGACATCG CGCGATGGGA ACGCGAACGC GACGCCATCT ACAACCAGAT CATGGCACGG GGTTACGACC GGACCCGTGG CTCCTTCGTG CAGGCGTACG GCGAGCGGGT ACTCGACGCC GCGCTGCTGA TCATGCCGTC GGTCGGGTTC GTCACCCCGA ACGACCCGCT GTGGCAGTCC ACCCTGCGTG CGATCGACGA GGATCTCGTC TCGGACAGCC TGGTCCACCG GTACAAGCCC TCGGAGGCAC CGGACGGCCT TCCCGGCGAC GAGGGCACCT TCAACATGTG CACCTTCTGG TATGTCGCCG CCCTCGCCTG CTCGGGACGG TTGGAGGACG CCCTGCTCAC CTTCGAGAAG ATGTTCACCT TCAGCAACCA CCTTCGGCTC TACGCCGAGG AGATCGCCCT CACCGGGCAG CAGATCGGCA ACTTCCCGCA GGCGTTCAGC CACCTCTCCC TGATCACCAC CGCCCTGGCG CTCGACGGCA TGCTCGACAA CAGTGACCGG GGGCGGTGA
|
Protein sequence | MPEYPAIEDH GLIGDLQTAA LITRNGTIDW FCAPRFDSPS IFAGLLDRRR GGYFSVVPDG VQYISKQLYL PNTPILITRF ISADGVSEII DFMPVTGEHP TDQHRIVRII NMVRGTMRFR LECCPRFDYG RQRHVLRVHR DGSVFHAPSG TLTIHTSRRG GLLVPADNLR DVNGDLLAFA TLHEGDTGAL VAETAAPGHP RLFPVEEAQE LFEATREYWR RWLDRSRYTG RWREMVERSA ITLKLMTYAP TGAMIAAPTA ALPEEIGGTR NWDYRYTWIR DTSFSVHALL GLGFTEEAAR YTNWLDERIR EAGDHATPLK IMYRVDGSSD LHEEILGHLE GYLGSGPVRI GNGAADQLQL DIHGEALDAM LFADQQGIRV THQVWRSTVR MVDWLCDNWN QPDAGIWESR THPRDYTFGR LMSWVAMDRA IRLSARHGRP GDIARWERER DAIYNQIMAR GYDRTRGSFV QAYGERVLDA ALLIMPSVGF VTPNDPLWQS TLRAIDEDLV SDSLVHRYKP SEAPDGLPGD EGTFNMCTFW YVAALACSGR LEDALLTFEK MFTFSNHLRL YAEEIALTGQ QIGNFPQAFS HLSLITTALA LDGMLDNSDR GR
|
| |