Gene Information |
Locus tag | Cyan7425_2021 |
Symbol | |
ID | 7287946 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 7425 |
Kingdom | Bacteria |
Replicon accession | NC_011884 |
Strand | + |
Start bp | 1916394 |
End bp | 1918316 |
Gene Length | 1923 bp |
Protein Length | 640 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643585016 |
Product | glycoside hydrolase family 2 sugar binding |
Protein accession | YP_002482745 |
Protein GI | 220907434 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATCCT TTGGTAGATG CCTTGAGGAT GGAGCCTCCT GTGCATTAGA CGATCCTGCT ACTTTATTCC AACAGCTGGA ACCGGCTTAC CCCCGTCCGC AACTGCAACG ACGGAACTGG CGATCGTTGA ATGGATTGTG GAAGTTTGCC TACGACGATC AGGGTCAATG GCACCACCCG GATCAGGTGA CAAAATGGCC CACCGCGATC GCGGTTCCCT TTGCGCCCGA ATCCCTCCGC AGTGGCATTG CTGACACCGA TTTTCATCCC AACTGTTGGT ACGAGCGGGA CTTTGATCTA GAGGTTGAGG ATCAGGAACG CATCCTCCTC CACTTTGGGG CCGTGGACTA CCGGGCCAGG GTGTGGGTGA ATGGCCAGTT TGTCGCTGAG CATGAGGGGG GCCATACTCC TTTCAGTGTC GATATTACCG GCGTACTTAA GCCTGGGGGG CGGCAACGGG TGACGGTCTG GGCAGAGGAC GATCCCCACG ATCTGGCTAA ACCCCGAGGC AAACAGGACT GGCAACGACA TCCCCATAGC ATCTGGTATC CCCGTACCAG TGGGATCTGG CAAACCGTCT GGATTGAACG GGTTCCTCCC ACCTATATCC AACGCCTGCG CTGGACCCCC CACTTTGAAC GCTGGGAAAT TGGCTTTGAA GCTTTTATTG CCGGACAGCA GTATCCGGGA GTGCAGATCA AGGTACGGTT GAAAGCCGGT CATCAACTAC TCGTGAATGA CACCTACGAA GTGATCAATG GCGAAGTTCA CCGCAGGATT GCCCTCTCCG ATCCAGGGAT TGATGACTAC CGCAATGAAC TGCTCTGGAG CCCAGAGAAA CCCACGTTAA TCTATGCCGA GATTGAACTC TGGGGGCAGG ATCAGCTTCT CGATCAGGTG ACCTCCTACA CGGCGATGCG AACCGTGGGG ATTCAGCGCG ATCGGTTTAT GCTGAACGGC CATCCCTACT ATCTGCGTCT GGTCTTAGAT CAGGGCTATT GGCCAGACAC CTTGATGACC GCGCCCTCGG AGGATGCCCT GCGCCAGGAG GTGGAGTTAA TCAAGCAGAT GGGTTTTAAT GGCGTTCGGA AGCACCAGAA AATTGAAGAT CCCCGTTTTC TTTACTGGGC GGATGTGTTG GGACTGCTGG TATGGGAAGA AATGCCTAGT GCCTACCGCT TTACCCCTCA GGCAGTGCAA CGGATTAGCC GGGAATGGAC AGAAGTGATT GAGCGGGATG CCAGTCACCC CTGTGTCGTT GCCTGGGTGC CGTTTAATGA ATCCTGGGGA GTGCCCAACC TGACGGAAAC CCCGGCCCAT CGCCACTATG TTCAGGCCCT TTACTATCTC ACCAAAACCC TTGACCCCAC CCGACCCGTG ATTGGCAATG ATGGCTGGGA AAGTACAACC ACAGATATTA TTGCGATTCA CGACTACGAC AATAATCCCC AAACCCTGGC CAAACGCTAT GGTTCAGAAG TAAAACTGGC TGATCTGCTC CATCAACAAC GGCCGGGGGG TCGCATCCTG ACGCTGGATG GTTATCCCCA TCAAGGCCAG CCGGTGATGT TAACAGAGTT TGGTGGCATT GCCTATACGC CTTCTGAGCA GCGGGATTCC ACCTGGGGCT ATGCCCGCTC TGGAGATGCC TCTGAACTGG AACAACGCTA TAGTGCTCTG CTCAATACGG TCAACCGGAT CGAATTGTTC AGTGGTTTCT GTTACACGCA ATTAACGGAT ACGTTTCAGG AAGCGAATGG ACTACTCTAT GCCGATCGCA CCCCCAAGTT TTCGATCGCC 
TCGATCGCGG CTGCTACCTG TGGGAGGATA AAAAGCTCAC CTACAGCCGG AACCAATGCA GCGTTCTGGC AGAACAATAG CCTATCCCAT GCTGCTGAGC AATGTGGTGT GAATGGGTGT TAG
|
Protein sequence | MKSFGRCLED GASCALDDPA TLFQQLEPAY PRPQLQRRNW RSLNGLWKFA YDDQGQWHHP DQVTKWPTAI AVPFAPESLR SGIADTDFHP NCWYERDFDL EVEDQERILL HFGAVDYRAR VWVNGQFVAE HEGGHTPFSV DITGVLKPGG RQRVTVWAED DPHDLAKPRG KQDWQRHPHS IWYPRTSGIW QTVWIERVPP TYIQRLRWTP HFERWEIGFE AFIAGQQYPG VQIKVRLKAG HQLLVNDTYE VINGEVHRRI ALSDPGIDDY RNELLWSPEK PTLIYAEIEL WGQDQLLDQV TSYTAMRTVG IQRDRFMLNG HPYYLRLVLD QGYWPDTLMT APSEDALRQE VELIKQMGFN GVRKHQKIED PRFLYWADVL GLLVWEEMPS AYRFTPQAVQ RISREWTEVI ERDASHPCVV AWVPFNESWG VPNLTETPAH RHYVQALYYL TKTLDPTRPV IGNDGWESTT TDIIAIHDYD NNPQTLAKRY GSEVKLADLL HQQRPGGRIL TLDGYPHQGQ PVMLTEFGGI AYTPSEQRDS TWGYARSGDA SELEQRYSAL LNTVNRIELF SGFCYTQLTD TFQEANGLLY ADRTPKFSIA SIAAATCGRI KSSPTAGTNA AFWQNNSLSH AAEQCGVNGC
|
| |
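The length fields in this record can be cross-checked against each other. The sketch below is a minimal illustration, not part of the database: it assumes the start and end coordinates are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence above ends in TAG). The function names are hypothetical. The `gc_percent` helper shows how a GC percentage could be computed from a sequence string; it is demonstrated only on the first ten bases of the gene sequence, not on the full gene.

```python
# Values copied from the record above.
START_BP = 1916394
END_BP = 1918316
GENE_LENGTH_BP = 1923
PROTEIN_LENGTH_AA = 640


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq.

    Spaces and any other characters are ignored, so the blocked
    10-base layout used in the record can be passed in directly.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


if __name__ == "__main__":
    # Both checks hold for the values listed in the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
    # First ten bases of the gene sequence only.
    print(gc_percent("ATGAAATCCT"))
```

Passing the full gene sequence to `gc_percent` would give a value to compare with the GC content field, keeping in mind that the record reports a rounded percentage.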