Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_08971 |
Symbol | |
ID | 4717603 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | + |
Start bp | 771397 |
End bp | 773157 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 640078609 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001009288 |
Protein GI | 123968430 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.540308 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAGCAAA TTGATTCAGA GAAAAAATTA GATAGATTAA TAATTGATAA ATTGTTAAAA ACAATTTATT CAAATCTTAC TACAGAAGAA ATTAATTTTA TTTCAAACCA ATTATTACAG ATTTTAGATG ATTTCTCAGA GAAATCTTCT TATGAAGAAA TAAGATATAA GGAAAGGTGG AATGAATCTC ATTCGGTTTT GATAACTTAT GCAGATAGTA TTTATAAAGA TGGCGAGGCA ACATTAATAA CTCTTAGAAA GTTGTTGAGT AAACATTTTG GCAGTCTTTC TAAAGTTGTA CATATTCTTC CTTTTTTGAA ATCTACAAGT GATGGAGGTT TCGCGGTTTC AAGTTATGAT TCCTTAGAAG AAAAATTTGG TGGTTGGGAT GATCTCAAAA GTATATCTAA AAATCATGAT TTGATGGCTG ATTTAGTACT AAACCATGTC TCATCATCTC ACCCATGGGT TCAACAATTT ATTAAATCCC AGGAACCAGG GATATCAAAT GTTTTTTCAC CGAAACAAAG TCTTGACTGG TCAAATGTAG TTAGACCTAG AAGTTCCTCT TTGTTTTCTC AAATAAATAC TGATGATGGT CCTAAACAAG TTTGGACAAC TTTTGGGCCA GATCAAATTG ACTTGAATTG GCATAATCCT AAAATGACTC TTGAGTTCTT AAATTTAATT ACTACTTATT TATCTAATGG AATTAAATGG TTCAGGCTTG ACGCTGTAGG TTTTATTTGG AAGGAATCAG GGACTACCTG CTTACATTTA CCTAAAGCGC ATTCAATTGT GAAACTATTA AGAGTTCTTT TAAATAATCT TCTTGATGAT GGCGTTTTAA TAACAGAAAC CAATGTTCCT CAGAAAGAGA ATCTATCTTA TCTAGTTCCT GATGATGAAG CTCATATGGC ATACAATTTC CCATTACCTC CAATTCTCCT AGAAGCAATT ATTACTTCCA GAGCTGATAT TCTAAACTCA TGGATTTTTG ATTGGCCGGA ATTACCTGAA GATACTACTC TTTTTAACTT TACTGCATCG CACGATGGTG TTGGGCTAAG AGCTCTTGAG GGTTTAATGA ATGAGCAGAG AATCAAGGAT TTATTAATTA ATTGTGAGAA AAGAGGAGGA TTAGTAAGTC ATAGACGTTT ATCAAATGGT GATGATAAAC CTTATGAATT GAATATTAGT TGGTGGAGTG CAATGGAAGA CTCCAGTAGA GATTCTAAAA GATTTCAATA TGAGAGATTT ATTTTGAGTC AACTATTAGT AATGGCTCTG AAAGGAGTCC CTGCATTTTA TTTGCCAGCA TTACTAGCTT CAGAAAACGA TATCAAAAGT TTTTCTATGA CAGGTCAAAG AAGAGATCTA AACAGAGAAA AGTTTAAATC AGAAAATCTT TCAGCTGTTT TAAATAATCC TGAATCTAAT GCTAATAAAA ACTTAAAATG TCTTCGTAAT GCTATGGATG TCCGATCAAA ATTAAAGCAA TTTCACCCTT GTTCACAAAT GAAATGTTTG TCTAAAGGTA GAAGTGATAT TGTTGTAATC AAAAGAGGTA TAGGTCCTGA GTCTGTTTTT GCAATCCATA ATATGACTGA AAATAAAATT AATTATCAAT TGAATGATAA TGATCTACCC AAAATAATTG ATAATGATTT CAACATCCAT GATTTTTTGA CATCCACTAA ATACAATTGC AAAAATATTA GTCTTGATCC TTTTCAAGTA ATTTGGCTTA GTGCTTTATA A
|
Protein sequence | MKQIDSEKKL DRLIIDKLLK TIYSNLTTEE INFISNQLLQ ILDDFSEKSS YEEIRYKERW NESHSVLITY ADSIYKDGEA TLITLRKLLS KHFGSLSKVV HILPFLKSTS DGGFAVSSYD SLEEKFGGWD DLKSISKNHD LMADLVLNHV SSSHPWVQQF IKSQEPGISN VFSPKQSLDW SNVVRPRSSS LFSQINTDDG PKQVWTTFGP DQIDLNWHNP KMTLEFLNLI TTYLSNGIKW FRLDAVGFIW KESGTTCLHL PKAHSIVKLL RVLLNNLLDD GVLITETNVP QKENLSYLVP DDEAHMAYNF PLPPILLEAI ITSRADILNS WIFDWPELPE DTTLFNFTAS HDGVGLRALE GLMNEQRIKD LLINCEKRGG LVSHRRLSNG DDKPYELNIS WWSAMEDSSR DSKRFQYERF ILSQLLVMAL KGVPAFYLPA LLASENDIKS FSMTGQRRDL NREKFKSENL SAVLNNPESN ANKNLKCLRN AMDVRSKLKQ FHPCSQMKCL SKGRSDIVVI KRGIGPESVF AIHNMTENKI NYQLNDNDLP KIIDNDFNIH DFLTSTKYNC KNISLDPFQV IWLSAL
|
| |