Gene Information |
Locus tag | P9301_08951 |
Symbol | |
ID | 4912379 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | + |
Start bp | 769851 |
End bp | 771611 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 640160477 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001091119 |
Protein GI | 126696233 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
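
The coordinate and length fields above are mutually consistent, and the check is simple arithmetic. The sketch below is a minimal illustration, assuming 1-based inclusive coordinates and a protein length that excludes the terminal stop codon; the helper names `gene_length_bp` and `protein_length_aa` are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record above (P9301_08951).
length = gene_length_bp(769851, 771611)
residues = protein_length_aa(length)
print(length, residues)
```

With the values listed for this locus, the result agrees with the Gene Length (1761 bp) and Protein Length (586 aa) fields.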
| |
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.497034 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAGCAAA TTGATTCAGA GAAAAAAATA GATAGATTAA AAATTGATAA ATTGCTAAAA ACAATTTATT CAAATAATAC TACAGAAGAA ATTAATTTTA TTTCAAATCA ATTATTACAG ATTTTAGATG ATTTCTCAGA GAAATCTGCT TATGAAGAAA AAAGAGACAA GGAAAGGTGG AATGAATCTC ATTCGGTTTT GATAACTTAT GCAGATAGTA TTTATAAAAA TGGCGAGGCA ACATTAACAA CTCTTAATAA GTTTTTAAGT AAACATTTTG GCAGTCTTTC TAAAGTTGTA CATATTCTTC CTTTTTTGAA ATCCACAAGT GATGGAGGTT TTGCCGTCTC AAGTTATGAT TCCTTAGAAG AAAAATTTGG TGGTTGGGAT GATCTCAAAA GTATTTCTAA AAATCATGAT TTGATGGCTG ATTTAGTACT AAACCATGTT TCGTCATCTC ATCCATGGGT TCAACAATTT ATTAAATACC AAGAACCGGG TATATCAAAT GTTTTTTCAC CAAAACAAAA TCTTGACTGG TCAAATGTAG TTAGACCAAG AAGTTCCTCC TTGTTTTCTC AAATAAATAC TGAAGATGGC CCTAAGCAAG TTTGGACAAC TTTTGGTCCA GATCAAATTG ATTTGAATTG GCACAATCCA AAAATGACTA TTGAGTTCTT AAATTTAATT ATTACTTATT TATCTAATGG AATTAAATGG TTAAGGCTTG ATGCTGTAGG TTTTATTTGG AAGGAATCAG GGACAACATG CTTACATTTG CCGAAAGCAC ATTCAATCGT GAAACTCTTG AGAGTTCTTT TAAATAATCT TCTTGATGAG GGAGTTTTAA TAACTGAAAC TAATGTTCCC CAGAAGGAAA ATCTATCTTA TCTGATTCCT GATGATGAGG CCCATATGGC ATACAATTTC CCATTGCCTC CCCTTCTCCT AGAGGCAATT ATTACTTCAA GAGCTGATAT TCTAAACTCA TGGATTTTTG ATTGGCCCAT ACTACCTAAA GAAACTACTT TATTTAATTT CACTGCATCG CACGATGGTG TTGGGCTAAG AGCTCTTGAG GGTTTAATGA ATGAACAGAG AATTAAAGAT TTATTAATTA ATTGTGAGAA AAGAGGTGGA TTAGTAAGTC ATAGACGTTT ATCAAATGGT GATGATAAGC CTTATGAATT AAATATTAGT TGGTGGAGTG CAATGGAAGA CTCCAGTAGA GATGCTAAAA GATTTCAATA TGAGAGATTT ATTTTGAGTC AATTATTAGT AATGGCTCTA AAAGGGGTTC CTGCATTTTA TTTGCCAGCA TTATTAGCTT CAGAAAATGA TATAAAGAGT TTTTCTTTGA CAGGTCAAAG AAGAGACCTT AATAGAGAAA AGTTTAAATC AGAAAATCTT TTAGCGGTTT TAAATAATCC TGAATCTAAT GCTAATAAAA ACTTAAAATG TCTTCGTAAT GCAATGGATG TCAGATCAAA ATTAAAGCAA TTTCACCCTT GTTCAGAAAT GAAATGTTTG TCTAAAGGTA GAAGTGATAT TGTTGTAATC AAACGAGGTA ATGGTCCTGA GTCGGTTTTT GCAATCCATA ATATGACTGA AAATAAAATT AACTATCAAC TGAATGATAA TGATTTACCA AAAATAATTG ATAACGATTT CAATACCCAT GATTTTTTAT CATCCATTAA ATATAATCGC AAAAATATTA GTCTTGATCC TTTTCAAGTA ATTTGGCTTA GTGCTTTATA A
Protein sequence | MKQIDSEKKI DRLKIDKLLK TIYSNNTTEE INFISNQLLQ ILDDFSEKSA YEEKRDKERW NESHSVLITY ADSIYKNGEA TLTTLNKFLS KHFGSLSKVV HILPFLKSTS DGGFAVSSYD SLEEKFGGWD DLKSISKNHD LMADLVLNHV SSSHPWVQQF IKYQEPGISN VFSPKQNLDW SNVVRPRSSS LFSQINTEDG PKQVWTTFGP DQIDLNWHNP KMTIEFLNLI ITYLSNGIKW LRLDAVGFIW KESGTTCLHL PKAHSIVKLL RVLLNNLLDE GVLITETNVP QKENLSYLIP DDEAHMAYNF PLPPLLLEAI ITSRADILNS WIFDWPILPK ETTLFNFTAS HDGVGLRALE GLMNEQRIKD LLINCEKRGG LVSHRRLSNG DDKPYELNIS WWSAMEDSSR DAKRFQYERF ILSQLLVMAL KGVPAFYLPA LLASENDIKS FSLTGQRRDL NREKFKSENL LAVLNNPESN ANKNLKCLRN AMDVRSKLKQ FHPCSEMKCL SKGRSDIVVI KRGNGPESVF AIHNMTENKI NYQLNDNDLP KIIDNDFNTH DFLSSIKYNR KNISLDPFQV IWLSAL