Gene P9301_08951 details

Gene Information

Locus tag: P9301_08951
Symbol:
ID: 4912379
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9301
Kingdom: Bacteria
Replicon accession: NC_009091
Strand:
Start bp: 769851
End bp: 771611
Gene Length: 1761 bp
Protein Length: 586 aa
Translation table: 11
GC content: 31%
IMG OID: 640160477
Product: glycoside hydrolase family protein
Protein accession: YP_001091119
Protein GI: 126696233
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
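
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon while the protein length does not. The function names are hypothetical and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(769851, 771611)
print(length)                     # 1761
print(protein_length_aa(length))  # 586
```

Both results agree with the Gene Length (1761 bp) and Protein Length (586 aa) fields listed in the record.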


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.497034
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAGCAAA TTGATTCAGA GAAAAAAATA GATAGATTAA AAATTGATAA ATTGCTAAAA 
ACAATTTATT CAAATAATAC TACAGAAGAA ATTAATTTTA TTTCAAATCA ATTATTACAG
ATTTTAGATG ATTTCTCAGA GAAATCTGCT TATGAAGAAA AAAGAGACAA GGAAAGGTGG
AATGAATCTC ATTCGGTTTT GATAACTTAT GCAGATAGTA TTTATAAAAA TGGCGAGGCA
ACATTAACAA CTCTTAATAA GTTTTTAAGT AAACATTTTG GCAGTCTTTC TAAAGTTGTA
CATATTCTTC CTTTTTTGAA ATCCACAAGT GATGGAGGTT TTGCCGTCTC AAGTTATGAT
TCCTTAGAAG AAAAATTTGG TGGTTGGGAT GATCTCAAAA GTATTTCTAA AAATCATGAT
TTGATGGCTG ATTTAGTACT AAACCATGTT TCGTCATCTC ATCCATGGGT TCAACAATTT
ATTAAATACC AAGAACCGGG TATATCAAAT GTTTTTTCAC CAAAACAAAA TCTTGACTGG
TCAAATGTAG TTAGACCAAG AAGTTCCTCC TTGTTTTCTC AAATAAATAC TGAAGATGGC
CCTAAGCAAG TTTGGACAAC TTTTGGTCCA GATCAAATTG ATTTGAATTG GCACAATCCA
AAAATGACTA TTGAGTTCTT AAATTTAATT ATTACTTATT TATCTAATGG AATTAAATGG
TTAAGGCTTG ATGCTGTAGG TTTTATTTGG AAGGAATCAG GGACAACATG CTTACATTTG
CCGAAAGCAC ATTCAATCGT GAAACTCTTG AGAGTTCTTT TAAATAATCT TCTTGATGAG
GGAGTTTTAA TAACTGAAAC TAATGTTCCC CAGAAGGAAA ATCTATCTTA TCTGATTCCT
GATGATGAGG CCCATATGGC ATACAATTTC CCATTGCCTC CCCTTCTCCT AGAGGCAATT
ATTACTTCAA GAGCTGATAT TCTAAACTCA TGGATTTTTG ATTGGCCCAT ACTACCTAAA
GAAACTACTT TATTTAATTT CACTGCATCG CACGATGGTG TTGGGCTAAG AGCTCTTGAG
GGTTTAATGA ATGAACAGAG AATTAAAGAT TTATTAATTA ATTGTGAGAA AAGAGGTGGA
TTAGTAAGTC ATAGACGTTT ATCAAATGGT GATGATAAGC CTTATGAATT AAATATTAGT
TGGTGGAGTG CAATGGAAGA CTCCAGTAGA GATGCTAAAA GATTTCAATA TGAGAGATTT
ATTTTGAGTC AATTATTAGT AATGGCTCTA AAAGGGGTTC CTGCATTTTA TTTGCCAGCA
TTATTAGCTT CAGAAAATGA TATAAAGAGT TTTTCTTTGA CAGGTCAAAG AAGAGACCTT
AATAGAGAAA AGTTTAAATC AGAAAATCTT TTAGCGGTTT TAAATAATCC TGAATCTAAT
GCTAATAAAA ACTTAAAATG TCTTCGTAAT GCAATGGATG TCAGATCAAA ATTAAAGCAA
TTTCACCCTT GTTCAGAAAT GAAATGTTTG TCTAAAGGTA GAAGTGATAT TGTTGTAATC
AAACGAGGTA ATGGTCCTGA GTCGGTTTTT GCAATCCATA ATATGACTGA AAATAAAATT
AACTATCAAC TGAATGATAA TGATTTACCA AAAATAATTG ATAACGATTT CAATACCCAT
GATTTTTTAT CATCCATTAA ATATAATCGC AAAAATATTA GTCTTGATCC TTTTCAAGTA
ATTTGGCTTA GTGCTTTATA A
 
Protein sequence
MKQIDSEKKI DRLKIDKLLK TIYSNNTTEE INFISNQLLQ ILDDFSEKSA YEEKRDKERW 
NESHSVLITY ADSIYKNGEA TLTTLNKFLS KHFGSLSKVV HILPFLKSTS DGGFAVSSYD
SLEEKFGGWD DLKSISKNHD LMADLVLNHV SSSHPWVQQF IKYQEPGISN VFSPKQNLDW
SNVVRPRSSS LFSQINTEDG PKQVWTTFGP DQIDLNWHNP KMTIEFLNLI ITYLSNGIKW
LRLDAVGFIW KESGTTCLHL PKAHSIVKLL RVLLNNLLDE GVLITETNVP QKENLSYLIP
DDEAHMAYNF PLPPLLLEAI ITSRADILNS WIFDWPILPK ETTLFNFTAS HDGVGLRALE
GLMNEQRIKD LLINCEKRGG LVSHRRLSNG DDKPYELNIS WWSAMEDSSR DAKRFQYERF
ILSQLLVMAL KGVPAFYLPA LLASENDIKS FSLTGQRRDL NREKFKSENL LAVLNNPESN
ANKNLKCLRN AMDVRSKLKQ FHPCSEMKCL SKGRSDIVVI KRGNGPESVF AIHNMTENKI
NYQLNDNDLP KIIDNDFNTH DFLSSIKYNR KNISLDPFQV IWLSAL
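
The gene sequence begins with GTG while the protein begins with M, which fits the record's translation table 11: an alternative start codon is read as methionine when it initiates translation. The following is a rough Python sketch of that translation step, not the database's own pipeline. It assumes the standard codon-to-amino-acid assignments, which table 11 shares for internal codons, and a simplified set of start codons (ATG, GTG, TTG). The function name `translate_cds` and the constant names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Simplified assumption: only these start codons are treated as Met.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and newlines of the listing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate_cds("GTGAAGCAAA TTGATTCAGA GAAAAAAATA"))  # MKQIDSEKKI
```

The printed residues match the first ten residues of the protein sequence in the record.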