Gene A9601_08971 details

Gene Information

Locus tag: A9601_08971
Symbol:
ID: 4717603
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 771397
End bp: 773157
Gene Length: 1761 bp
Protein Length: 586 aa
Translation table: 11
GC content: 32%
IMG OID: 640078609
Product: glycoside hydrolase family protein
Protein accession: YP_001009288
Protein GI: 123968430
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
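
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 771397
end_bp = 773157

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1761 586
```

Under these assumptions the result agrees with the listed Gene Length (1761 bp) and Protein Length (586 aa).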


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.540308
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
GTGAAGCAAA TTGATTCAGA GAAAAAATTA GATAGATTAA TAATTGATAA ATTGTTAAAA 
ACAATTTATT CAAATCTTAC TACAGAAGAA ATTAATTTTA TTTCAAACCA ATTATTACAG
ATTTTAGATG ATTTCTCAGA GAAATCTTCT TATGAAGAAA TAAGATATAA GGAAAGGTGG
AATGAATCTC ATTCGGTTTT GATAACTTAT GCAGATAGTA TTTATAAAGA TGGCGAGGCA
ACATTAATAA CTCTTAGAAA GTTGTTGAGT AAACATTTTG GCAGTCTTTC TAAAGTTGTA
CATATTCTTC CTTTTTTGAA ATCTACAAGT GATGGAGGTT TCGCGGTTTC AAGTTATGAT
TCCTTAGAAG AAAAATTTGG TGGTTGGGAT GATCTCAAAA GTATATCTAA AAATCATGAT
TTGATGGCTG ATTTAGTACT AAACCATGTC TCATCATCTC ACCCATGGGT TCAACAATTT
ATTAAATCCC AGGAACCAGG GATATCAAAT GTTTTTTCAC CGAAACAAAG TCTTGACTGG
TCAAATGTAG TTAGACCTAG AAGTTCCTCT TTGTTTTCTC AAATAAATAC TGATGATGGT
CCTAAACAAG TTTGGACAAC TTTTGGGCCA GATCAAATTG ACTTGAATTG GCATAATCCT
AAAATGACTC TTGAGTTCTT AAATTTAATT ACTACTTATT TATCTAATGG AATTAAATGG
TTCAGGCTTG ACGCTGTAGG TTTTATTTGG AAGGAATCAG GGACTACCTG CTTACATTTA
CCTAAAGCGC ATTCAATTGT GAAACTATTA AGAGTTCTTT TAAATAATCT TCTTGATGAT
GGCGTTTTAA TAACAGAAAC CAATGTTCCT CAGAAAGAGA ATCTATCTTA TCTAGTTCCT
GATGATGAAG CTCATATGGC ATACAATTTC CCATTACCTC CAATTCTCCT AGAAGCAATT
ATTACTTCCA GAGCTGATAT TCTAAACTCA TGGATTTTTG ATTGGCCGGA ATTACCTGAA
GATACTACTC TTTTTAACTT TACTGCATCG CACGATGGTG TTGGGCTAAG AGCTCTTGAG
GGTTTAATGA ATGAGCAGAG AATCAAGGAT TTATTAATTA ATTGTGAGAA AAGAGGAGGA
TTAGTAAGTC ATAGACGTTT ATCAAATGGT GATGATAAAC CTTATGAATT GAATATTAGT
TGGTGGAGTG CAATGGAAGA CTCCAGTAGA GATTCTAAAA GATTTCAATA TGAGAGATTT
ATTTTGAGTC AACTATTAGT AATGGCTCTG AAAGGAGTCC CTGCATTTTA TTTGCCAGCA
TTACTAGCTT CAGAAAACGA TATCAAAAGT TTTTCTATGA CAGGTCAAAG AAGAGATCTA
AACAGAGAAA AGTTTAAATC AGAAAATCTT TCAGCTGTTT TAAATAATCC TGAATCTAAT
GCTAATAAAA ACTTAAAATG TCTTCGTAAT GCTATGGATG TCCGATCAAA ATTAAAGCAA
TTTCACCCTT GTTCACAAAT GAAATGTTTG TCTAAAGGTA GAAGTGATAT TGTTGTAATC
AAAAGAGGTA TAGGTCCTGA GTCTGTTTTT GCAATCCATA ATATGACTGA AAATAAAATT
AATTATCAAT TGAATGATAA TGATCTACCC AAAATAATTG ATAATGATTT CAACATCCAT
GATTTTTTGA CATCCACTAA ATACAATTGC AAAAATATTA GTCTTGATCC TTTTCAAGTA
ATTTGGCTTA GTGCTTTATA A
 
Protein sequence
MKQIDSEKKL DRLIIDKLLK TIYSNLTTEE INFISNQLLQ ILDDFSEKSS YEEIRYKERW 
NESHSVLITY ADSIYKDGEA TLITLRKLLS KHFGSLSKVV HILPFLKSTS DGGFAVSSYD
SLEEKFGGWD DLKSISKNHD LMADLVLNHV SSSHPWVQQF IKSQEPGISN VFSPKQSLDW
SNVVRPRSSS LFSQINTDDG PKQVWTTFGP DQIDLNWHNP KMTLEFLNLI TTYLSNGIKW
FRLDAVGFIW KESGTTCLHL PKAHSIVKLL RVLLNNLLDD GVLITETNVP QKENLSYLVP
DDEAHMAYNF PLPPILLEAI ITSRADILNS WIFDWPELPE DTTLFNFTAS HDGVGLRALE
GLMNEQRIKD LLINCEKRGG LVSHRRLSNG DDKPYELNIS WWSAMEDSSR DSKRFQYERF
ILSQLLVMAL KGVPAFYLPA LLASENDIKS FSMTGQRRDL NREKFKSENL SAVLNNPESN
ANKNLKCLRN AMDVRSKLKQ FHPCSQMKCL SKGRSDIVVI KRGIGPESVF AIHNMTENKI
NYQLNDNDLP KIIDNDFNIH DFLTSTKYNC KNISLDPFQV IWLSAL
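
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its codons translated. The helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the translation makes two simplifying assumptions: translation table 11 uses the same codon-to-amino-acid assignments as the standard code, and the first codon is always read as methionine (which covers the GTG start codon seen here) without checking that it is a valid initiator. It is not necessarily how the portal derived its own values.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: translation table 11 shares these assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate complete codons until the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino = CODON_TABLE[seq[pos:pos + 3]]
        if amino == "*":
            break
        # Simplification: the first codon is always read as methionine.
        protein.append("M" if pos == 0 else amino)
    return "".join(protein)


# First displayed line of the gene sequence, copied from the record.
first_line = "GTGAAGCAAA TTGATTCAGA GAAAAAATTA GATAGATTAA TAATTGATAA ATTGTTAAAA"
print(translate(first_line))             # MKQIDSEKKLDRLIIDKLLK
print(round(gc_content(first_line), 2))  # 0.2
```

The translated fragment matches the first twenty residues of the protein sequence listed above. Passing the full gene sequence to the same functions would allow the whole record to be checked in the same way.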