Gene P9211_06751 details

Gene Information

Locus tag: P9211_06751
Symbol: amyA
ID: 5731626
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 592932
End bp: 594668
Gene Length: 1737 bp
Protein Length: 578 aa
Translation table: 11
GC content: 35%
IMG OID: 641285037
Product: glycoside hydrolase family protein
Protein accession: YP_001550560
Protein GI: 159903216
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
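
The start, end, gene length and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(592932, 594668)
print(length_bp)                  # 1737
print(protein_length(length_bp))  # 578
```

Both values agree with the Gene Length and Protein Length fields listed in the record.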


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.335085
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAACAGC AACTAGTGAG ATTGAGTGAA TTACTCAATG AAGTTTACAG AGAACACTCT 
GCAGAAGAAA TTGATTATAT GTGGTCACAA TTGCTGCAGA TTTTGAATCA GCATAGTGAT
AAACGAGATA ATTATGCTGA ACTCTCCGAA CTTTGGAACT CTTCTAGCGC CGTTTTGATT
ACTTATGCTG ATGGTGTATA CAAGTCAGGA GAGCCAACTT TAAAGACCCT TAAAGATTTA
ATCGATTTGC ATTTAAATGA CTTTGCATCG GTTATACATG TCTTGCCTTT TTTGTGTTCC
ACAAGTGATG GTGGGTTTGC TGTATCAGAT TTTGAGAAAT TAGAAACACG TTTTGGCGAA
TGGGATCATT TAAAAGCTCT CTCGAAGAAT CACATATTGA TGGCAGACTT GGTTCTAAAT
CATGTTTCAT CTTCTCATCC ATGGGTCCAA CAATTTATTC AATCTAAGGA GCCTGGGAGT
AAATATATTC TTTCCCCTTC ATCATCTGAA AACTGGGAAG ATGTTACTAG GCCAAGGAAT
ACTTCTCTTT TTACTAACCT TTCTACTACT AAAGGTAAGA AAGATGTTTG GACAACATTT
GGTCCAGATC AAATTGATAT TAATTGGAAA GAGCCATATG TTTTGATAGA ATTTTTAAGA
TTAATTATTA GATATATAGA TTCTGGAATA AAATGGATTC GACTTGATGC CGTCGGCTTT
ATATGGAAAG AGCCAGGTAC AACATGTTTG CACAGAAATG AAGTTCATAA GATAGTTAAG
GCATTAAGAA TTCAGATTAA TGAACTTATA AACTCTAGTG TTTTAATTAC TGAAACTAAT
GTTCCAGAAA AAGAGAATGT TTCATACCTT AGTTCAGGCG ATGAAGCGCA TCTTGCATAT
AACTTTCCTC TCCCTCCCCT TTTACTGGAA AGTTTAATTA CCAATAAAGC AGATTTACTT
AACAATTGGT TATCCTCATG GCCTGAGTTG CCTAAAAACA CAGGGTTTTT AAACTTTACG
GCTTCCCATG ACGGTGTTGG GCTAAGAGCC TTGGAGGGTT TAATGGATCA GAAAAGAATT
CGTGAATTAT TAATAGCTTG TGAGAAAAGA GGAGGTTTAA TCAGCCATAG AAGAATGTCT
AATGGTGAGG ATCAACCTTA TGAATTAAAT ATTAGTTGGT GGAGTGCAAT GGCAGATAAA
GGAAGAGATA CTTCCTTATT TCAGTTTGAG CGCTTTTTAT TGAGTCAACT TTTTGTAATG
GCTTTAAAAG GTGTTCCAGC TTTTTATTTG CAGGCGTTAA TGGCATCGGA AAATGATTTA
ACAACCTTTG CCAAATCTGG CCAAAGGAGA GATTTGAATC GTGAAAAGTT TGAAGCAAAT
ACTTTACGCA TCAAACTGGA GGACGAAAAG TCACATCCAA GTAGAAATTT AACTTCTCTT
AAGAAAGCAA TGCAGGTAAG AAGAAAATTA AATGCTTTTC ATCCCAACCA ACCAATGAAA
TGCCTTAGTA AGAGTCGCAG TGATCTTGTG ATAATTTCTC GTGGTGAAGG TAATGAAACT
ATTTGGGCAT TACATAATAT GACCAATTCA AAACTATGCT TTTCTCTTTC AGAAGGTTTA
AATGTTAATG GAGAATCTAC TGTCTCTTGG GATGATTGCT TAAATGATTA TAAGCGACAT
CAAAATAGAA TAGACTTGCA TCCCTACTCT GTTCATTGGT TAATGAAATC AAACTAA
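
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks have to be removed before any calculation. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence as printed in the record (six blocks of 10).
first_line = "ATGGAACAGC AACTAGTGAG ATTGAGTGAA TTACTCAATG AAGTTTACAG AGAACACTCT"
print(len(clean_sequence(first_line)))
print(round(gc_percent(first_line), 1))
```

Passing the full sequence text to `gc_percent` would give a figure to compare with the rounded value reported in the record.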
 
Protein sequence
MEQQLVRLSE LLNEVYREHS AEEIDYMWSQ LLQILNQHSD KRDNYAELSE LWNSSSAVLI 
TYADGVYKSG EPTLKTLKDL IDLHLNDFAS VIHVLPFLCS TSDGGFAVSD FEKLETRFGE
WDHLKALSKN HILMADLVLN HVSSSHPWVQ QFIQSKEPGS KYILSPSSSE NWEDVTRPRN
TSLFTNLSTT KGKKDVWTTF GPDQIDINWK EPYVLIEFLR LIIRYIDSGI KWIRLDAVGF
IWKEPGTTCL HRNEVHKIVK ALRIQINELI NSSVLITETN VPEKENVSYL SSGDEAHLAY
NFPLPPLLLE SLITNKADLL NNWLSSWPEL PKNTGFLNFT ASHDGVGLRA LEGLMDQKRI
RELLIACEKR GGLISHRRMS NGEDQPYELN ISWWSAMADK GRDTSLFQFE RFLLSQLFVM
ALKGVPAFYL QALMASENDL TTFAKSGQRR DLNREKFEAN TLRIKLEDEK SHPSRNLTSL
KKAMQVRRKL NAFHPNQPMK CLSKSRSDLV IISRGEGNET IWALHNMTNS KLCFSLSEGL
NVNGESTVSW DDCLNDYKRH QNRIDLHPYS VHWLMKSN
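
The protein sequence can be checked against the gene sequence by translating it codon by codon. The record specifies translation table 11, which assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons. Since this gene begins with ATG, the sketch below uses the standard assignments and does not handle alternative start codons. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon-to-amino-acid assignments in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence.
print(translate("ATGGAACAGC AACTAGTGAG ATTGAGTGAA"))  # MEQQLVRLSE
```

The output matches the first block of the protein sequence above. Applying `translate` to the full gene sequence should end at the terminal TAA codon.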