Gene P9303_21691 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_21691 
Symbol 
ID4777500 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1924892 
End bp1926343 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content55% 
IMG OID640087679 
Productglycoside hydrolase family protein 
Protein accessionYP_001018169 
Protein GI124023862 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0470032 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAATGGG GCATGCAGCA GATCGATGGG CTGACCAATA CTCCCCGCTG GGTTGCCCAG 
GCGGTGGTGT ATCAGATCTT CCCTGATCGT TTTCGTTGTA GTGGCCGTGT CTTAGCTCAT
CAGCATTTGG CTTTGCGCTG CTGGGGCAGT GACCCTTCTG AGCAGGGTTT TCAGGGGGGA
GATCTCTACG GGGTGATCGA GGCCCTTGAT CATCTCCAGG CGCTTGGTAT CAGCTGCCTT
TACTTGACAC CCGTCTTTAG CTCTGCCGCT AACCATCGCT ATCACGCTTA TGACTACTTG
CAGGTGGATC CGCTTCTTGG TGGCAATGCA GCGCTAGAGG CTTTGATTGA GGCGGTGCAT
CGCCGCGGGA TGCGCATCAT TTTGGATGGC GTGTTTAATC ACTGTGGTCG CGGGTTCTGG
GCTTTTCATC ATCTTTTGGA AAATGGTGAG GCTTCGCCTT ATCGCGATTG GTTTGAGGTG
CGGCAATGGC CGCTTCATCC CTATCCACGG CGTGGGCAGG ATTGTGGTTA CAGCTGCTGG
TGGAACGATC CAGCCTTGCC AAAGTTCAAT CATGCCCATG CCCCTGTGCG TGAGTATTTG
ATTGCTGTAG CCCGCTATTG GCTCGAGCAG GGAATCGATG GTTGGCGACT TGACGTTGCT
GATGAGGTGC CTGCTGAGTT TTGGCTGGAG TTTCGGCAAA TGGTTAAGGC CGTGAATCCA
GACGCTTGGA TCTTGGCTGA GATCTGGGGT GATGCGAGAT CGTGGCTACA GGGGCAGCAC
TTTGATGGTG TGATGAATTA TCGGATGGGT TGGAGCAGCC TTTGCTGGGT TGCTGGTAAG
CGATTACGCC GTCGGTATCG CAATCCTGCC TATCCCCTTG ACCCTCTGAG TGGGGAGGCT
TTTGTTGAGC TATTGGCAAC AACGCTGGGT TGGTATCGAC CTGAGGTGAA CCGCAGCCAG
TTGAACCTGC TTGATAGCCA CGATGTGCCG AGAGCTCTGC ACACACTTCA CGGTGATCTT
GCGGCGTTGA AGTTGGCCTT GCTGTTGCTG TTTTTGCAAC CAGGGGCGCC TTGCATCTAC
TACGGCACAG AGGCGGGTTT GCAGGGTGGC CCTGAACCAG GTTGCCGCGA AGGGTTTCCT
TGGCATACGC CTTGGCCTGC AGACCTGCGC GATTTCATTC AGTCGTTGAG TGATCTGCGC
CAACGTTGCC CAGCGTTTGC TGATGGCGGT TTGCAATGGC AACCGATTGG AGCTGATGCA
CTTCATGCTT GGTGGATGCA GCCCGAGACA ACCACAACGC AAAGGGAGAC GTCGATTCAG
GTGTGGGTCA ATCGCAGTCG CAGGTCATGG TTGCCGACGA AAGTCTCATC GACAGACCCT
CTTTGGCTGG AAGGAGCATT TGAATGCAAT GGCCGGGGAT TAGGCCCTCA ATCAGCAGTG
TTGTTGAGCT GA
 
Protein sequence
MQWGMQQIDG LTNTPRWVAQ AVVYQIFPDR FRCSGRVLAH QHLALRCWGS DPSEQGFQGG 
DLYGVIEALD HLQALGISCL YLTPVFSSAA NHRYHAYDYL QVDPLLGGNA ALEALIEAVH
RRGMRIILDG VFNHCGRGFW AFHHLLENGE ASPYRDWFEV RQWPLHPYPR RGQDCGYSCW
WNDPALPKFN HAHAPVREYL IAVARYWLEQ GIDGWRLDVA DEVPAEFWLE FRQMVKAVNP
DAWILAEIWG DARSWLQGQH FDGVMNYRMG WSSLCWVAGK RLRRRYRNPA YPLDPLSGEA
FVELLATTLG WYRPEVNRSQ LNLLDSHDVP RALHTLHGDL AALKLALLLL FLQPGAPCIY
YGTEAGLQGG PEPGCREGFP WHTPWPADLR DFIQSLSDLR QRCPAFADGG LQWQPIGADA
LHAWWMQPET TTTQRETSIQ VWVNRSRRSW LPTKVSSTDP LWLEGAFECN GRGLGPQSAV
LLS