Gene P9303_22191 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_22191 
SymbolxylB 
ID4778067 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1969067 
End bp1970350 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content58% 
IMG OID640087735 
Productcarbohydrate kinase, FGGY family protein 
Protein accessionYP_001018219 
Protein GI124023912 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1070] Sugar (pentulose and hexulose) kinases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTAACT CGTCCCTGGC GCTCGGCATC GACCTTGGCA CCAGCGGCGT ACGGCTTGCC 
GTGCTCAACG AGCACGGCAA GCTGATCCAC ACAAGCACAG CGGACTATCC AAAAGGGCTT
GAGATCCCTG AAGACTGGAA AACCTGCTGC ACAGAGCTGA TTCGGGCTTT GCCCACCAAC
CTTCGGCTGG CCCTGAGGGC ATTAGCCGTG GATGGCACAT CAGGAACTCT GCTGGCCTGC
GACCACACTG GCACAGCCCT TAGCAGAGCC CTTCCATACA ATCTGAGCTG CCCAGAGCAA
AGGCAAACAC TCATCTCCCT TGTCTCCCAT GGAGAACCAG CCTCAAGTGT GAGCAGCAGC
TTGGCAAGGG CTCTACGACT AATCAGCACC CATGGCCAAA GCGTTCTGCT ACGCCATCAA
GCGGACTGGA TCAGCGGCTG GTTGCTAGGC AATTGGTGCT GGGGAGAAGA AGGCAACAAC
CTGCGCTTAG GCTGGGATTT AGTCAATCAG ACCTGGCCTG CCAGCATTGC CGAAACAGCC
TGGCGGGCAG CACTTCCTGA GATCGTAAGC AGCGGCAAAG TTCTGGGTAA GGTGGCACCT
GAGCAGTCCC AAAGCCTTGG CCTACCGAAA CAACTCCTCG TAGTAGCAGG GACCACCGAC
GCCAATGCTG CTGTTTTAAC TGCCAATGCA GGTCCCGACG ACGGCATCAC CGTGCTGGGC
AGCACCCTTG TGCTGAAACG TTTTACGGAA GGTCCAATCC GTGGTGCTGG CATCACCAAC
CATCGCGTTG GCGGACGATG GCTCTGCGGC GGAGCCTCCA ATGCCGGTGG CAGCGTCCTT
CGACAACTGT TCAGTGATAC CGAGCTCAAA GAGTTAAGCC GCCAGATCAA TCCAGAGTTC
AACAGTGGTC TAATGCTGCG CCCTCTTCCC GGCCCCGGCG AACGCTTCCC AATTGACGAT
CCCACACTCG AACCACAGCT AACGCCACGA CCTGTGAGCG ATTCCCTCTA CCTCCATGGC
CTGCTGGAAG GCCTCGCACA CATCGAATTG CAAGGCTGGC AACGTCTCAA AGAGCTTGGC
GCTCCCCCTC CCAAGCAAGT GATCAGCCTG GGAGGGGGAG CACGCAATCC CCAATGGCGT
CGATTAAGAG AACGGATCCT TGGCATACCC GTCAAGACTT GCACCAACCC ACCAGCTGCC
GGAGTAGCCC GTCTGGCCTT GCAAGCGATC TCTCCTCAAC ACAACTTGGT TAGTACCAAG
CAAGAATCGG ATCAACAGCT CTGA
 
Protein sequence
MPNSSLALGI DLGTSGVRLA VLNEHGKLIH TSTADYPKGL EIPEDWKTCC TELIRALPTN 
LRLALRALAV DGTSGTLLAC DHTGTALSRA LPYNLSCPEQ RQTLISLVSH GEPASSVSSS
LARALRLIST HGQSVLLRHQ ADWISGWLLG NWCWGEEGNN LRLGWDLVNQ TWPASIAETA
WRAALPEIVS SGKVLGKVAP EQSQSLGLPK QLLVVAGTTD ANAAVLTANA GPDDGITVLG
STLVLKRFTE GPIRGAGITN HRVGGRWLCG GASNAGGSVL RQLFSDTELK ELSRQINPEF
NSGLMLRPLP GPGERFPIDD PTLEPQLTPR PVSDSLYLHG LLEGLAHIEL QGWQRLKELG
APPPKQVISL GGGARNPQWR RLRERILGIP VKTCTNPPAA GVARLALQAI SPQHNLVSTK
QESDQQL