Gene P9303_15281 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_15281 
Symbol 
ID4778670 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1331545 
End bp1332813 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content46% 
IMG OID640087037 
ProductABC transporter for sugars, solute-binding protein 
Protein accessionYP_001017537 
Protein GI124023230 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.493249 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAGAA GGACATTTTG GAAAATAGCT CTGCTGTTCT CTTTGACTAG CGTTGCACTA 
TTTGCAAGCT GGGCCCTGAG CACTCGCCCT GTTCAGATCA ACATCTTAAT GCCAGCTCCA
TTTGCTGAGT CAACAACTGA TCTAGTCCAA AAATTCAACA AAGATCATCA CGGCAGTATC
CAACTTCAGG TGACTCGAGG TCCACTTGAA ACAGAAGCAG TATCGGATTT AGCCATCAGC
AGTCTTCTTT TAGGTAAAAG TCCATTCGAT GCACTTTTAA TTGATGTCAC TTGGTTGCCG
AAATACGCCG CCGCAGGATG GCTTATTCCC TTGGATCCAT GGATCGATCA ACAACAGATT
GATTCCATTG CTCCGGGGGC AATGCTTGGA AATAATTTCG ACGGGAAACT GTATCGATGG
CCACTTGTGG CTGACATGGG ATTGCTTTAT TGGCGGACAG ATTTGATGAG TGAACCACCG
CGTACCCCTG AGGAACTCAT TAAAGTTAGT CTTAAGCTCC AAAAAGAAGG CCGTATTGCT
TTTGGCTATG TCTGGCAAGG CCGTCAATAT GAAGGCCTAA GTTGTGTATT TCTAGAAGTT
CTTGACGGAT TTGGAGGACA ATGGCTTGAA CCCGAAACCG ATAATGTTGG CCTCGACAGC
TCCGCAAGTC TTCAGGCTGC AAGTTGGTTG CGTGAACTGA TCAGCAGTGG AGTGAGCCCC
GAAGCAGTGA TCAACTATGC CGAAAATGAA ACTCTTCAAG CCTTTAAGTC TGGAGATGTT
GCGCTAATGC GTAACTGGCC TTATGCCTGG GGAGAACTGC AGAAGCCTAA CAGCGATGTT
AGAGGCAATG TTGGAGTAAC CACGATGGTT GCTACCGCTG CCAATCGATC CACATCAACC
CTAGGCAGCT GGGGCTTCTC GATCCTCAAG GGCTCATCTA ATCCTCAAGC TGCAGCAGAG
GCCATTGCCT TCCTCACATC AACATCTGCA CAAAAAAGAC TGTTCTTGAA CGACAGCTAT
ACCCCAACTA AAGCAGAACT ATTTAAAGAC CCAGAACTGC TCTCAAAATC ACAAATTCTT
CCGGAGCTTG CTAATGCCCT ACAAAGCACT GATCAACGTC CAGCAACCCC TCTATATGCA
CAGATTAGTG ATGTACTACA ACGAAATCTG AGTTCAATTT TTACAGGCCA ATCCACCGTT
AGCGATGCAA TGTCCAACGC CCAAGCAAAC ACCAAGAAGA TTCTTATGGC GGCAAGAGAA
ACTAAATGA
 
Protein sequence
MKRRTFWKIA LLFSLTSVAL FASWALSTRP VQINILMPAP FAESTTDLVQ KFNKDHHGSI 
QLQVTRGPLE TEAVSDLAIS SLLLGKSPFD ALLIDVTWLP KYAAAGWLIP LDPWIDQQQI
DSIAPGAMLG NNFDGKLYRW PLVADMGLLY WRTDLMSEPP RTPEELIKVS LKLQKEGRIA
FGYVWQGRQY EGLSCVFLEV LDGFGGQWLE PETDNVGLDS SASLQAASWL RELISSGVSP
EAVINYAENE TLQAFKSGDV ALMRNWPYAW GELQKPNSDV RGNVGVTTMV ATAANRSTST
LGSWGFSILK GSSNPQAAAE AIAFLTSTSA QKRLFLNDSY TPTKAELFKD PELLSKSQIL
PELANALQST DQRPATPLYA QISDVLQRNL SSIFTGQSTV SDAMSNAQAN TKKILMAARE
TK