Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_15281 |
Symbol | |
ID | 4778670 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 1331545 |
End bp | 1332813 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640087037 |
Product | ABC transporter for sugars, solute-binding protein |
Protein accession | YP_001017537 |
Protein GI | 124023230 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.493249 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAGAA GGACATTTTG GAAAATAGCT CTGCTGTTCT CTTTGACTAG CGTTGCACTA TTTGCAAGCT GGGCCCTGAG CACTCGCCCT GTTCAGATCA ACATCTTAAT GCCAGCTCCA TTTGCTGAGT CAACAACTGA TCTAGTCCAA AAATTCAACA AAGATCATCA CGGCAGTATC CAACTTCAGG TGACTCGAGG TCCACTTGAA ACAGAAGCAG TATCGGATTT AGCCATCAGC AGTCTTCTTT TAGGTAAAAG TCCATTCGAT GCACTTTTAA TTGATGTCAC TTGGTTGCCG AAATACGCCG CCGCAGGATG GCTTATTCCC TTGGATCCAT GGATCGATCA ACAACAGATT GATTCCATTG CTCCGGGGGC AATGCTTGGA AATAATTTCG ACGGGAAACT GTATCGATGG CCACTTGTGG CTGACATGGG ATTGCTTTAT TGGCGGACAG ATTTGATGAG TGAACCACCG CGTACCCCTG AGGAACTCAT TAAAGTTAGT CTTAAGCTCC AAAAAGAAGG CCGTATTGCT TTTGGCTATG TCTGGCAAGG CCGTCAATAT GAAGGCCTAA GTTGTGTATT TCTAGAAGTT CTTGACGGAT TTGGAGGACA ATGGCTTGAA CCCGAAACCG ATAATGTTGG CCTCGACAGC TCCGCAAGTC TTCAGGCTGC AAGTTGGTTG CGTGAACTGA TCAGCAGTGG AGTGAGCCCC GAAGCAGTGA TCAACTATGC CGAAAATGAA ACTCTTCAAG CCTTTAAGTC TGGAGATGTT GCGCTAATGC GTAACTGGCC TTATGCCTGG GGAGAACTGC AGAAGCCTAA CAGCGATGTT AGAGGCAATG TTGGAGTAAC CACGATGGTT GCTACCGCTG CCAATCGATC CACATCAACC CTAGGCAGCT GGGGCTTCTC GATCCTCAAG GGCTCATCTA ATCCTCAAGC TGCAGCAGAG GCCATTGCCT TCCTCACATC AACATCTGCA CAAAAAAGAC TGTTCTTGAA CGACAGCTAT ACCCCAACTA AAGCAGAACT ATTTAAAGAC CCAGAACTGC TCTCAAAATC ACAAATTCTT CCGGAGCTTG CTAATGCCCT ACAAAGCACT GATCAACGTC CAGCAACCCC TCTATATGCA CAGATTAGTG ATGTACTACA ACGAAATCTG AGTTCAATTT TTACAGGCCA ATCCACCGTT AGCGATGCAA TGTCCAACGC CCAAGCAAAC ACCAAGAAGA TTCTTATGGC GGCAAGAGAA ACTAAATGA
|
Protein sequence | MKRRTFWKIA LLFSLTSVAL FASWALSTRP VQINILMPAP FAESTTDLVQ KFNKDHHGSI QLQVTRGPLE TEAVSDLAIS SLLLGKSPFD ALLIDVTWLP KYAAAGWLIP LDPWIDQQQI DSIAPGAMLG NNFDGKLYRW PLVADMGLLY WRTDLMSEPP RTPEELIKVS LKLQKEGRIA FGYVWQGRQY EGLSCVFLEV LDGFGGQWLE PETDNVGLDS SASLQAASWL RELISSGVSP EAVINYAENE TLQAFKSGDV ALMRNWPYAW GELQKPNSDV RGNVGVTTMV ATAANRSTST LGSWGFSILK GSSNPQAAAE AIAFLTSTSA QKRLFLNDSY TPTKAELFKD PELLSKSQIL PELANALQST DQRPATPLYA QISDVLQRNL SSIFTGQSTV SDAMSNAQAN TKKILMAARE TK
|
| |