Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_01821 |
Symbol | |
ID | 4777098 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 200562 |
End bp | 201866 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640085681 |
Product | solute-binding family 1 protein |
Protein accession | YP_001016202 |
Protein GI | 124021895 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGCGGT CACGAAGAAA GTTTTTTGTT GCAGGCCTCG TGGTGCTTGG AGCAGCCTTG ATTGGTTGGG GTTGTCGACC GAAGCAGGGA TCTTCAACTG ATTTGCAGCT CTGGACGTTG CAACTGGCGC CGAAGTTCAA CCCCTATATG GAGGGGGTCA TCAAAAACTG GCAGCAGAAT CACCCCGGCG CGCTGGTGCG TTGGACAGAT CTCCCCTGGG GATCAGTGGA GCGCAAGTTA TTGGCTGCGG TGTTCGCGCG CACCGCACCA GACGTGGTCA ATCTCAATCC CCCCTTTGCT GCAAATCTTG CCAGCAAAGG GGGGATTAGG GATCTGACGC CTTTGCTTCC AGACGATGCA GCCGACCGCT ACCTTCCCTC GGTTTGGGAG TCTGGCCTGG ATGCTGAGGG GCGACAGATC GCCATTCCTT GGTATCTCAC GGTACGGCTG AGTTTGGTTA ACCAGCAGTT GTTGCGACAA GCCGAACTTG AGGCTCCACC GCGACGTTGG CAGGATGTAC CTTCCTATGC GCGCCGCATT CGTGAGCGCA CAGGTCGCTA TGGCCTATTC GTCACTGTGG TCCCTGACGA TTCGACTGAA CTGCTTGAGT CGATGGTTCA GATGGGAGTC ACTTTGCTGG ATTCTCGGAG GCGGGCTGGT TTTGCTACTC CAAAGGGGCA ACGTGCTTTT GCCTTCTGGA CCGATCTTTA TCGGCAAGGT CTCCTGCCTC GAGAGGTGAT AAGCCAGGGT CAGAAGCGAG CGATAGAGCT TTATCAGAGC GGTGAGTTGG CCATGTTGGC GAGCGGGGCT GAATTCCTAC GCACGATCCA GACAAACGCT CCGGCTGTGG CCAGGGTGAC TCACTCCTAT CCACCGCTTG TCGGTGGTGA TGGCAAAGCG AATGTCGCTG TGATGACGTT GGTTGTGCCT AGTCAGAGCA GGCGCCAGCA GGAAGCCGTT GATTTTGCCT TGTTTCTAAC CAATGGGGTT AATCAGGCAA CTTTTGCGCA GCAAGCCAAG GTGTTGCCAT CGTCTAGAAA CGCCTTGCGA CAGGTTCAGA TTGCTCTTAA TGCTGAGCGT CCGGAGTCGC GTGAGGCTGC TCAGATCCGC TCTGCCAGAG CATTGTCTGC CAAGACTTTG AAGCGGGCGA AAGTTTTGGT TCCGGCTTTA CCTGGGATCA AGCGTCTGCA GAGCATTATT TATACCCAGT TACAGCGGGC GATGCTCGAT CAAATCAGCA GCGATGAGGC TGTGGAAGAG GCTGCTCGTC AGTGGAACCG CTACGCAGAA GCTAGATGGC CCTAA
|
Protein sequence | MLRSRRKFFV AGLVVLGAAL IGWGCRPKQG SSTDLQLWTL QLAPKFNPYM EGVIKNWQQN HPGALVRWTD LPWGSVERKL LAAVFARTAP DVVNLNPPFA ANLASKGGIR DLTPLLPDDA ADRYLPSVWE SGLDAEGRQI AIPWYLTVRL SLVNQQLLRQ AELEAPPRRW QDVPSYARRI RERTGRYGLF VTVVPDDSTE LLESMVQMGV TLLDSRRRAG FATPKGQRAF AFWTDLYRQG LLPREVISQG QKRAIELYQS GELAMLASGA EFLRTIQTNA PAVARVTHSY PPLVGGDGKA NVAVMTLVVP SQSRRQQEAV DFALFLTNGV NQATFAQQAK VLPSSRNALR QVQIALNAER PESREAAQIR SARALSAKTL KRAKVLVPAL PGIKRLQSII YTQLQRAMLD QISSDEAVEE AARQWNRYAE ARWP
|
| |