Gene P9303_01821 details


Gene Information

Locus tag: P9303_01821
Symbol:
ID: 4777098
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 200562
End bp: 201866
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 56%
IMG OID: 640085681
Product: solute-binding family 1 protein
Protein accession: YP_001016202
Protein GI: 124021895
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
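
The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive, the gene is unspliced, and the gene length includes the stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and does not encode a residue.
    return gene_length // 3 - 1


length = gene_length_bp(200562, 201866)
protein = expected_protein_length_aa(length)
print(length, protein)  # 1305 434
```

Under these assumptions the computed values agree with the Gene Length (1305 bp) and Protein Length (434 aa) reported in the record.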


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGCGGT CACGAAGAAA GTTTTTTGTT GCAGGCCTCG TGGTGCTTGG AGCAGCCTTG 
ATTGGTTGGG GTTGTCGACC GAAGCAGGGA TCTTCAACTG ATTTGCAGCT CTGGACGTTG
CAACTGGCGC CGAAGTTCAA CCCCTATATG GAGGGGGTCA TCAAAAACTG GCAGCAGAAT
CACCCCGGCG CGCTGGTGCG TTGGACAGAT CTCCCCTGGG GATCAGTGGA GCGCAAGTTA
TTGGCTGCGG TGTTCGCGCG CACCGCACCA GACGTGGTCA ATCTCAATCC CCCCTTTGCT
GCAAATCTTG CCAGCAAAGG GGGGATTAGG GATCTGACGC CTTTGCTTCC AGACGATGCA
GCCGACCGCT ACCTTCCCTC GGTTTGGGAG TCTGGCCTGG ATGCTGAGGG GCGACAGATC
GCCATTCCTT GGTATCTCAC GGTACGGCTG AGTTTGGTTA ACCAGCAGTT GTTGCGACAA
GCCGAACTTG AGGCTCCACC GCGACGTTGG CAGGATGTAC CTTCCTATGC GCGCCGCATT
CGTGAGCGCA CAGGTCGCTA TGGCCTATTC GTCACTGTGG TCCCTGACGA TTCGACTGAA
CTGCTTGAGT CGATGGTTCA GATGGGAGTC ACTTTGCTGG ATTCTCGGAG GCGGGCTGGT
TTTGCTACTC CAAAGGGGCA ACGTGCTTTT GCCTTCTGGA CCGATCTTTA TCGGCAAGGT
CTCCTGCCTC GAGAGGTGAT AAGCCAGGGT CAGAAGCGAG CGATAGAGCT TTATCAGAGC
GGTGAGTTGG CCATGTTGGC GAGCGGGGCT GAATTCCTAC GCACGATCCA GACAAACGCT
CCGGCTGTGG CCAGGGTGAC TCACTCCTAT CCACCGCTTG TCGGTGGTGA TGGCAAAGCG
AATGTCGCTG TGATGACGTT GGTTGTGCCT AGTCAGAGCA GGCGCCAGCA GGAAGCCGTT
GATTTTGCCT TGTTTCTAAC CAATGGGGTT AATCAGGCAA CTTTTGCGCA GCAAGCCAAG
GTGTTGCCAT CGTCTAGAAA CGCCTTGCGA CAGGTTCAGA TTGCTCTTAA TGCTGAGCGT
CCGGAGTCGC GTGAGGCTGC TCAGATCCGC TCTGCCAGAG CATTGTCTGC CAAGACTTTG
AAGCGGGCGA AAGTTTTGGT TCCGGCTTTA CCTGGGATCA AGCGTCTGCA GAGCATTATT
TATACCCAGT TACAGCGGGC GATGCTCGAT CAAATCAGCA GCGATGAGGC TGTGGAAGAG
GCTGCTCGTC AGTGGAACCG CTACGCAGAA GCTAGATGGC CCTAA
 
Protein sequence
MLRSRRKFFV AGLVVLGAAL IGWGCRPKQG SSTDLQLWTL QLAPKFNPYM EGVIKNWQQN 
HPGALVRWTD LPWGSVERKL LAAVFARTAP DVVNLNPPFA ANLASKGGIR DLTPLLPDDA
ADRYLPSVWE SGLDAEGRQI AIPWYLTVRL SLVNQQLLRQ AELEAPPRRW QDVPSYARRI
RERTGRYGLF VTVVPDDSTE LLESMVQMGV TLLDSRRRAG FATPKGQRAF AFWTDLYRQG
LLPREVISQG QKRAIELYQS GELAMLASGA EFLRTIQTNA PAVARVTHSY PPLVGGDGKA
NVAVMTLVVP SQSRRQQEAV DFALFLTNGV NQATFAQQAK VLPSSRNALR QVQIALNAER
PESREAAQIR SARALSAKTL KRAKVLVPAL PGIKRLQSII YTQLQRAMLD QISSDEAVEE
AARQWNRYAE ARWP
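
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before their length or base composition can be checked against the record. The following is a minimal sketch of that step; the helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGTTGCGGT CACGAAGAAA GTTTTTTGTT GCAGGCCTCG TGGTGCTTGG AGCAGCCTTG"
seq = join_blocks(first_line)
print(len(seq))                    # 60
print(round(gc_fraction(seq), 2))  # GC fraction of this first line only
```

Passing the full gene sequence through `join_blocks` should give a string whose length can be compared with the Gene Length field, and `gc_fraction` of that string can be compared with the reported GC content.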