Gene P9303_16981 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_16981 
SymbolproW 
ID4778008 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1485369 
End bp1486283 
Gene Length915 bp 
Protein Length304 aa 
Translation table11 
GC content52% 
IMG OID640087207 
ProductABC transporter,membrane component, glycine betaine/proline family protein 
Protein accessionYP_001017707 
Protein GI124023400 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4176] ABC-type proline/glycine betaine transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.659645 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGGATG TGATTGACCA AGTTTTTGCA ACCGTCAGCA CCGGCCCTGT AGGCGAGGGC 
ATGAGCAGAT TCGTGGAATG GTTGCTCAAT CATGCGCAAC CAATTTTCCT AGTGATTGAT
TCTGCAATTA ATGGGCTAGC CGGAGCAATC GAACAGATAC TCTCCGTGCC AGCGCCCTGG
CTGCTAGCTC CCCTGATCGC CATACTCGCA GCTTGGCGAG TGAGTTTGAG CTTCGCCATT
CTCAGCCTTC TTGGTCTGAA CCTGGTTTTG TTCATGGGGC TTTGGCAGCC AATGATCTCA
ACCCTGGCCC TAGTTATTGC AGCCTCATTA CTGGCACTAA TCATTGGCAT CCCGATCGGC
ATTTTTTCTG CCCGTCGCCA ACACATCTGG GCGATTACCC GCCCAGTATT GGATCTAATG
CAAACCATGC CGGCATTCGT TTATCTCATT CCGGCAGTGA TGTTTTTTGG CACAGGGCTT
GTCCCATCCA CTATCGCAAC GCTGATCTTC TCGATGCCAC CCGTAGTACG CCTGACGTAC
CTGGGCATAC GACAAGTCCC TGTTGATCTA ATCGAAGCCG GGCGCGCCTT CGGTTGCTCA
GAACGACAAT TGCTCTGGAA AGTGCAACTT CCAAACGCCT TACCAACCTT GATGGCTGGT
GTTAATCAAA CCATCATGCT CGCTCTATCG ATGGTTGTTA TCGCATCAAT GATTGGTGGT
GGCGGCCTAG GTGATGTGGT GCTAAGAGGC ATCCAACAAC TTGACGTAGG CCTTGGCTTC
GAGGGTGGGC TCGCCGTGGT AATTCTGGCT GTAATCCTGG ATCGCCTCAC ACAAAGCCTG
GCTGCCTCAA ACTTTTCGAG AAAGTCATTG CCGCAGCGCT TTAAAGCCTT TACTAACCTC
TGGACTTCCT CATGA
 
Protein sequence
MMDVIDQVFA TVSTGPVGEG MSRFVEWLLN HAQPIFLVID SAINGLAGAI EQILSVPAPW 
LLAPLIAILA AWRVSLSFAI LSLLGLNLVL FMGLWQPMIS TLALVIAASL LALIIGIPIG
IFSARRQHIW AITRPVLDLM QTMPAFVYLI PAVMFFGTGL VPSTIATLIF SMPPVVRLTY
LGIRQVPVDL IEAGRAFGCS ERQLLWKVQL PNALPTLMAG VNQTIMLALS MVVIASMIGG
GGLGDVVLRG IQQLDVGLGF EGGLAVVILA VILDRLTQSL AASNFSRKSL PQRFKAFTNL
WTSS