Gene P9211_08251 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_08251 
Symbol 
ID5730945 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp726420 
End bp727712 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content34% 
IMG OID641285189 
Producthypothetical protein 
Protein accessionYP_001550710 
Protein GI159903366 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2165] Type II secretory pathway, pseudopilin PulG 
TIGRFAM ID[TIGR02532] prepilin-type N-terminal cleavage/methylation domain 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.328672 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000133255 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAAACAAA AAAACAATCC ACTAAGTAAT AAGAATCAGC CAGACAATGG TTTTACCCTA 
ATAGAAGTAG CAGTTGTAAT GGCTGTATTA TCTGCCTTAA GCAGTTTTGC TATACCTAAC
ATAATTAATA CAGTTAAATT GTCTAGGATA GAAGAGACCA AAGCATTGAT GAATTCATAT
GCTGCAGATT GCCTTGGGCA ATACAGGGTA TCGACAGACA TAACTGAGTT AAAAGAAAAA
GTACCAGAGT ACCTAAGTGA TCAGAAATTG GCGACACTGG GTTACCAATT AGATCCTAAA
AACAATAATT GTGAGACTCT AGCAGTCAAA CCATTAAATA ATAAGGACAA AGATCTGCTC
TATGAAATGC AGTTTCGAAT TTATGAAGAT GATAAAACAG GTTCAGTAAA AGTATTTAAA
GGTGCAACTC CTTCAGATTC ACCTAACCCA AGAAGCTTAC CTTCATGCAG AGGATGGGCT
GGAGAAAATT GTGGTTTAAG TGAAGAAGCA CAAGCCAGAA TTGATCGTCT TAATCTAATT
GCAGAAGAAA GAAACAAATG CACTACAAAC TTCAATAACA AACAAATAAA CAAGGCAACT
GGTCCAGTAA AAACATGGAG GGCACCAGTA AATGATGAAG ATATGGGCGC TTGTGAAGAC
CAAGGAATTT GTTTATTTGA AGGTAAATCT TATAGAAGTT GTGATGAGGT AGAAGTAGCT
AGACAAAAGA AATATGGTGA TCAATGTAAA GACTGGACTA AAGATATGGC TAAACAAAAG
AATAATAAAA AAAGTGAAGA AGGAGAAGGT CAAACTTTAG ACCCTCAATG TGGAGGTCAA
CTCTATTGGT TTCACTCTGG AGACATATTA ACAAGTTTTG AAGAATGGGA AGAGAAAAAT
GAAGACATGA AGAAGTCTCA ATGCGAAAAA GATAGATCTA GAATAAAAAC AACTAGTCAT
AAGGGAGAAT ATGTAATTAA GCCAGCTGAT GGCATTAAAG AGCCTTGTGG CAATAAGATA
TTTGTGTATG ATGGGGAAAT ATTGAACTCT GTTGACTATG ACGCTAAATT AAAACAAATA
GAAGCAGATA AGAAGAAGAG AGAGGAAGAC AATCGAAATA AGCAAAAGAA AGAGAAAGAA
ACAGATAAAA GAGGCAATAT ATGTCCTAAG AAAACATATA CAGATAATCA AGGTTTAAAA
TGCTGTCCTT CTAATCCCAC TAAAAAATGT AATAAAGATA AGAAGTATAG AAAGAAAGCT
TCTATTTGCG GATGTTGGTA TAAACAGAAA TAA
 
Protein sequence
MKQKNNPLSN KNQPDNGFTL IEVAVVMAVL SALSSFAIPN IINTVKLSRI EETKALMNSY 
AADCLGQYRV STDITELKEK VPEYLSDQKL ATLGYQLDPK NNNCETLAVK PLNNKDKDLL
YEMQFRIYED DKTGSVKVFK GATPSDSPNP RSLPSCRGWA GENCGLSEEA QARIDRLNLI
AEERNKCTTN FNNKQINKAT GPVKTWRAPV NDEDMGACED QGICLFEGKS YRSCDEVEVA
RQKKYGDQCK DWTKDMAKQK NNKKSEEGEG QTLDPQCGGQ LYWFHSGDIL TSFEEWEEKN
EDMKKSQCEK DRSRIKTTSH KGEYVIKPAD GIKEPCGNKI FVYDGEILNS VDYDAKLKQI
EADKKKREED NRNKQKKEKE TDKRGNICPK KTYTDNQGLK CCPSNPTKKC NKDKKYRKKA
SICGCWYKQK