Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_08251 |
Symbol | |
ID | 5730945 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 726420 |
End bp | 727712 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 641285189 |
Product | hypothetical protein |
Protein accession | YP_001550710 |
Protein GI | 159903366 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG2165] Type II secretory pathway, pseudopilin PulG |
TIGRFAM ID | [TIGR02532] prepilin-type N-terminal cleavage/methylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.328672 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0000133255 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAAACAAA AAAACAATCC ACTAAGTAAT AAGAATCAGC CAGACAATGG TTTTACCCTA ATAGAAGTAG CAGTTGTAAT GGCTGTATTA TCTGCCTTAA GCAGTTTTGC TATACCTAAC ATAATTAATA CAGTTAAATT GTCTAGGATA GAAGAGACCA AAGCATTGAT GAATTCATAT GCTGCAGATT GCCTTGGGCA ATACAGGGTA TCGACAGACA TAACTGAGTT AAAAGAAAAA GTACCAGAGT ACCTAAGTGA TCAGAAATTG GCGACACTGG GTTACCAATT AGATCCTAAA AACAATAATT GTGAGACTCT AGCAGTCAAA CCATTAAATA ATAAGGACAA AGATCTGCTC TATGAAATGC AGTTTCGAAT TTATGAAGAT GATAAAACAG GTTCAGTAAA AGTATTTAAA GGTGCAACTC CTTCAGATTC ACCTAACCCA AGAAGCTTAC CTTCATGCAG AGGATGGGCT GGAGAAAATT GTGGTTTAAG TGAAGAAGCA CAAGCCAGAA TTGATCGTCT TAATCTAATT GCAGAAGAAA GAAACAAATG CACTACAAAC TTCAATAACA AACAAATAAA CAAGGCAACT GGTCCAGTAA AAACATGGAG GGCACCAGTA AATGATGAAG ATATGGGCGC TTGTGAAGAC CAAGGAATTT GTTTATTTGA AGGTAAATCT TATAGAAGTT GTGATGAGGT AGAAGTAGCT AGACAAAAGA AATATGGTGA TCAATGTAAA GACTGGACTA AAGATATGGC TAAACAAAAG AATAATAAAA AAAGTGAAGA AGGAGAAGGT CAAACTTTAG ACCCTCAATG TGGAGGTCAA CTCTATTGGT TTCACTCTGG AGACATATTA ACAAGTTTTG AAGAATGGGA AGAGAAAAAT GAAGACATGA AGAAGTCTCA ATGCGAAAAA GATAGATCTA GAATAAAAAC AACTAGTCAT AAGGGAGAAT ATGTAATTAA GCCAGCTGAT GGCATTAAAG AGCCTTGTGG CAATAAGATA TTTGTGTATG ATGGGGAAAT ATTGAACTCT GTTGACTATG ACGCTAAATT AAAACAAATA GAAGCAGATA AGAAGAAGAG AGAGGAAGAC AATCGAAATA AGCAAAAGAA AGAGAAAGAA ACAGATAAAA GAGGCAATAT ATGTCCTAAG AAAACATATA CAGATAATCA AGGTTTAAAA TGCTGTCCTT CTAATCCCAC TAAAAAATGT AATAAAGATA AGAAGTATAG AAAGAAAGCT TCTATTTGCG GATGTTGGTA TAAACAGAAA TAA
|
Protein sequence | MKQKNNPLSN KNQPDNGFTL IEVAVVMAVL SALSSFAIPN IINTVKLSRI EETKALMNSY AADCLGQYRV STDITELKEK VPEYLSDQKL ATLGYQLDPK NNNCETLAVK PLNNKDKDLL YEMQFRIYED DKTGSVKVFK GATPSDSPNP RSLPSCRGWA GENCGLSEEA QARIDRLNLI AEERNKCTTN FNNKQINKAT GPVKTWRAPV NDEDMGACED QGICLFEGKS YRSCDEVEVA RQKKYGDQCK DWTKDMAKQK NNKKSEEGEG QTLDPQCGGQ LYWFHSGDIL TSFEEWEEKN EDMKKSQCEK DRSRIKTTSH KGEYVIKPAD GIKEPCGNKI FVYDGEILNS VDYDAKLKQI EADKKKREED NRNKQKKEKE TDKRGNICPK KTYTDNQGLK CCPSNPTKKC NKDKKYRKKA SICGCWYKQK
|
| |