Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_12541 |
Symbol | |
ID | 5731405 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 1127168 |
End bp | 1128496 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 641285622 |
Product | Tfp pilus assembly protein PilF |
Protein accession | YP_001551139 |
Protein GI | 159903795 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3063] Tfp pilus assembly protein PilF |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.34619 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00140851 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAACTTG GTTCTTCATT CGAACTTTCT TTAGTCCCCC TAATCTGGTC TCTGTTCCTT TCAGGGATAC TCTTTGTATT TAATTTCTTT TTCCCTATAG TTCCAGATTG GTTATTAACT ATATCTGTAA ATTTAGTAAG ATTTTCATTT ATATTTCCAG TTTATATTCT TTTATTTGAT TCTTATATTG ATTTCAAAGA AACACGTGCC CGACGTAAAT CGAGTAAAAA TAAACAAATG AAAGATTTAG AAAATGTAGC AGAGATAAAT AATAGAAAAT CAACGAATTA TAGCAAAGAA GAAGATGAGT TGTTCCACAA GGAAGAAATC AAATATTATA GTAAAGTATT AAATAATACC CCTAATGATA TAGATGCTTA TTTTCAAAGA GGGCAATCCA AGGCAGAGTT GAATGATAAT CAAGGTGCAA TAGAAGATTT TACTAAGATA ATAGATATCG ATAATAATTT TGTTGATGCA TATTTATATC GCGGATATTT TTTAATGCTT TTAGAAAATA ATAAAGAAGC AATCGATGAT TTCAACCAGG TTATAAGAAT AGATCCTAAG AATGAGGTGG CATATAATCA TCGTGCTTGT GCTAAAGAAA ATCTAGGTGA CCGTAAAGGT GCGTTAGATG ATTTCAATAA AGCAATAGAA TTAAATCCAG AATTTGCAAT GGCATATAAC GATCGTGGTC AAATGATATT TTCATCTGGA GATCACAAAA GTGCATTCAA TGACTTCGAT AAAGCGATAT CACTAGATCA AGAAAGTGGT CCTGCTTATC TCAATAGAGG TTTATGTAAA AATATCGATG AAGATTATCA AGGTGCTATT TCCGATTTTA CTAAAGCAGT CACGATCGAC CCCTTTTTTA CAGAAGGATA TGGAAGTCGT GCAATATCAA AAATAAATAT TAACGACTAC GAATCAGCAC TGTATGATCT CAACAAAGTC ATGGAAATTG ATCCTGAAGA GAGTCAAAGA GGGGAAGCAC TCTATTACAG AGGAATTGCT AAGCATGAAT TAGGAGATTA TCAAGGTGCA ATTTTAGACT TCAATAAACT AATAGAATTT GATCAGAGTA ATTCTGATAT CTATAACCGT AGAGGATGTG CCAAAATGTG TTTAGAAAAC TATAGAGGTG CGATATTAGA TTTTAATAAA GCAATAGAAA ATAATTCAAC ATTTTCATTG CCATATTTAA ATAGAGGTTT AGCAAAAGCG AATCTAAATA ATCTTACGAG TGCTTGTTCT GATTGGAAGA AAGCATCAGA ACTTGGAGAA GAAAGTGCTG TTGAATTATT GAAGGAACAT GGCGAATAG
|
Protein sequence | MKLGSSFELS LVPLIWSLFL SGILFVFNFF FPIVPDWLLT ISVNLVRFSF IFPVYILLFD SYIDFKETRA RRKSSKNKQM KDLENVAEIN NRKSTNYSKE EDELFHKEEI KYYSKVLNNT PNDIDAYFQR GQSKAELNDN QGAIEDFTKI IDIDNNFVDA YLYRGYFLML LENNKEAIDD FNQVIRIDPK NEVAYNHRAC AKENLGDRKG ALDDFNKAIE LNPEFAMAYN DRGQMIFSSG DHKSAFNDFD KAISLDQESG PAYLNRGLCK NIDEDYQGAI SDFTKAVTID PFFTEGYGSR AISKININDY ESALYDLNKV MEIDPEESQR GEALYYRGIA KHELGDYQGA ILDFNKLIEF DQSNSDIYNR RGCAKMCLEN YRGAILDFNK AIENNSTFSL PYLNRGLAKA NLNNLTSACS DWKKASELGE ESAVELLKEH GE
|
| |