Gene P9303_24101 details


Gene Information

Locus tag: P9303_24101
Symbol:
ID: 4777689
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 2120842
End bp: 2123019
Gene Length: 2178 bp
Protein Length: 725 aa
Translation table: 11
GC content: 46%
IMG OID: 640087931
Product: hypothetical protein
Protein accession: YP_001018408
Protein GI: 124024101
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3063] Tfp pilus assembly protein PilF
TIGRFAM ID:
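
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that contributes no residue; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2120842
end_bp = 2123019

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# One codon is three bases. Assumption: the last codon is a stop codon,
# so it is not counted in the protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 2178 0 725
```

Under these assumptions the result agrees with the listed gene length of 2178 bp and protein length of 725 aa.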


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAATCAAG AGGAGATCAT GCAGCAGCTG CAGGTTGCAG TGCAGGCATA TCAGAGCAAA 
GATCTTGATG GCGCTGAGGC AGTTTTTAAG CAGATTCTTG CTGTAAACCC AAAAGAGCCC
AACGCACTGC ATCTTCTTGG TTGTATTTAC AAAGATCGTG GACAACACCA GCAGGCTGTT
GAGTTAATTC AGGCGTCTAT TCGAGAAGAT GAGAGTAATC CAATTCCATT CTTTAATCTT
GGCAAGATCC TTGCTATCGC TGGTCAGCAT GAGAATGCAG TGGGCGTCTT CCAGGAGGCA
TTGAAGAGAA ACCAGCAGAT CCCTGAAACG TGGTTTTGCT TTGCCAATGC TCTGAGGGAG
ATTGGGAAAA CAGAGGAAGC AAAGCAGGCA TATCGAAATG CACTGCAATT AAATCCTGCA
CATGCTGGAG CAGCAGGAAA TTTAGGAGCA CTGCTCACTG ATGATGGTGA GTTGGATGAG
GCTGAGCAAT TATTTGTAAA GGCTGTAGAC CAGTATCCTA ATAATGTGAA TCTCAGAATT
AATTATGGCA GGTTGTTGGC TGAGAAAGCG GAGCATGCTG CGGCAATCAT GCAGTATCAG
ATTGCTTTGC CTTTGGCTCC TCAGTCTCCT GAGTTGCACT ACAACTTTGC AAATGCATTG
AAGGAAGAGG GGGATGTTGA GGAAGCGATC GCAAGTTATC GGAAGGCGAT TGAGGTGAAG
CCCGATTTTG CAGATGCTTA TTTTGCTCTT GGGTTGGTGA TGAAGGAAGA GGGGGATGTT
GAGGAAGCGA TCGCAAGTTA TCGGAAGGCG ATTGAGGTGA AGCCCGATTT TGCAGATGCT
TATTTTGCTC TTGGGTTGGT GATGAAGGAA GAGGGGGATG TTGAGGAAGC GATCGCAAGT
TATCGGAAGG CGATTGAGGT GAAGCCCGAT TTTGCAGATG CTTATTTTGC TCTTGGGTTG
GTGATGAAGG AAGAGGGGGA TGTTGAGGAA GCGATCGCAA GTTATCGGAA GGCGATTGAG
GTGAAGCCCG ATTTTGCAGA TGCTTATTTT GCTCTTGGGT TGGTGATGAA GGAAGAGGGG
GATGTTGAGG AAGCGATCGC AAGTTATCGG AAAGCGATTG AAGTGAAGCC CGATTTTGCA
GATGCGTATT TGAATCTGGG TAATGTGTTG AAGGAAGAGG GGGAGATAGA TGAGGCTAGG
CAAATCATTA CTACTCTTCG ACAGATGAAG TCTTTCGAGA AGGAGACTTG GACCAGGATT
CAAGATAAGA CTTTGGTTTT TGATTGGCAT CACAGAAGAG CTTTGGGGCT TCTCTGGCAG
GTCGAGCTTG CTGCCTTTTC TGGAATGGAG CCCGCCTCTT GTCTTCCTGC TGTAAAGCAA
GCCGATGCTC TTTGCTTTCC TCCTTCATTC TTGGGAGATG AGATTTCTTG TGAAGATGGG
AAGCTGTTGT ATGAAAAGGG ATATTTGGTT GAAGAAGGTC TGGTTTCGCC AGGTTTATGT
GCCCAACTGA TCAGTCAATT CAATAATGGG AAAGCGCCGA TGAGTGCTGC GCTAATCGAA
TCAGTTGAGG TTAATGGTAT TTTGCGATTT GCTTTGGAAA GAATATTTCT GCATACGGGT
TTCCCGCATT TGATTTGGAA TTGCATTTAC TTTGCCAAAG GACCCGACGA TGAAACGGTA
TCTGATACTT GGCATTACGA TAATCATTAC AATGCTTGGA CTCCAAAGTT GATGATATAC
CTTAACTCTC AGGGTGAAGA GTGTGGTGCC ACTGAATTTG TTGATGCAGG TCTCTCGCGG
AAAATCTCTG AGAAATCAGA CTATATGGGT CTAGTTTGGC AGCGTAAGTC TTATCCGGAT
ATGGTGAAGG ACTTAGTTGA AGATCTAAAT CTCGATCCTG TCTCATTAGA TCCTGAGCAT
TACACCTTCT CGCCAGATCA TGTTGGATCG GGTGTGTGGT TTTGTCCTGC CAGGGCTCTG
CATCGAGGTG TTAGTCCTAA GAAAGGTCTG AGACATGTCC TTTCTTTTTC ATTGACACCT
TTGCCTAAGG ACTGTGGCTG GAGTGTTGAC CAGTGTGAGG AGAAATCAAT TGAAATTCTG
AAGGATAAAA TTGAAAAGGG TATGCAAAAG ACTGATGTCA ATCCTTATTG GATGTTGTCT
GAGGTTAGGT CTGCTTAA
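
The gene sequence is printed in space-separated groups of ten bases, so it has to be joined before its length or GC content can be compared with the fields in the Gene Information section. The sketch below shows one way to do that; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups into one uppercase string."""
    return "".join(block.split()).upper()


def gc_percent(sequence: str) -> float:
    """Return the percentage of bases in the sequence that are G or C."""
    if not sequence:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc_count / len(sequence)


# First line of the gene sequence, used here as sample input only.
first_line = "GTGAATCAAG AGGAGATCAT GCAGCAGCTG CAGGTTGCAG TGCAGGCATA TCAGAGCAAA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the whole sequence block through the same two functions gives values that can be compared with the Gene Length and GC content fields.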
 
Protein sequence
MNQEEIMQQL QVAVQAYQSK DLDGAEAVFK QILAVNPKEP NALHLLGCIY KDRGQHQQAV 
ELIQASIRED ESNPIPFFNL GKILAIAGQH ENAVGVFQEA LKRNQQIPET WFCFANALRE
IGKTEEAKQA YRNALQLNPA HAGAAGNLGA LLTDDGELDE AEQLFVKAVD QYPNNVNLRI
NYGRLLAEKA EHAAAIMQYQ IALPLAPQSP ELHYNFANAL KEEGDVEEAI ASYRKAIEVK
PDFADAYFAL GLVMKEEGDV EEAIASYRKA IEVKPDFADA YFALGLVMKE EGDVEEAIAS
YRKAIEVKPD FADAYFALGL VMKEEGDVEE AIASYRKAIE VKPDFADAYF ALGLVMKEEG
DVEEAIASYR KAIEVKPDFA DAYLNLGNVL KEEGEIDEAR QIITTLRQMK SFEKETWTRI
QDKTLVFDWH HRRALGLLWQ VELAAFSGME PASCLPAVKQ ADALCFPPSF LGDEISCEDG
KLLYEKGYLV EEGLVSPGLC AQLISQFNNG KAPMSAALIE SVEVNGILRF ALERIFLHTG
FPHLIWNCIY FAKGPDDETV SDTWHYDNHY NAWTPKLMIY LNSQGEECGA TEFVDAGLSR
KISEKSDYMG LVWQRKSYPD MVKDLVEDLN LDPVSLDPEH YTFSPDHVGS GVWFCPARAL
HRGVSPKKGL RHVLSFSLTP LPKDCGWSVD QCEEKSIEIL KDKIEKGMQK TDVNPYWMLS
EVRSA
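
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not a full implementation of translation table 11. It assumes the standard codon-to-amino-acid assignments, which table 11 shares, and treats a small illustrative set of start codons (`ATG`, `GTG`, `TTG`) as methionine when they appear in the first position. The names `CODON_TABLE`, `START_CODONS`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: an illustrative subset of start codons, read as M in first position.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break  # stop codon ends the protein
        if i == 0 and codon in START_CODONS:
            amino_acid = "M"  # initiator codon is read as methionine
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence.
print(translate("GTGAATCAAG AGGAGATCAT GCAGCAGCTG"))  # prints: MNQEEIMQQL
```

The gene begins with `GTG` rather than `ATG`, which is why the first-position rule is needed to reproduce the leading `M` of the protein sequence.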