Gene P9303_28641 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_28641 
Symbol 
ID4778321 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp2533073 
End bp2534818 
Gene Length1746 bp 
Protein Length581 aa 
Translation table11 
GC content46% 
IMG OID640088387 
Producthypothetical protein 
Protein accessionYP_001018859 
Protein GI124024552 
COG category[N] Cell motility
[O] Posttranslational modification, protein turnover, chaperones
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
[COG3063] Tfp pilus assembly protein PilF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGCTA AGGGTGCTCT TACGGCTGCT GCCCTGTCTT TGCTGCCACT AGGGCAACCA 
CTGCTACTAG GCACTGCTGG CATCACCACA GCAACCACCG CAGTCCTTCT TCAAGCGCTA
GCAGCAGTTG CTCAAGATGC TTCTGCTGTT GCCAAGGTCG CCAAGGCAAT CACTGTTCGT
ATAGAAGGAG CAACGCAAGG ATCTGGCGTT CTCGTCAAAA AAGACGGCAA TCGCTACACA
GTTCTCACAG CATGGCATGT GGTCAGCAGC AATAGACCTG GAGAAGAGGT TGGGATCTAT
ACCTCCGATG GCCAGGATCA TCAACTGAAG CAAGGCAGTA TCCAACGTTT AGGTGAGATT
GATATGGCAG TCCTTACCTT CTCCAGTTCT GGAAATTATG AGGTGGCCTC AATTGGAGAT
GCAAAAACAG TTCAATACGA TGATCCGATC TACGTCGCTG GATTCCCTCT AGCTAATTCA
CAAAACCTTC GTTATGAGAC TGGAGATGTT GTTGCCAACG CAGAAGTAGG CATTGATCAG
GGCTATCAAC TGCTGTATGA CAACAAGACA GCCGCTGGAA TGAGTGGTGG TGTCCTCCTC
AATGCTGATG GAGAGTTGAT TGGTCTTCAC GGGAGGGGTG AAAAAAATGA ATATGCTTCC
AATGGGAATG AAGTCTCAAT GAAGACTGGT GTCAACCAAG GTGTACCGAT TAGTTATTAC
AAGCTTTTCC TTAGTGGATC GCCAGTTGTT GTTGCAAACA ACACTGCTGC AAATGCTGAT
GACTACTATG CACAAGTGCT TGCCTCGGCC AATAAGAAAG GAAGAGAGCA GACTATGGTC
CGCCTAGCAG ATCAGGCATT GAAATTACGC AAAACGGGCT TTGCATACAT CATGCGTGCG
TATGCGAAGA ATGATTTGGG TGATTACCAA GGAGCAATTG ATGATCAAAA TAATGCCCTC
GAGATTAATC CTGATAATGC AGTCGCTTAC GTCAATCGTG GATTAGCTAG GAGTAATATG
GGTGATCCTA AAAGTGCCCT TTCTGATTTT AGCAAGGCAA TAAAGATAGA CCCTGCCAAT
GCGATGGCAT TCAGTAATCG GGGTGTTTCT AAGCAGGCGC TAGGAGATCC TCAAGGGGCG
CTAGATGATT ACAATAAGGC GATAAAGATT GATCCTCGCA ATGCAAATGC CTATGCTAAT
CGCGGTGTTA ACAAGGGCGA TTTAGGAGAT TATCAAGGAG CAATTGCTGA TTACAGCAAG
GCAATTGGAA TCAATCCGCA GCATTCTGAT GCATACTACA ACCGTGGTAT TGCAAAGCTT
GAATCCAAGG ATTATCAAGG AGCAATTGCT GATTACAATA AGGCAATAAG GATTGGCACG
CAGAATGCGA GGATCTATCT TAATCGTGGT CTTGTCTACG ATAATTTAGG CGATTACCAG
CGTGCAATTG CTGATTACAA TAAGGCAATA GAGCTTGATC CGCAGTATGC TCTTGCCTAC
GTGAACCGTG GTCTTGCCAA GATTAAATCA GGAGATATTC AAGGAGCAAT TGCTGATTCC
AATAAGGCAA TAGAACTTGA TCCGCGTATG GCAAAAGCCT ATGCCAATCG TGGCGCAGCA
AAAGGCATGC TAGATGATGC TAAAGGAGGT TGTGCAGATT TCAAAAAAGC AGCATCACTT
GGTTCTCAAC TAGCGGCTCA ATGGTTAAAC CGCGCAGATG CTGCCTGGTG TCGTAATATG
CGATGA
 
Protein sequence
MKAKGALTAA ALSLLPLGQP LLLGTAGITT ATTAVLLQAL AAVAQDASAV AKVAKAITVR 
IEGATQGSGV LVKKDGNRYT VLTAWHVVSS NRPGEEVGIY TSDGQDHQLK QGSIQRLGEI
DMAVLTFSSS GNYEVASIGD AKTVQYDDPI YVAGFPLANS QNLRYETGDV VANAEVGIDQ
GYQLLYDNKT AAGMSGGVLL NADGELIGLH GRGEKNEYAS NGNEVSMKTG VNQGVPISYY
KLFLSGSPVV VANNTAANAD DYYAQVLASA NKKGREQTMV RLADQALKLR KTGFAYIMRA
YAKNDLGDYQ GAIDDQNNAL EINPDNAVAY VNRGLARSNM GDPKSALSDF SKAIKIDPAN
AMAFSNRGVS KQALGDPQGA LDDYNKAIKI DPRNANAYAN RGVNKGDLGD YQGAIADYSK
AIGINPQHSD AYYNRGIAKL ESKDYQGAIA DYNKAIRIGT QNARIYLNRG LVYDNLGDYQ
RAIADYNKAI ELDPQYALAY VNRGLAKIKS GDIQGAIADS NKAIELDPRM AKAYANRGAA
KGMLDDAKGG CADFKKAASL GSQLAAQWLN RADAAWCRNM R