Gene P9303_28481 details

Gene Information

Locus tag: P9303_28481
Symbol:
ID: 4776115
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 2518732
End bp: 2520120
Gene Length: 1389 bp
Protein Length: 462 aa
Translation table: 11
GC content: 44%
IMG OID: 640088371
Product: hypothetical protein
Protein accession: YP_001018843
Protein GI: 124024536
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3063] Tfp pilus assembly protein PilF
TIGRFAM ID:
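
The coordinate and length fields above can be cross-checked with a few lines of Python. This is a minimal sketch: it assumes the start and end positions are 1-based and inclusive (which agrees with the listed gene length) and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2518732
end_bp = 2520120

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1389 462
```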


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCCGTC GCACCAATGC TCTTGCTGCT GCTCTATCGC TGCTGGTGCT TGGGTCGCCT 
TTGATTGCTG GGTGTGGAGT TCAGACAACT ATTGCTGAAC TCACAAAGTT AATAGAGACC
AACCCTCAGG TTGCCGCTGC CTACTACAAC CGCGGTATTG CCAAGTATGA CTTAAAAGAT
TATCAAGGAG CAATTGTTGA TTGGGCTAAG GCGATAGATA TTAATCCGCA GTATGTCGCT
GCCTACAACA ACCGCGGTCT TGCCAAGAGG AAATCAGGAG ACTATCAAGG AGCAATTGCT
GATTACAACA AGGCAATAGA GATTAATCCG CAGTATGCCG ATGCCTACTA CAACCGCGGT
AATGCCAAGA GGAAATCAGG AGACTGTCAA GGAGCAATTG CTGATTACAA CAAGTCAATA
GAGATTAATC CGCAGTATGC CGATGCCTAC TACAACCGCG GTAATGCCAA GAGGAAATCA
GGAGACTATC AAGGAGCAAT TGCTGATTAC AACAAGTCAA TAGAGGTTAA TCCGCAGTAT
GCCGATGCTT ATAACAACCG CGGTCGTGCT AAGTATAAAT TAAAAGATAC TCAAGGAGCA
ATTGATGATT TTAATAAGGC GATAGAGATT AATCCGCAAC TTGCCATTGC CTACTCCAAC
CGCGGTAATG CCAAGGATGA ATTAAAAGAT TATCAAGGAG CAATATCTGA TTTCAATAAG
GCAATAGAGA TTATTCCGCA GGATGCCGCT GCCTACTACA ACCGCGGTAA TGCCAAGGAT
GAATTAAAAG ATTATCAAGG AGCAATATCT GATTTCAATA AGGCAATAGA GATTAATCCG
CAGTATGCCG CTGCCTACTA CAACCGCGGT ATTGTCAAGA GGGAATCAGG AGATACTCAA
GAAGCAATTG CTGATTTTAA TAAGGCGATA GAGATTAATC CGCAACTTGC CATTGCCTAC
TCCAACCGCG GTATTGTCAA GAGGGAATCA GGAGATACTC AAGAAGCAAT TGCTGATTTC
AATAGGGCAA TAGAGATTAA CCCGGAGTAT GCCGCTGCCT ACAACAACCG CGGTATTGCT
AAGAAAAACT TAGGTAATTA TCAAGAAGCA ATTGCTGACT ACAACAAGGC AATAGAGATT
AATCCGCAGT ATGCCGCTGC CTACTACAAC CGCGGTATTG CCAAGTATAA ATTAAAAGAT
ACTCAAGGAG CAATTGCTGA ACTTAACAAG GCAATAGAGA TTAATCCGCA GTATGCCGCT
GCCTACAAGA CCCGCGGGAT TGTCAAGGAA TTAGTTGGAG ACCTGAAAGG TGCTTGTGCT
GATTGGAAAG AAGCATCATC TCTTGGTCAA AAAGATGCTG CTGGGTGGGT AGAAGAGGAA
TGCCAATAA
 
Protein sequence
MSRRTNALAA ALSLLVLGSP LIAGCGVQTT IAELTKLIET NPQVAAAYYN RGIAKYDLKD 
YQGAIVDWAK AIDINPQYVA AYNNRGLAKR KSGDYQGAIA DYNKAIEINP QYADAYYNRG
NAKRKSGDCQ GAIADYNKSI EINPQYADAY YNRGNAKRKS GDYQGAIADY NKSIEVNPQY
ADAYNNRGRA KYKLKDTQGA IDDFNKAIEI NPQLAIAYSN RGNAKDELKD YQGAISDFNK
AIEIIPQDAA AYYNRGNAKD ELKDYQGAIS DFNKAIEINP QYAAAYYNRG IVKRESGDTQ
EAIADFNKAI EINPQLAIAY SNRGIVKRES GDTQEAIADF NRAIEINPEY AAAYNNRGIA
KKNLGNYQEA IADYNKAIEI NPQYAAAYYN RGIAKYKLKD TQGAIAELNK AIEINPQYAA
AYKTRGIVKE LVGDLKGACA DWKEASSLGQ KDAAGWVEEE CQ
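
The relationship between the gene sequence, the protein sequence and the GC content field can be sketched in Python as follows. This is an illustrative sketch, not the pipeline used to build this record. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. The codon table is the standard one; translation table 11 assigns the same amino acids to elongation codons and differs only in the start codons it permits, which does not matter here because this gene begins with ATG.

```python
# Standard codon table, built from the conventional TCAG ordering.
# Assumption: table 11 matches this table for every non-start codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three ten-base groups of the gene sequence above.
first_groups = "ATGTCCCGTC GCACCAATGC TCTTGCTGCT"
print(translate(first_groups))  # MSRRTNALAA
```

Passing the full gene sequence to `translate` and `gc_content` allows the result to be compared with the protein sequence and the GC content field listed in this record.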