Gene P9303_17941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_17941 
Symbol 
ID4778951 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1563868 
End bp1565082 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content40% 
IMG OID640087302 
ProductTPR repeat-containing protein 
Protein accessionYP_001017801 
Protein GI124023494 
COG category[R] General function prediction only 
COG ID[COG4785] Lipoprotein NlpI, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.398676 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGCTAG AGACTCCTGA AGCCATGGCA AATGAGTCGT CCAATCATTC CGTATCTGAG 
CAAACTGCAA ATGACTACTT CAAAGAAGGT GAAAAAAGAT TTCATTTAAA AGATTATCAA
GGGGCAATTG ATTGTTACAG CAAAGCAATT GAGATCAATC CAAATAATGC CATTGCATAC
AATAATCGAG GGAATGTAAA AGATGAACTA GGCGATTATC AAAGCGCAAT GAATGATTAC
AATAAAGCAA TTGACATTAA CAGTCTGGAT GCCAGCTTTT ACATCAATAG AGGTGTCGTC
AAGAGACACT CAAATAACAT CGAAGGGGCA ATCGATGATT ACACAAAAGC TATTGAACTA
GATCCACAAC ACGCTACTGC TTATTACAAT AGGGGGATTG CTAAAGTCAA TCTAAGCGAC
AACAAAGGGG CTATCTTTGA TTATACTAAG GCACTTACCG TAAATCCAAG ACATGCTAAA
TCATACTACA ATAGAGCGAT TAGCAAAAAC AATATTAATG ATATCAAAGG GGCAATTTCT
GATTACACAA AAGCAATTGA GGCCATGCCG GTGTTTGCCT CTGCCTATTA CAATCGCGGC
AATTTAATGG AGAGACTGGG CCGAAGGCAA GCAGCGGTTA CTGACCATGA GAAGGCGCTA
GTAATAAAAC CACAACTTCT CACTGCGATG AACGAGCGTG GTGAAAATAA AAACTTAGTT
GAGAATAAGA TTGTAAATGA TTTGAACAAT GAAGAAGACA GAAGTCAGCT AGATGCATTT
AATTATTATA GCCAAGGCAA TGCTGAACAA AAGCGAGGCA ACAATCAATC AGCGATCGAC
TGTTACACCA AGGCGATAGA AGTCAATCCA CACTATGCCG AGGCATACAA CTACAGGGGC
CTAGCTAATT ACAACCTTTG TGACTATCAA GCTGCGCTTG ATGATTACAA CAAGGCAATA
GAAATTAACT CGATATATGA AGATGCCTAC ATTGGTTGCG GTCTTGCAAA GTCTGCATTA
AGTGATTACC AAGGTGCAAT TGGAGCCTAT GAGAGGGTAC TAGTCATTAA CCCTAAGAAT
GTTGCTGCCT ATAGAAATCG TGGTATTGCC AAAGAATTGG AGGGAAATCT AGAGGGTGCT
TGTTCTGATT GGAGGCAGGC CTCCTCTCTG GGAGATGAAG ATGCTGCAGA ATGGGTAAAG
GCACAATGTT TTTAA
 
Protein sequence
MALETPEAMA NESSNHSVSE QTANDYFKEG EKRFHLKDYQ GAIDCYSKAI EINPNNAIAY 
NNRGNVKDEL GDYQSAMNDY NKAIDINSLD ASFYINRGVV KRHSNNIEGA IDDYTKAIEL
DPQHATAYYN RGIAKVNLSD NKGAIFDYTK ALTVNPRHAK SYYNRAISKN NINDIKGAIS
DYTKAIEAMP VFASAYYNRG NLMERLGRRQ AAVTDHEKAL VIKPQLLTAM NERGENKNLV
ENKIVNDLNN EEDRSQLDAF NYYSQGNAEQ KRGNNQSAID CYTKAIEVNP HYAEAYNYRG
LANYNLCDYQ AALDDYNKAI EINSIYEDAY IGCGLAKSAL SDYQGAIGAY ERVLVINPKN
VAAYRNRGIA KELEGNLEGA CSDWRQASSL GDEDAAEWVK AQCF