Gene P9303_24171 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_24171 
Symbol 
ID4778999 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp2126498 
End bp2128699 
Gene Length2202 bp 
Protein Length733 aa 
Translation table11 
GC content39% 
IMG OID640087938 
Producthypothetical protein 
Protein accessionYP_001018415 
Protein GI124024108 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5010] Flp pilus assembly protein TadD, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGAAA AGCCAGAACC AAATGCTCTT CACTCTGTGC GCGACAAACA AGCCGATTAT 
GCTTTCTCTC TATTTCAAGA AGGCAGGATA AAAGAAGCGG AAGAAATCTA TACCAGACTT
ATTAAGGCTG GTACCAATAA TCACCTGACC TATGGAAGCC TTGGCATTAT TTATGGAATA
GAAGGTCGAT GGCAAGAATT AATTGCTATT CTAGAAAAAG CACTCAAGCT AGAACCCAAC
TATTCTGATG CTCATAATTA CATCGGTATT GCTCTTAAGA GGTTAGACAA CTTAGAAGCA
GCTGTTGAAT CATTTCAGAA AGCGCTAAGC ATCAATCCAA ACTGCCCTAA ATCAAATTAT
AATTTAGGCA ATGCACTATT AGAAGAAGGT AAACTAGATT CTGCAATCGC ATTTTTAAAA
ACAGCAGTTG ATTTTAAACC TGACTTCTCA GAAGCACACT ATAACCTGGG GATAGCCTAT
CTTGCGATTG ATAACATCGC AGCTGCTATT AACTATTTAA ATAACTCACT TCATTTAAAA
CCAGCTTTTC CTGAAGCACA CAATAGTCTT GGGCTCGCTC TTCAGGCTAA AGGAGAAAAA
AATCTTGCTA TTAGCTCTTT CATAAAAGCA CTAGAAATCA AACCAGAGTT CCCTGAAGCT
TGCTACAACT TGGGCTTTAT TTATCTCAAT CAAGGCGATA TAGAAACTGC GATTAATTAT
TTCAATAAAG CACTTCTCTT AAAATGGAAC TATCCAGAAG CTCTCAATAA TCTTGGCATC
GCCTTCAAAG CAAAAGGTGA AATAAGTCCT GCTATAAATT CTTGGAGGAA AGCACTTGAA
ATCAAAACAG ATTTCCCCGA AGTTTATTAT AATTTGGGCT CTATTTATCT AGATCAAGGC
AATATAGAAA CTGCAATTAA TTTTTTCAAA AAAGCACTTA TTTTAAAAGA GAACTATCCA
GAAGCCCTCA ATAATCTCGG CAACTCTCTC CAAGAAAAAG GTGAGCTAGA TGCTGCAATT
GCAGCTTATA AGAAAGCACT CAATCATAAA CCAAGCTATC GAGAGGCCCA AAATAATCTG
GGTTGTGTCT ACAGAGCACA AGGTGATCTT GAAAATTCTA TCCGCATCTT CAAAAAAGCT
CTTGCTCTAC ATCCTGATCA TCCGGAAATA TTATCGAACC TAGGCACATC CCTTGAGGAG
AAAGGCGATC TCGAAGCAGC AATTTCTTCA TTTAACAATG CCATCAGCAA TAACAGTAAT
TATCCAACTG CTCATTACAA CCTATCTTTA TGCCTACTAT CCAAAGGCGA TTATCATAAT
GGTTGGCAGC AACATGAATG GCGCTGGAAG AATTGGGAAA AGAATGGAGG CAATAGTGGG
GCGCTAACAA CAACTAGGCC CACCTGGGCA CCTGGAAAGA AAGGAAGTGT TTTGCTTTGG
CCAGAGCAGG GAATTGGAGA CGAAATTCTT TTTGCTTCTG TTATTCCAGA TATATATGAA
GCCTGTGACA GGCTAATTGT TCAGGCAGAT AAGCGACTGC TGCCTTTATT TAGTCGTTCC
TTCCCTAAAG ATATTGAATA CAGAGAACGA GGTGAGATCA TTGCTGAAAA TGATTACGAT
TACCATATCC CAATGGGCTC TGCTCCACAA TTCTTTCGAC TATCTGCTGA CAGCTTCAAA
CCATCTTCAA AAGGCTACCT TAAAGAAGAC ACTGCAAAGG CGCGGGAATT TAGATTGAAA
TTACTGGATA AAAGATTTGA ACATTTAATA GGCCTTAGTT GGCACTCTAC AGCAAAAAGA
TCCATTGCTA AAAATAAAAG CATAAGCCTT GAACAAATTG TCATCGCAGC AGATAGTCCA
AAATGTAGAT TGATAAATTT GCAATATGGA AATGTTGACA ATGAAATCAA CAACCTCGCC
AAATTCAAGG GCATTGAAAT TTACGACAAC AAAGAGCTGG ATCTTTATAA TGACCTTGAT
GGCCTGGCTG CCTTGATCGG TGCTTGCGAT GAAGTAATAT CCATTAGCAA TGTCACTGTA
AATCTGGCTG GAGCACTGGG GAAAGCGGCT AAATTACTTC TGCCTTTTAG CTCCGACTGG
CGCTGGGGAT CTACAGGGAA ACAATGTGAT TGGTATGAAA GCATAGCGAT TTATCGACAA
CAAGAAATTG GCAACTGGCA AGATATTCTT GCAAAGATTT AA
 
Protein sequence
MNEKPEPNAL HSVRDKQADY AFSLFQEGRI KEAEEIYTRL IKAGTNNHLT YGSLGIIYGI 
EGRWQELIAI LEKALKLEPN YSDAHNYIGI ALKRLDNLEA AVESFQKALS INPNCPKSNY
NLGNALLEEG KLDSAIAFLK TAVDFKPDFS EAHYNLGIAY LAIDNIAAAI NYLNNSLHLK
PAFPEAHNSL GLALQAKGEK NLAISSFIKA LEIKPEFPEA CYNLGFIYLN QGDIETAINY
FNKALLLKWN YPEALNNLGI AFKAKGEISP AINSWRKALE IKTDFPEVYY NLGSIYLDQG
NIETAINFFK KALILKENYP EALNNLGNSL QEKGELDAAI AAYKKALNHK PSYREAQNNL
GCVYRAQGDL ENSIRIFKKA LALHPDHPEI LSNLGTSLEE KGDLEAAISS FNNAISNNSN
YPTAHYNLSL CLLSKGDYHN GWQQHEWRWK NWEKNGGNSG ALTTTRPTWA PGKKGSVLLW
PEQGIGDEIL FASVIPDIYE ACDRLIVQAD KRLLPLFSRS FPKDIEYRER GEIIAENDYD
YHIPMGSAPQ FFRLSADSFK PSSKGYLKED TAKAREFRLK LLDKRFEHLI GLSWHSTAKR
SIAKNKSISL EQIVIAADSP KCRLINLQYG NVDNEINNLA KFKGIEIYDN KELDLYNDLD
GLAALIGACD EVISISNVTV NLAGALGKAA KLLLPFSSDW RWGSTGKQCD WYESIAIYRQ
QEIGNWQDIL AKI