Gene P9303_02801 details

Gene Information

Locus tag: P9303_02801
Symbol:
ID: 4778667
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 292866
End bp: 294101
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 43%
IMG OID: 640085784
Product: hypothetical protein
Protein accession: YP_001016300
Protein GI: 124021993
COG category:
COG ID:
TIGRFAM ID:
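The Start bp, End bp, Gene Length, and Protein Length values above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 292866
end_bp = 294101

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1236 bp) and Protein Length (411 aa).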


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.484397
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCGGTC GATACCTCGA TAAACAGTGC CCAGCATGCG GCTACACGAT TGCAAAGATG 
GTCTTTGATG CTGGAGTTAA ACCACTAGCA ACAATTGCAT GGGCAGAGTC TGAAGAAGAA
GCAAAGGACG TCAAGTCATT TAAACAAGAA TATATTCAGT GCCTAAACTG CTCCCATGTG
TGGAATCATT TATTTGACTG GGAACATGTA CCTTATGGAA ACAAACCTAA CAAGATGTAC
AACAATGGAT CACAATGGAA GAAGCATATT GAGTATTTGC GCGGGTGGTT ATCAGATCGA
ATGCCTGCCA AACCCACAAT TGTAGACATT GGCTGTGGTG ATGGTAGTTT TCTCATCTGC
ATGGCAAATC ACTATAAACA AAAAGGCAGA TTTCTTGGCT TCGATCCGAG TGGAGATGTT
GATGCTCAGC AGTCAGAAAT CCACTTTGAT CGCACATTAT TCTCTCCTTT AAAAGACACC
GCCAAACACA AGCCAGACCT CATTGTCATG AGACATGTGA TCGAACATCT CACTGCTCCG
TCTTCATTTT TGCATTCGCT GGCATGGGGT GCATCTAGTT ACGAAAAGAC AACATATGTA
TATTGTGAAG TCCCTTGCAT CGACCGTGTT TTTCAAACAA GTCGTTTAGC AGACTTTTAC
TACGAGCACC CATCCCAATT CACAACCTTG TCTTTCACGA GGATGCTAAA AACAGCTGGT
CAAATTATTG ATATTCAACA CTCCTATGAT GGAGAAGTGA TTTGCGGGCT AGTGGAACTA
AAACCTTCTT CAGAACAGAC CAAAATAAGC AATGGTTCAG ATGCATATTT CTTTAAGACC
TCAACCTCAA TCCATCAGAT TGAGCGACAA ATCGACAATC TTTTAGCGGC TCACAAGCTA
ATTGCAATCT GGGGCGGGAC CGGCAAGTGT GCTGCCTTTA TGCATCATTA TAGTGTCTCT
TGCGATGATA TATCTACTGT TGTCGACTCA GACGAACGCA AATGGGGGAC GTATGTTCCA
GGTGTTGGGC AAGAAATCAA ACCACCATCT TATCTGTTGA ACAAGCTGAT TGATGTTCTG
CTTATCCCAA CACAGTGGAG AGCTCAAGAC ATTCTCATAG AAGCGTATTC AATGGGCCTG
ACTTTCAAGC AAGTGCTCAT CGAGCATAAT GGCAGACTTG TTGACTTCAG AGATGATGAG
CATCCATACG CGAAAGATGA GCTCCAGCAA GAATAG
 
Protein sequence
MSGRYLDKQC PACGYTIAKM VFDAGVKPLA TIAWAESEEE AKDVKSFKQE YIQCLNCSHV 
WNHLFDWEHV PYGNKPNKMY NNGSQWKKHI EYLRGWLSDR MPAKPTIVDI GCGDGSFLIC
MANHYKQKGR FLGFDPSGDV DAQQSEIHFD RTLFSPLKDT AKHKPDLIVM RHVIEHLTAP
SSFLHSLAWG ASSYEKTTYV YCEVPCIDRV FQTSRLADFY YEHPSQFTTL SFTRMLKTAG
QIIDIQHSYD GEVICGLVEL KPSSEQTKIS NGSDAYFFKT STSIHQIERQ IDNLLAAHKL
IAIWGGTGKC AAFMHHYSVS CDDISTVVDS DERKWGTYVP GVGQEIKPPS YLLNKLIDVL
LIPTQWRAQD ILIEAYSMGL TFKQVLIEHN GRLVDFRDDE HPYAKDELQQ E
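
The protein sequence can be reproduced from the gene sequence using translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below is a hedged illustration rather than the method used to generate this record: the `translate` function and its names are hypothetical, the amino acid assignments are those of the standard code that table 11 shares, and the set of alternative start codons is listed as an assumption. Note that the gene begins with GTG, which normally encodes valine but is read as methionine when it serves as the start codon.

```python
from itertools import product

# Standard codon ordering (T, C, A, G) with the matching amino acid string.
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: start codons accepted under translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a CDS, ignoring whitespace, stopping at the first stop codon."""
    clean = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(clean) - 2, 3):
        codon = clean[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An alternative start codon in the first position is read as Met.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence, in the grouped layout used above.
first_row = "GTGAGCGGTC GATACCTCGA TAAACAGTGC CCAGCATGCG GCTACACGAT TGCAAAGATG"
print(translate(first_row))
```

Because the function strips whitespace before reading codons, the full gene sequence can be pasted in with its 10-base grouping and line breaks intact.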