Gene P9303_03401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_03401 
Symbol 
ID4778547 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp347511 
End bp349448 
Gene Length1938 bp 
Protein Length645 aa 
Translation table11 
GC content62% 
IMG OID640085843 
Producthypothetical protein 
Protein accessionYP_001016357 
Protein GI124022050 
COG category[T] Signal transduction mechanisms 
COG ID[COG4252] Predicted transmembrane sensor domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.689704 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAGCTCA GCCAGCGCAT CCGCGACGGT CTCGTTCAAG CCGGGCTGAT CGGCGCCGCG 
GCCCTGTTCC TGGGGGGACT GTCGACAACG GGAATCAGTG CATCGATCGA TTGGCTGCTG
TACGACAGCG TCATCACCCT GCGTTCGCGC GATTCCGCCC AACGACATCC GGTGACGATC
GTCGGAATCG ACGAAGACGA CATCAGCCAC TACGGATGGC CGATCGACGA TGCCGTGCTC
TGTCGTGCCC TTCGGAACGC CCTGCAAGCC AACGCCAGTG CTATCGGCCT GGATCTCTAC
CGCGACCAGG GAATTGGTCC GCAGCAGAGC TGTCTGCCGG AGCTGATCCG GCAAAACTCC
GAGATCGTGG CCATCTTCAA CGCTGCCGAG GGCATCACCG CTCCACCGGG GACACCTGCC
GCCCAGCAGG CCTTCAATGA CCTGGTGGTC GATGCCGATG GCGTAATCCG TCGGGACCTG
ATCCATGTCA GCGGTCAGGA CGCCGCCACC GTGAGCCTGC CGGTCCGACT GATCGAGACA
TCCGGTCTGC AGCCCGGCCT GCTCGATCTG CTGAAGAAGC CAGACAGAGC AGAACAACTC
GGACCATGGC TGCTGCCCCA TTCCGGGGGA TACCGCGACC TCGATGCTGC CGGTTACCAG
CGACTGCTGC CGTTTCATCA ACCCGGGAGC TTCCGCACCA TCAGCCTGCG TACGCTGGCT
GATGGCAAAT GGGCTGCAGA GGCCCTGCAG CAGGGAGACA TTGTGCTGCT CGGGAGCACC
GCGCCCAGCC TGAAAGATCT GTTCGAAATC CCGCACAGCC GTTTCAGCCA AAGCAACAAA
TTCCTCATGC CCGGGGTGGA AGTGCATGCT CTGCGCGTCG CCGCCCTGCT GAATGGTCTG
GATCAGCCAT GGACGCTCCG AACACTGCCG CCCTGGAACG AGCAGGGCCT GGAGCTGATC
GCCATCCTCG TCGGGATCAG CCTTGGTGCC AGCTGCAGCA AGCTGCAACG CAGCATCACG
ATCACCACAG TGCTAACGGT CGTTCTGGCG GGCTGCGGTG CCGCTTTGCT CTGGACGCAA
GGACTTTGGA TTGGCCTCAC CCTGCCGGTG ATCTCCCTAC CCGTGATGGC TGGGGTGGGT
TGGCTGCGCC GCGGAGCGCT CCTGCAACGT CAAAAACAAC AAATCGAACG CCTGCTGGGC
CAAACCACCT CCCCTGCCGT GGCACAACAG TTGTGGGAAC AACGCGACTC CCTCCTGCGG
GATGGCCAGT TCGAAGGGAA GCAGGTCACC GCAACAGTGT TGTTCACCGA CACCCAAGAC
TTCACCAGCA TTTCCGAACA GCTGTCACCC TCCGAGCTGC TGACATGGCT CAACCGTGGC
ATGAGCCTGT TGGTGCAGGA GATCACCAAC CACGGCGGCA TCATCAATAA ATTCACCGGT
GATGGACTTC TGGCGGTTTT TGGAGCACCG ATCAGCCAGG GAATGGCCGT GGATGCAGGC
CATGCGATTG ATGCGTCTTT GGCGATTACG GCTCGGCTAG CCGAGCTCAA TCAAGCATTG
AAACTGGAGC AGGCGCCAGC CATGCGCATG CGGATCGGCA TTCATTCCGG TCCGGTGATC
GCCGGCTCGA TGGGAAGCAG CGCGCGGCTG GAATTCACGG TGATGGGGGA CACGGTGAAT
TGCGCGTCGC GGTTGGAAAG CCTGGCCAGG GTTCCAGCCG ACGACAGCTG CCGCACCCTC
TTCAGCCAAG AGACCCTGAT GCGGTGTGAG CGCGACGACC TGCTCTGGCA TTCGGTGGGG
CGATTGCAGG TGAAAGGGCG TCAGCAAGAG CTGGACGTTC TCGAACTCAA GGGCACCAAA
CCAGCCGCCA ATGTCAGAAC AGGCAGCGCA CCGGCAGACG ATCGGGCCAG GAGCGCAGAT
CAAGAGCTGC CAGGTTGA
 
Protein sequence
MKLSQRIRDG LVQAGLIGAA ALFLGGLSTT GISASIDWLL YDSVITLRSR DSAQRHPVTI 
VGIDEDDISH YGWPIDDAVL CRALRNALQA NASAIGLDLY RDQGIGPQQS CLPELIRQNS
EIVAIFNAAE GITAPPGTPA AQQAFNDLVV DADGVIRRDL IHVSGQDAAT VSLPVRLIET
SGLQPGLLDL LKKPDRAEQL GPWLLPHSGG YRDLDAAGYQ RLLPFHQPGS FRTISLRTLA
DGKWAAEALQ QGDIVLLGST APSLKDLFEI PHSRFSQSNK FLMPGVEVHA LRVAALLNGL
DQPWTLRTLP PWNEQGLELI AILVGISLGA SCSKLQRSIT ITTVLTVVLA GCGAALLWTQ
GLWIGLTLPV ISLPVMAGVG WLRRGALLQR QKQQIERLLG QTTSPAVAQQ LWEQRDSLLR
DGQFEGKQVT ATVLFTDTQD FTSISEQLSP SELLTWLNRG MSLLVQEITN HGGIINKFTG
DGLLAVFGAP ISQGMAVDAG HAIDASLAIT ARLAELNQAL KLEQAPAMRM RIGIHSGPVI
AGSMGSSARL EFTVMGDTVN CASRLESLAR VPADDSCRTL FSQETLMRCE RDDLLWHSVG
RLQVKGRQQE LDVLELKGTK PAANVRTGSA PADDRARSAD QELPG