Gene P9211_13341 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_13341 
Symbol 
ID5730986 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp1200669 
End bp1202708 
Gene Length2040 bp 
Protein Length679 aa 
Translation table11 
GC content38% 
IMG OID641285705 
Producthypothetical protein 
Protein accessionYP_001551219 
Protein GI159903875 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0846301 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGAACTCC CAATAGATCA TTTTCGTTTG CTCGGAGTTA GTCCATCTGC TGATGCAGAG 
GAGGTTCTTA GGTTCTTTCA GCTGAGATTA AATCGTATTC CGCATCCTGG ATTTACTCCT
GAAGTTATTG CGCAGAGGTC TGAACTTTTA CGTCTTTCTG CTGATTTGCT TTGTGATAAA
GATTTGAGGG AGGATTATGA GTCAGCTCTT TTAAATGGAG CCGTAGGGCT TGATCTTTCA
TTTAATAGAG AAGTAGCAGG ACTCATTCTG CTTTGGGAAG GCGGAGTAGC GGATGAAGCT
TTTAAGCTTG CAAGAAAAGC ACTTCAACCT CCTCAAACAC CAGCACTGGG AAGCGGCCGT
GAGGCGGATT TGGCATTGAT AGGGGCCTTG GCTTGTAGAG ATGCTGCAAT ACAGGAGCAA
GAACTAAGAC GTTACGCTTC GGCTGCGGAA CTCCTTGAGG AAGGTATTCA ATTGCTTCAA
AGGATGGGGA AACTTCCTGA ACAGAGAAAG ATAATTGAAA GAGATTTAGA AGTGTTACTA
CCTTATAGAA TCCTTGATTT GTTAAGCAGA GATTTATCTG ATGAGAAATC TCACGAAGAA
GGATTGAATC TATTAGATAG TCTTGTCCTT AAGAGGGGAG GCTTGGAAGG GGATAATTTG
TCGAATTCTT CTATTGAACT ATCACAGCGA GAGTTTGAAC TTTTCTTTCA GCAAATAAGG
AATTTCCTAA CTGCGCAAGA GCAGATTGAC TTATTTCTCC ATTGGCAAAG GAGAGGATCT
CCAGATGCTG GTTTTTTGGG TGCCCTGGCT TTGGTCGCAT CAGGCTTCCA TTGGCGGAAA
CCTGAATTTT TGCAAAAAGC AAAGAAACAG TTGAAGGCAT TGAATCAGCA GGGCTTTGAT
TCAATGCCTT TGCTTGGCTG TATAGATCTT TTGCTGGCGG ATGTTCAGCA GGCAGGCGTT
CGTTTTAAAA GTAGTCCTGA TAAAGGATTA CAAGATTGGC TAAATGCATA TCCTGGAGAA
GAATTGGCTG CTTTATGTCA TTACTGTAGA AATTGGCTGC TTAGAGACGT TCTACCAGGT
TTCAGAGATA TTGAGATTGA CACTGTTGAT TTAGAGGCTT GGTTCGCAGA TAGGGATGTT
CAAGAATATG TAGAGCAAAT AGAACGCAGA GGGGCTTTTG GGATTGCTCG AGCAGGGTTT
TCTCTTTTTT CTGGACTGTC TTCAGATAAA ACAAATGATT CAATAAACTC CTTAGAGAAT
GATTCGACTC TTTCTAATGT TGATGAAATT GAGAAGGATT CTGAAAAAAA CAACAAATAC
CTAGGTTCTC CTGAAGAGGA GCTCAGTGAT GAGAAAACTT TCTTGGAAAA CTTAGTTCAA
TTGTTGAAGT GGAGACCTTT TTATATAGAA ATAGCAAAAC CACGAATTAA AATTCCAGAA
AATAACTTTT TTAAAGCAAC TTTAGCGTTG TTCCTATTAC TGTTTTCAGG AACATTTACT
GCTTTGATTT TATATAGAAA TAATCCAACT GAAGATAATA TATCTGAAAG CTTTAAAGAG
CCTTCTGAAA AAATTGTCAG TAAAAAAACA GATATTAATT TGAATATAAA GCAACAGGAT
CAGACTAAAT TAGAAAAAAG ATACAACACT TTAACAAACA AATCACCATC AAAAGATGAA
GTCCAACAAT TAATTGAGGC TTGGCTTTCT GGGAAGGCAG GTATCTTATC TGGGGTTAAC
AATTTAGATT TATCAAATGT AGCAAGACCT TCTCTTGTGA AAATTGTTCT CGAGCAAAGG
GAGAAAGATA TTGCTCTTGG AGAAAGACAG ATTATCTATG CAAATATTAA AAGTTTAGAG
ATAGAGGAAC AAACTGAAAA GAGAATATCT GTAAAAGCAG TACTTAACTA TAAAGACCAA
AGAGTTAATT CTTCTGATCA GATCATTTCC GAGACAACGA TTCCCTCATT AAAGCTAAAG
TATGTTTTAG GAAGAGAAAA AAATATATGG CAGTTGCTTG ATTTTTCAAG TAGCACATAG
 
Protein sequence
MELPIDHFRL LGVSPSADAE EVLRFFQLRL NRIPHPGFTP EVIAQRSELL RLSADLLCDK 
DLREDYESAL LNGAVGLDLS FNREVAGLIL LWEGGVADEA FKLARKALQP PQTPALGSGR
EADLALIGAL ACRDAAIQEQ ELRRYASAAE LLEEGIQLLQ RMGKLPEQRK IIERDLEVLL
PYRILDLLSR DLSDEKSHEE GLNLLDSLVL KRGGLEGDNL SNSSIELSQR EFELFFQQIR
NFLTAQEQID LFLHWQRRGS PDAGFLGALA LVASGFHWRK PEFLQKAKKQ LKALNQQGFD
SMPLLGCIDL LLADVQQAGV RFKSSPDKGL QDWLNAYPGE ELAALCHYCR NWLLRDVLPG
FRDIEIDTVD LEAWFADRDV QEYVEQIERR GAFGIARAGF SLFSGLSSDK TNDSINSLEN
DSTLSNVDEI EKDSEKNNKY LGSPEEELSD EKTFLENLVQ LLKWRPFYIE IAKPRIKIPE
NNFFKATLAL FLLLFSGTFT ALILYRNNPT EDNISESFKE PSEKIVSKKT DINLNIKQQD
QTKLEKRYNT LTNKSPSKDE VQQLIEAWLS GKAGILSGVN NLDLSNVARP SLVKIVLEQR
EKDIALGERQ IIYANIKSLE IEEQTEKRIS VKAVLNYKDQ RVNSSDQIIS ETTIPSLKLK
YVLGREKNIW QLLDFSSST