Gene P9211_03951 details

Gene Information

Locus tag: P9211_03951
Symbol:
ID: 5731667
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 370876
End bp: 372174
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 35%
IMG OID: 641284752
Product: hypothetical protein
Protein accession: YP_001550280
Protein GI: 159902936
COG category:
COG ID:
TIGRFAM ID:
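
The length fields above are consistent with the coordinates if the start and end positions are read as 1-based and inclusive, and if the final codon is a stop codon. Both are assumptions here, since the record does not state its coordinate convention. A minimal sketch of that arithmetic, with hypothetical helper names:

```python
def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


start_bp, end_bp = 370876, 372174  # Start bp and End bp from the record
length = gene_length_bp(start_bp, end_bp)
print(length)                      # 1299
print(protein_length_aa(length))   # 432
```

Both results match the Gene Length and Protein Length values in the record.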


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
TTGACAAGAA AAAAATATCT TTATAAAAAA GAAAGTGGCC TAAAGGAAGT AATTGCAATT 
AATTACTTTC GAAAAATTTG GATAGCACAG GTATTTTCTC AACTAGGAGA TAAGTTTTTA
ATAGTTTTAA CGATCTTTAT TATCAGTAAA AACTGGTCAA ACTCCATACC TACTTTTTCA
GATCAACCAG AAAAGGCAAT TACACTTTTG GCCAGTGGTG TTTATTTAGC AAATAGTCTT
CCTGCAATAT TTTTAGGCGC TATTGCTGGA GTTTTTTCAG ATAGATGGCC AAAAGTTCCC
ATTATGATTG CATCAAATGT ACTTAGGGCA GTATTGGTAA TCTTAATCCC AGTTTGCTTA
ATACCTGGGC CAATTATAAA TGGCGTATCC TGGGGGTATT GGTTTTTACT AATAATTATA
TTTCTTATAT CATGTCTTAC ACAATTATTT ACTCCAGCCG AACAATCAGC AATACCATTA
ATAGTTAGTA GTAATAAGCT TTTAGCAGCC AATTCTATTT ATCAGTCAAC TACGATGGGG
GCAATAATAA TTGGCTTTGC TTTTGGCGAG CCTTTACTAA AAATACTTGG AGAGACCTTT
CAATCAATAG GTATTTTAGG AGGAGAGTTT CTTCTTCTAC CTATCTGCTA TGGAGCAGCA
TCATTAATAC TAAATAAAAT AAAGTTAAAA GAATCTCTGA AGCCTAAACT GAAGAAGAAT
ATCTTTTACG ACTTGAATTC TGGATTAATA GTTCTCAAAA AAGTCCCAAC TGTTAGGAAG
GCAATTCTTC AACTCGTAAT ACTTTATTGC TTAATGGCAA GCTTATATGT TTTATCTATA
GGACTTGCTT CTTCCATAAC TATTCTAGGT CCTACAAAGT TTGGGGTACT TCTTTCTTTC
ACTGCTATTG GCCTTGCATT TGGAGCTTTT CTTATGGCTC AGAAAGCAAA TCTGATTCAA
GGGCATAAAT TACCAGCAAT AGGTCTTTGC TTAATAGCAT TGAGCTTACT TTTACTAGAG
CAATCCAAAG GATTATTAGG TTTTACTCTC GCAATTTGTA CTCTTCTTGG TGTAGGGGCA
TCTCTAGTGG CAATCCCAGC TCAAACAAGT GTTCAAAAAA ACACACCAGA AGAATCACTT
GGGAAAGTAC TTGGTCTTCA AAACAATGTA ATCAATATTG CTCTGAGCCT CCCTCTCTTA
ATCGCAGGTG CATTAGTAAC GCAATTGGGA GTTACACCTG TTCTATTTCT TTTGGCAGGA
CTTGCACTAT TTGGTGCATT ATTCGAAAAT TTTCTGTAA
 
Protein sequence
MTRKKYLYKK ESGLKEVIAI NYFRKIWIAQ VFSQLGDKFL IVLTIFIISK NWSNSIPTFS 
DQPEKAITLL ASGVYLANSL PAIFLGAIAG VFSDRWPKVP IMIASNVLRA VLVILIPVCL
IPGPIINGVS WGYWFLLIII FLISCLTQLF TPAEQSAIPL IVSSNKLLAA NSIYQSTTMG
AIIIGFAFGE PLLKILGETF QSIGILGGEF LLLPICYGAA SLILNKIKLK ESLKPKLKKN
IFYDLNSGLI VLKKVPTVRK AILQLVILYC LMASLYVLSI GLASSITILG PTKFGVLLSF
TAIGLAFGAF LMAQKANLIQ GHKLPAIGLC LIALSLLLLE QSKGLLGFTL AICTLLGVGA
SLVAIPAQTS VQKNTPEESL GKVLGLQNNV INIALSLPLL IAGALVTQLG VTPVLFLLAG
LALFGALFEN FL
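
The gene sequence begins with TTG rather than ATG, yet the protein begins with M. This is expected under translation table 11, where TTG is one of the permitted alternative initiation codons and an initiator codon is read as methionine. The sketch below illustrates the relationship on the first line of the gene sequence. The function name `translate_cds` is hypothetical, the codon table is built from the standard NCBI ordering, and it is not necessarily how the database derived the protein sequence.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Table 11 assigns the same
# amino acids as the standard code; it differs in its set of start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading a valid first codon as M."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-nt groups
    usable = len(seq) - len(seq) % 3    # ignore any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")         # initiator codon is read as Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":           # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


first_line = "TTGACAAGAA AAAAATATCT TTATAAAAAA GAAAGTGGCC TAAAGGAAGT AATTGCAATT"
print(translate_cds(first_line))  # MTRKKYLYKKESGLKEVIAI
```

The output matches the first 20 residues of the protein sequence above (MTRKKYLYKK ESGLKEVIAI).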