Gene P9211_01791 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_01791 
Symbol 
ID5730920 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp172009 
End bp173613 
Gene Length1605 bp 
Protein Length534 aa 
Translation table11 
GC content34% 
IMG OID641284523 
Producthypothetical protein 
Protein accessionYP_001550064 
Protein GI159902720 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.806064 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGCAC GCACCTTCCT AATTGCATTA AGTGTCACAA TAGGGTTATT GCTCATCATC 
TCTTTTGGTA TTTGGAGGGG CGCTGCAGCT CAAAGCCCAT TAAACCTAAA AAATGAACCA
ATAACACTAC CCTCAACAGC AAAGTTTTTT CCTAAAGAAG CACTTTTAAC AATTCATTTA
AAACTTGATG TAAATCGTTT CCCTAGATAT ATCGAGGCTG TTGTTCCTGA GAAGAAAAGA
AATAAAGCAC GAAGTGAAAC TGTAAAGATA AGAAATGGTT TCTTTGCACT TGCAGGCCTA
GACTTTGAAA GGGACTTATC GTCATGGGTT AATCCTAAAT TTAGTTTATC AATTATTAAA
GTTAATGGAG AGCAAGAACA TTTTGGCTGG CTATTAGCTA TTCAAGGTGA TGACAAAGAA
GGTCCTTCAT CATTTATGAA AAGTTTTTGG GATAAAAAAG TTTTAGATGG GGAAGATGTG
TTGCAAGAAA ATTACAATAA TTTTGAGATT TATTCTTATA ACGACCCTTT ATTTCCTAAA
AAGAGAAAAG AGATCTCAAC TACAATTGTA GAAGATAAAG TGGTTTTAGC TTCATCGGAA
AAGGTAATCC TGAAAAAAGC AATAGATACT TCAAAAGATC CACAATTAAG TCAACTAAAT
GATCAAGAGC TTATTCAATC AATTGAAAAA ATGAATACTG GCATAGGTCT CATTAATGCT
TCTGAAGAAG CACTTGAAAC CTTGCTTGAT TTACCAAAAT CTCTGACTCA GAAAAATTCC
TTAGAAAGGC TTGTAGCGTC AATTAAGGCT GAAGGTCCTG AATTATTATT AGATGGGTTA
TTTAAATTTA AAGAGAGCGG AACGGAGATT AAAAGTCAAA GAGAATCTGC AATAAGCTTA
GTCAATGGTT CAGGAGGACC TATTCAAGAT ATAGCTATCT TAAGTGAACC CTTTAAATTA
ATAGATAGTG CTAGTGAAGA CCCAGAAGCA AAGTTATTAG GCCCTATATT AAGGCAATAC
ATCAATGATC TAGATTCTAC TGCTATAAAT AGAATAACTA ATTCTGAAAA AGGTCCATTA
GTCTGGATTA ATGAGGATGC TGGATGGATT ATAGGGACTA AAGACAATTC TCAGGATTTA
GAAATAGATA AATCTCTAAG AAGTAATGGA TTCTCTAAAA GTTCACTTGC TTTAAAGGAA
AAGAAAATAG ATGTCTGGTC AAAACTAGCA ATAAATAAAT CTGGTAAATA TGACAATATA
ATCAATAATG TTGAAATAAT ATTGTCTCAA GAGAATGAAA GTAATTGGTG GGGTAACAAT
ATTGCCGCTT TAGAACAAAG GCTGCAAGTT AATTCATTAA CGAATAACAA TAAAAGATTT
CAGAACTTGA TTTCTAATGA CGGAAATTAC TTTGATCAGC AAGTTTTTCT AGGTCCTACT
TCATCCCAAA AAATACTTAG CGACTGGAAA CCTTGGAAGC TTCTCCAGGC AGTAATAGGT
CACTCACTCA AGCAAAATAT TAAAAGCATA GCCATCTCTA TAGGGGCATC AAAAGATGAC
ATAGAACAGA CAATTAATTT TCATGCAAAG CTTTTACTAG GTTGA
 
Protein sequence
MKARTFLIAL SVTIGLLLII SFGIWRGAAA QSPLNLKNEP ITLPSTAKFF PKEALLTIHL 
KLDVNRFPRY IEAVVPEKKR NKARSETVKI RNGFFALAGL DFERDLSSWV NPKFSLSIIK
VNGEQEHFGW LLAIQGDDKE GPSSFMKSFW DKKVLDGEDV LQENYNNFEI YSYNDPLFPK
KRKEISTTIV EDKVVLASSE KVILKKAIDT SKDPQLSQLN DQELIQSIEK MNTGIGLINA
SEEALETLLD LPKSLTQKNS LERLVASIKA EGPELLLDGL FKFKESGTEI KSQRESAISL
VNGSGGPIQD IAILSEPFKL IDSASEDPEA KLLGPILRQY INDLDSTAIN RITNSEKGPL
VWINEDAGWI IGTKDNSQDL EIDKSLRSNG FSKSSLALKE KKIDVWSKLA INKSGKYDNI
INNVEIILSQ ENESNWWGNN IAALEQRLQV NSLTNNNKRF QNLISNDGNY FDQQVFLGPT
SSQKILSDWK PWKLLQAVIG HSLKQNIKSI AISIGASKDD IEQTINFHAK LLLG