Gene P9303_19081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_19081 
Symbol 
ID4776177 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1672917 
End bp1674740 
Gene Length1824 bp 
Protein Length607 aa 
Translation table11 
GC content37% 
IMG OID640087417 
Producthypothetical protein 
Protein accessionYP_001017915 
Protein GI124023608 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0528969 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCACG CTGCAAAAAT ATCCTCAGTC AAAGAAAATA GAAACAGCTT ATCTGACCAC 
ATCAGAAAAG CTTGGGGATT ACCTGAAGAT GAGTATATAG AGCAGGCAAT AAGCGAACTA
GGATCAGTAT TATTAGATTT TGGATTTATA ACAGCCTGGC ATAGCCTAAA GGCAGAGTCA
GCAATGAAAG GGGTCGAGTT ATCTAATGAG CAGCACCATC TAGTTGATCA ACTTGAAAAG
TTCAGCTCTG AGCTGCTAGA TGAGTTATTA CGGATACAGT CACCACGTGA ACTTAGAGAT
GACTGCCGCA TAATCGGACT GTTGCACTAT ATTCAGACGG TTGGTAGAGA TCTTCCTCCT
AATATTGCAG AACTAGCGGC ATTGCCTCGA GATGCATTTG ATAATTATAT TTTGACGTCT
CTAATCAGTA GAGATCATAT AACAACTATC AGAGTAATGG AGGTTCTTAA AGGATCTACG
AGATTGTCAG AATCATATCA ACACTACATA TCATCATATA GCTCTCCATT AGTCCACAGA
AATCATAGAA TAGCCAGCCA ATCTAAAAAA TCTATTAGTC CATGGAGAGA AATTAGTGAG
GCTATTGACT CATTAGTAGA AGTAGTATCA TATATTCACG ACATGCAGTA TTCAGATGCA
AAGAGTATAT GTGATCAGGT TTTGGCGGTA TTACCAAATT TACAGGAAGC AAAGTTTCTA
AATAGAGTAA TCGAGCATTA TCACTCAAGT GTGTATGACA AGCCAGACAT CGACCCTCTC
ATGCTTTATG GGTCAGAGGT TCTTGGAATA AGCTGGTCAG ATGTCTGGGA TAAATATAAA
AATACAGATA ATATTCGTGA ATACATTAAT AATATTATGT CGATGAAAGC ATTTGACGTA
AGTTTGTACA ATAGAGCACA GTTTCTCACT ATTCAGTGTA AGTATCAGGA CGAAATGTAC
GAATCGCTTT GGAAGGAAGT AGAGGCATTG AATGATAATT TAATAAGGGT TAGTCAATAT
GATGTTTTTA GCCTTACTGC ATTTGCAACT CGTCTAGAGG GATTGTCAAC CATGCCAGTT
AATTACAACA AATATGAAGA TAGCATACTT AACCATATTA CATTTATCAC TGGATTGCCG
GGCCCACATT TTTCACGTCT GCGTAGTTGG CTAGCAAAGT CAGAGGATGT GGCGTGTATG
TACTCGGATC AAGTCTTGAA AGGCATGGCC CAATATGTAT ATGACAAGTT TGAAGAGAAA
TACCCCAGCT CAATAGAACA GTTAGATACT GATGATCTTT TACAACTTAG AAAAATCTAT
TTACTACATC TTGCCTTTTG CTATGGGGAC AGGCAAATCG AGAGAATAAT AGACATAATA
CCCTCGGGGT TTAAGCATAT TGGATTACTG TCTCTTATTT TCCCAGAGGC AAAGTTTATA
GCGATCCATC CAAATCTCTA TGATCATCTT AAATATAGCT TTATGTCATT ATTTGGCTTT
GATAGTTCTC ACGCAAATGC TACGATGAGT GAATTGGTAG GTTATGCATA TGATTATTCA
TTAATAATAG ATAGGTGGAA AAAGCATCTG GGTTCAAAGC TTGATTATGT AAATCTAGAT
ATCTTTAAAT TGTCTGATTA CGACGACCTG AAAAATTTGA TTATTCCAGA AAGTGCTAAT
ATAAAGCTCA TGGATATAAA GCCCACCCTC GAGATGCCCA AACGTCCTAA AGTATCTATT
GAAAACGTTG AGTCAAAGCT CTTAGAGTTT GTAGACGATA TAAATGAAAT CAATGAAAAG
GTTAATATGC TAAAAGGAGC TTAA
 
Protein sequence
MTHAAKISSV KENRNSLSDH IRKAWGLPED EYIEQAISEL GSVLLDFGFI TAWHSLKAES 
AMKGVELSNE QHHLVDQLEK FSSELLDELL RIQSPRELRD DCRIIGLLHY IQTVGRDLPP
NIAELAALPR DAFDNYILTS LISRDHITTI RVMEVLKGST RLSESYQHYI SSYSSPLVHR
NHRIASQSKK SISPWREISE AIDSLVEVVS YIHDMQYSDA KSICDQVLAV LPNLQEAKFL
NRVIEHYHSS VYDKPDIDPL MLYGSEVLGI SWSDVWDKYK NTDNIREYIN NIMSMKAFDV
SLYNRAQFLT IQCKYQDEMY ESLWKEVEAL NDNLIRVSQY DVFSLTAFAT RLEGLSTMPV
NYNKYEDSIL NHITFITGLP GPHFSRLRSW LAKSEDVACM YSDQVLKGMA QYVYDKFEEK
YPSSIEQLDT DDLLQLRKIY LLHLAFCYGD RQIERIIDII PSGFKHIGLL SLIFPEAKFI
AIHPNLYDHL KYSFMSLFGF DSSHANATMS ELVGYAYDYS LIIDRWKKHL GSKLDYVNLD
IFKLSDYDDL KNLIIPESAN IKLMDIKPTL EMPKRPKVSI ENVESKLLEF VDDINEINEK
VNMLKGA