Gene P9211_17071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_17071 
Symbol 
ID5730068 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp1532858 
End bp1534273 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content41% 
IMG OID641286089 
Producthypothetical protein 
Protein accessionYP_001551592 
Protein GI159904248 
COG category[S] Function unknown 
COG ID[COG4370] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR03492] conserved hypothetical protein 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAAAAA GAGATCTAAT AATTGAGTTC TTGCAACTGA TTACAGGAAT CGGTCTCAAG 
AAAGGTAAGA AGGCGGCACC CAGATTCGAA CTGGGGATAA AGGATTTGCA ATCCTCTGCC
TTACCACTTG GCCATGCCGC CGAAGGAAAT AAAGATTTTA CCTCTGGGGA TCGTATCAGT
CAGACGACCA AAGATCTTTT AGTTCTTTCT AATGGACACG GTGAAGATCT TATAGCCCTT
AGGATTTTAG AGGCTCTACA TCTCTTGGAA CCAAGCTTAA CCTTTGAGGT ACTCCCTTTG
GTTGGAGAAG GTAAGGCTTT TGAAAAGGCA GTTTATGAAA AGTGGTTAAT CAAAATAGGA
CCTTCTTTTC GCTTGCCTAG TGGAGGATTT AGTAATCAAA GCTTTTCAGG ATTGATTCGA
GATATTTCTG CAGGTGTCTT TTGTTTTGCT TATAAGCATT GGCGGTATGT CAGACGATCT
GCATTACATG GGAAAGTGAT TCTTGCAGTT GGGGATTTGT TGCCTTTGTT TTTTGCCTGG
AGTGGTGGCG GTATGTATGG GTTTATTGGG ACTCCCAAAA GCGATTACAC ATGGACATCA
TCTTCAGGGG CTTTGTTGAG TGATTATTAT CATCGCTGCA AAGGCTCAGA ATGGGACCCT
TGGGAATGGG TTTTGATGAG ATCTTTAAGA TGTAAATTTG TAGGAGTTAG AGATAAGTTG
ACTGCTAGAG GTTTACAGCG GAAGTCAATT AGGGCTTTTG CTCCAGGCAA TCCAATGATG
GATGGTTTTC ATAAAGCTGA ATGTCCACAA GACTTATTAA TGTTTAGAAG ATTGTTATTG
CTTTGTGGAA GTAGAATGCC AGAGGCATTG ATGAATTTTC GAAGATTAAT ATCTGCAGCT
TTGCAAATTA AAAGTCCAAC ACCATTAGCA ATTTTGGTTA CTACTGGGGC AGACCCATCT
CTACATGAGC TTGAGCTATG TTTAGAGAAA TTAGGCTTTT CGAAATTTTG TTTGCAAAAC
AATTCATTAG GTGTAGATAC CTTTTGGCAG AAGGATCGAT TTAGGGTTTT TATTGGTATT
GGAAAATTTC ATGAGTGGGC CACTTATGCT GAGATTGGCC TTGCAAATGC AGGCACTGCT
ACTGAGCAAT TAGTAGGACT TGGTACTCCT TGTGTTTCAT TGCCAGGTAA AGGTCCACAA
TTTAAAAAAT CATTTGCAAT GCGTCAGGCT CGCCTGCTAG GTGGAGCTGT CTTTCCTTGT
AGAAATTCCA AACATTTAGC CGAATCAGTT GAGGTGCTGC TTCGCAATGA CTCATTTCGC
GAACAGTTAT CTTTGCAAGG AGTAAAGAGA ATGGGCGCGC ATGGTGGAAG TGCAGCTTTA
GCACAATTTG CTTTAGAATT ATTAGTAAGG AGTTAA
 
Protein sequence
MRKRDLIIEF LQLITGIGLK KGKKAAPRFE LGIKDLQSSA LPLGHAAEGN KDFTSGDRIS 
QTTKDLLVLS NGHGEDLIAL RILEALHLLE PSLTFEVLPL VGEGKAFEKA VYEKWLIKIG
PSFRLPSGGF SNQSFSGLIR DISAGVFCFA YKHWRYVRRS ALHGKVILAV GDLLPLFFAW
SGGGMYGFIG TPKSDYTWTS SSGALLSDYY HRCKGSEWDP WEWVLMRSLR CKFVGVRDKL
TARGLQRKSI RAFAPGNPMM DGFHKAECPQ DLLMFRRLLL LCGSRMPEAL MNFRRLISAA
LQIKSPTPLA ILVTTGADPS LHELELCLEK LGFSKFCLQN NSLGVDTFWQ KDRFRVFIGI
GKFHEWATYA EIGLANAGTA TEQLVGLGTP CVSLPGKGPQ FKKSFAMRQA RLLGGAVFPC
RNSKHLAESV EVLLRNDSFR EQLSLQGVKR MGAHGGSAAL AQFALELLVR S