Gene P9211_00501 details


Gene Information

Locus tag: P9211_00501
Symbol: hepA
ID: 5730888
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 49870
End bp: 53073
Gene Length: 3204 bp
Protein Length: 1067 aa
Translation table: 11
GC content: 39%
IMG OID: 641284392
Product: SNF2 family DNA/RNA helicase
Protein accession: YP_001549935
Protein GI: 159902591
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG0553] Superfamily II DNA/RNA helicases, SNF2 family
TIGRFAM ID:
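
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the gene length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(49870, 53073)
print(length)                  # 3204
print(protein_length(length))  # 1067
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of the record.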


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCTGC TACACGCTAC TTGGCTGCCA GCAATGCGAA CCGGAAGTTC GCATAATCCA 
GGACTACTCA TCTGGGCTGA TTCATGGAGA GTTGCAAAAC CAAGCATAGT CAGCAATCAG
CCTGTAATAC ATCCATTTGC CTTATCAGCA GCAGATTTAC GTATTTGGCT ATTGCAAAAA
AAGCTTTTAC CTAAAGAAAG TATTGAATGT ACAGCCTTAT TAACTCTACC TAGTAAATCT
ATTAAAAACT CATTAGACAA AAAATTAAAT GGAGTAACGG ACTCACAAAA TACTAGCGAT
CAACCTCAAT GGAGTGGACT ACCTTTACAA GCAGGAGAGC CAGTAACTAA ACAATGTGAA
TGGTGGCCCT GGCAAGTTGA AGGTATAGCA ATCAAACCCA GTGAAGCTGC ATCGTGGCTT
GCAAACTTAC CTCTCACGAA AAAAGATCCT GAGCTTAGTG AAGAGATCCT ATGGTGGAGT
CATTTAGAAC GTTGGTCTCT AAGTTTAATT GCTCGTGGCC TTTGGTTGCC ACAAGTTGAA
TTAAATACAA TTGATAATAT TGGAGCTAGA GCTAGGTGGA GTCCTTTACT TAATAACGAA
AACGAGCGCA AAAGATTAGA AGAATTCTCT ATCAGGCTTC CATTAGTAGC AACATGTGCC
ATAAAAAGAG AGGAAACTTC TGAAGAAAAT CAAAACCATA TATTAAAGAC TACTCCTAGG
GAAACACTCG ATGAATACGG ACTTGCAGTA TGTCGACCAA TCAATAGTCG ACTTCAAGTG
GCTTATCTCT TAGAAGAACT CGTGGATGGA CAGCTAAGAA AAGATTTTGA GGAAAGTTCT
GAAGACCTTG ATCCATTGCT GAAAGCTTGG CAAGAGGCAT TAGGATCACA TAATGGAGTC
ATTCGTCTTC CGTTGGAAGA TTGTGAAAGA TTAGCCAAGG CAAGTAAAAA TTGGAAAGAA
AATTTATCAG GCAATGTTAA AGGTGCAAGA GCATGCCTTG AGCTTTTTGC ACCACTTGAA
GGAGAAGATT TATGGGACTT ACAATTCTCT TTACAAGCTG AAGCAGATCC ATCACTAAAG
GTAGCAGCAG AAGCAGTATG GAATGCAGAC TCAGCAGTTC TACAGATTGG TGATATTCAA
ATAGCGCAGC CTGGAGAAAT TCTACTAGAA GGTCTTGGCA GAGCACTCAA TATCTTTCAA
CCAATAGAAA GGGGTCTGGA AAATGCTACT CCAAATAATA TGCAACTCAC ACCTGCAGAA
GCTTTTGTTC TAGTACGTAC AGCCTCAAAG CAATTACGTG ATATTGGTAT TGGTGTAATA
CTACCTAGAA GTTTATCAGG AGGATTAGCA AGTCGACTAG GTATAGCTAT TAAAGCAGAG
TTAGCGACTA GTGCCAGAGG ATTAACACTT CGAGAGAATC TAGAATGGAG TTGGGAGCTA
ATGATAGGGG GAAGCATATT AAGCCTTAAA GATCTAGAAC AACTGGCAAG TAAACGCAGC
CCTCTAGTTC GCTATAAGGA TTCATGGCTT GAATTACGTC CAAATGATCT TAAAATCGCC
GAAAAATTCT GTAGCAATAA TCCTGAATTA AGCCTAGATG ACGCATTAAG ACTTACCGCA
ACTAAAGGGG AGACTCTAAT GAAGCTTCCA GTACATCAAT TTAATGCTGG GCCAAAGCTC
CAAGGCGTTT TAGAGCAATA CCACCAACAT ACAAGTCCTG AGCCTCTAGC TGCACCAGAT
GGCTTCTATG GACAACTGAG GCCTTATCAA GAACGTGGCA TAGGATGGTT GGCTTTCTTG
CATCGTTTTA ATCAAGGTGC ATGTTTAGCA GATGACATGG GCCTGGGCAA AACAATTCAA
GTGCTTGCTT TTATTCAGCA CTTAAAAAGT AACAAGGACC TCAAGAAACC TGTTTTGCTA
ATTGCACCTA CGTCAGTATT AACAAACTGG AAACGAGAAG CTTATTCATT TACACCAGAG
TTATCTGTAT TAGAGCATTA CGGTCCTAAT CGTTCATCTA CATCAACACT CTTGAAAAAG
ATTCTCAAAA AAGTAGACAT TCTTATTACT AGCTATGGCC TACTACATAG AGATAAACAG
CTTCTGAAAA CAATTGATTG GCAAGGTGTA ATTATTGATG AAGCACAAGC TATAAAAAAT
CCAAATTCAA AACAAAGTCA AACAACTCGT GAAATTGTTA AAGGCGGAAA AATAATCCCT
TTTCGTATTG CATTAACTGG TACCCCTATA GAAAATCGTG TAAGTGAGCT TTGGTCATTA
ATGGATTTTT TAAATCCATC AGTACTTGGA GAAAAAGAAT TTTTTGATCA ACGCTACAAA
TTACCGATTG AACGTTATGG TGATATTTCT TCGTTAACCG ATCTCAAAGC TCGTGTCAGT
CCCTTTATTC TTAGAAGGTT AAAAAGTGAT AAATCAATTA TCTCGGATCT ACCAAGCAAA
GTCGAACTAA AAGAATGGAT TACTCTTAGT CAAGAGCAAA GAGCTCTTTA TAACAAAACT
GTAGACAATA CCTTACAGGA AATCGCAAGA AGTCCTATTG GTCAGCGTCA TGCGAAAACC
TTAGGTCTAT TAACACGTCT CAAACAAATA TGTAATCATC CTGCTCTTGC CCTCAAAGAA
AAAAACATTA GCGATGATTT TGGAATACGA TCAACCAAAC TTCAAAGGCT GGAAGAACTT
CTTGATGTGA TATTCGCAAC AGAGGACAGA GCTCTTCTTT TTACCCAATT CGCTGAATGG
GGTCACTTAC TACAAGCTTA TCTAGAAAAA AAGTGGGGAC ATAGCATACT TTTTCTACAT
GGAGGAACTC GCAAAATAGA TAGACAATCA ATGGTTGATC AATTTCAAGA AGATCCCAGA
GGCCCAAAAT TATTTTTACT TTCTCTCAAA GCAGGTGGTA TTGGTCTGAA CCTGACTCGA
GCTAACCACG TGTTGCATAT TGATCGATGG TGGAACCCTG CCGTAGAAAA TCAGGCAACA
GATCGTGCTT ATAGAATTGG TCAAAAAAAT AGCGTAATGG TTCACAAATT TATTGCTACA
GGGTCAGTAG AAGAAAAAAT TGATCAAATG ATTACTGAAA AGTCTAAGCT CGCAGAAAAT
ATAATTGGTG CAGGTGAAGA TTGGCTTGGC AAACTTGGCA TCAATGAATT ACGTGAATTA
GTTTCCTTAG AAAAAGAGAG TTAA
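
The gene sequence is printed in blocks of 10 bases, so the spacing has to be removed before any computation. The following is a minimal sketch of how the sequence could be cleaned and its GC fraction computed; the helper names are hypothetical, and the record does not say how its GC content figure was calculated or rounded.

```python
def clean_sequence(raw: str) -> str:
    """Remove block spacing and line breaks, and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence above.
sample = clean_sequence("ATGAGTCTGC TACACGCTAC")
print(len(sample))          # 20
print(gc_fraction(sample))  # 0.5
```

Applying `gc_fraction` to the full cleaned sequence gives a value that can be compared with the GC content field.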
 
Protein sequence
MSLLHATWLP AMRTGSSHNP GLLIWADSWR VAKPSIVSNQ PVIHPFALSA ADLRIWLLQK 
KLLPKESIEC TALLTLPSKS IKNSLDKKLN GVTDSQNTSD QPQWSGLPLQ AGEPVTKQCE
WWPWQVEGIA IKPSEAASWL ANLPLTKKDP ELSEEILWWS HLERWSLSLI ARGLWLPQVE
LNTIDNIGAR ARWSPLLNNE NERKRLEEFS IRLPLVATCA IKREETSEEN QNHILKTTPR
ETLDEYGLAV CRPINSRLQV AYLLEELVDG QLRKDFEESS EDLDPLLKAW QEALGSHNGV
IRLPLEDCER LAKASKNWKE NLSGNVKGAR ACLELFAPLE GEDLWDLQFS LQAEADPSLK
VAAEAVWNAD SAVLQIGDIQ IAQPGEILLE GLGRALNIFQ PIERGLENAT PNNMQLTPAE
AFVLVRTASK QLRDIGIGVI LPRSLSGGLA SRLGIAIKAE LATSARGLTL RENLEWSWEL
MIGGSILSLK DLEQLASKRS PLVRYKDSWL ELRPNDLKIA EKFCSNNPEL SLDDALRLTA
TKGETLMKLP VHQFNAGPKL QGVLEQYHQH TSPEPLAAPD GFYGQLRPYQ ERGIGWLAFL
HRFNQGACLA DDMGLGKTIQ VLAFIQHLKS NKDLKKPVLL IAPTSVLTNW KREAYSFTPE
LSVLEHYGPN RSSTSTLLKK ILKKVDILIT SYGLLHRDKQ LLKTIDWQGV IIDEAQAIKN
PNSKQSQTTR EIVKGGKIIP FRIALTGTPI ENRVSELWSL MDFLNPSVLG EKEFFDQRYK
LPIERYGDIS SLTDLKARVS PFILRRLKSD KSIISDLPSK VELKEWITLS QEQRALYNKT
VDNTLQEIAR SPIGQRHAKT LGLLTRLKQI CNHPALALKE KNISDDFGIR STKLQRLEEL
LDVIFATEDR ALLFTQFAEW GHLLQAYLEK KWGHSILFLH GGTRKIDRQS MVDQFQEDPR
GPKLFLLSLK AGGIGLNLTR ANHVLHIDRW WNPAVENQAT DRAYRIGQKN SVMVHKFIAT
GSVEEKIDQM ITEKSKLAEN IIGAGEDWLG KLGINELREL VSLEKES