Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_00501 |
Symbol | hepA |
ID | 5730888 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 49870 |
End bp | 53073 |
Gene Length | 3204 bp |
Protein Length | 1067 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 641284392 |
Product | SNF2 family DNA/RNA helicase |
Protein accession | YP_001549935 |
Protein GI | 159902591 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0553] Superfamily II DNA/RNA helicases, SNF2 family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCTGC TACACGCTAC TTGGCTGCCA GCAATGCGAA CCGGAAGTTC GCATAATCCA GGACTACTCA TCTGGGCTGA TTCATGGAGA GTTGCAAAAC CAAGCATAGT CAGCAATCAG CCTGTAATAC ATCCATTTGC CTTATCAGCA GCAGATTTAC GTATTTGGCT ATTGCAAAAA AAGCTTTTAC CTAAAGAAAG TATTGAATGT ACAGCCTTAT TAACTCTACC TAGTAAATCT ATTAAAAACT CATTAGACAA AAAATTAAAT GGAGTAACGG ACTCACAAAA TACTAGCGAT CAACCTCAAT GGAGTGGACT ACCTTTACAA GCAGGAGAGC CAGTAACTAA ACAATGTGAA TGGTGGCCCT GGCAAGTTGA AGGTATAGCA ATCAAACCCA GTGAAGCTGC ATCGTGGCTT GCAAACTTAC CTCTCACGAA AAAAGATCCT GAGCTTAGTG AAGAGATCCT ATGGTGGAGT CATTTAGAAC GTTGGTCTCT AAGTTTAATT GCTCGTGGCC TTTGGTTGCC ACAAGTTGAA TTAAATACAA TTGATAATAT TGGAGCTAGA GCTAGGTGGA GTCCTTTACT TAATAACGAA AACGAGCGCA AAAGATTAGA AGAATTCTCT ATCAGGCTTC CATTAGTAGC AACATGTGCC ATAAAAAGAG AGGAAACTTC TGAAGAAAAT CAAAACCATA TATTAAAGAC TACTCCTAGG GAAACACTCG ATGAATACGG ACTTGCAGTA TGTCGACCAA TCAATAGTCG ACTTCAAGTG GCTTATCTCT TAGAAGAACT CGTGGATGGA CAGCTAAGAA AAGATTTTGA GGAAAGTTCT GAAGACCTTG ATCCATTGCT GAAAGCTTGG CAAGAGGCAT TAGGATCACA TAATGGAGTC ATTCGTCTTC CGTTGGAAGA TTGTGAAAGA TTAGCCAAGG CAAGTAAAAA TTGGAAAGAA AATTTATCAG GCAATGTTAA AGGTGCAAGA GCATGCCTTG AGCTTTTTGC ACCACTTGAA GGAGAAGATT TATGGGACTT ACAATTCTCT TTACAAGCTG AAGCAGATCC ATCACTAAAG GTAGCAGCAG AAGCAGTATG GAATGCAGAC TCAGCAGTTC TACAGATTGG TGATATTCAA ATAGCGCAGC CTGGAGAAAT TCTACTAGAA GGTCTTGGCA GAGCACTCAA TATCTTTCAA CCAATAGAAA GGGGTCTGGA AAATGCTACT CCAAATAATA TGCAACTCAC ACCTGCAGAA GCTTTTGTTC TAGTACGTAC AGCCTCAAAG CAATTACGTG ATATTGGTAT TGGTGTAATA CTACCTAGAA GTTTATCAGG AGGATTAGCA AGTCGACTAG GTATAGCTAT TAAAGCAGAG TTAGCGACTA GTGCCAGAGG ATTAACACTT CGAGAGAATC TAGAATGGAG TTGGGAGCTA ATGATAGGGG GAAGCATATT AAGCCTTAAA GATCTAGAAC AACTGGCAAG TAAACGCAGC CCTCTAGTTC GCTATAAGGA TTCATGGCTT GAATTACGTC CAAATGATCT TAAAATCGCC GAAAAATTCT GTAGCAATAA TCCTGAATTA AGCCTAGATG ACGCATTAAG ACTTACCGCA ACTAAAGGGG AGACTCTAAT GAAGCTTCCA GTACATCAAT TTAATGCTGG GCCAAAGCTC CAAGGCGTTT TAGAGCAATA CCACCAACAT ACAAGTCCTG AGCCTCTAGC TGCACCAGAT GGCTTCTATG GACAACTGAG GCCTTATCAA GAACGTGGCA TAGGATGGTT GGCTTTCTTG CATCGTTTTA ATCAAGGTGC ATGTTTAGCA GATGACATGG GCCTGGGCAA AACAATTCAA GTGCTTGCTT TTATTCAGCA CTTAAAAAGT AACAAGGACC TCAAGAAACC TGTTTTGCTA ATTGCACCTA CGTCAGTATT AACAAACTGG AAACGAGAAG CTTATTCATT TACACCAGAG TTATCTGTAT TAGAGCATTA CGGTCCTAAT CGTTCATCTA CATCAACACT CTTGAAAAAG ATTCTCAAAA AAGTAGACAT TCTTATTACT AGCTATGGCC TACTACATAG AGATAAACAG CTTCTGAAAA CAATTGATTG GCAAGGTGTA ATTATTGATG AAGCACAAGC TATAAAAAAT CCAAATTCAA AACAAAGTCA AACAACTCGT GAAATTGTTA AAGGCGGAAA AATAATCCCT TTTCGTATTG CATTAACTGG TACCCCTATA GAAAATCGTG TAAGTGAGCT TTGGTCATTA ATGGATTTTT TAAATCCATC AGTACTTGGA GAAAAAGAAT TTTTTGATCA ACGCTACAAA TTACCGATTG AACGTTATGG TGATATTTCT TCGTTAACCG ATCTCAAAGC TCGTGTCAGT CCCTTTATTC TTAGAAGGTT AAAAAGTGAT AAATCAATTA TCTCGGATCT ACCAAGCAAA GTCGAACTAA AAGAATGGAT TACTCTTAGT CAAGAGCAAA GAGCTCTTTA TAACAAAACT GTAGACAATA CCTTACAGGA AATCGCAAGA AGTCCTATTG GTCAGCGTCA TGCGAAAACC TTAGGTCTAT TAACACGTCT CAAACAAATA TGTAATCATC CTGCTCTTGC CCTCAAAGAA AAAAACATTA GCGATGATTT TGGAATACGA TCAACCAAAC TTCAAAGGCT GGAAGAACTT CTTGATGTGA TATTCGCAAC AGAGGACAGA GCTCTTCTTT TTACCCAATT CGCTGAATGG GGTCACTTAC TACAAGCTTA TCTAGAAAAA AAGTGGGGAC ATAGCATACT TTTTCTACAT GGAGGAACTC GCAAAATAGA TAGACAATCA ATGGTTGATC AATTTCAAGA AGATCCCAGA GGCCCAAAAT TATTTTTACT TTCTCTCAAA GCAGGTGGTA TTGGTCTGAA CCTGACTCGA GCTAACCACG TGTTGCATAT TGATCGATGG TGGAACCCTG CCGTAGAAAA TCAGGCAACA GATCGTGCTT ATAGAATTGG TCAAAAAAAT AGCGTAATGG TTCACAAATT TATTGCTACA GGGTCAGTAG AAGAAAAAAT TGATCAAATG ATTACTGAAA AGTCTAAGCT CGCAGAAAAT ATAATTGGTG CAGGTGAAGA TTGGCTTGGC AAACTTGGCA TCAATGAATT ACGTGAATTA GTTTCCTTAG AAAAAGAGAG TTAA
|
Protein sequence | MSLLHATWLP AMRTGSSHNP GLLIWADSWR VAKPSIVSNQ PVIHPFALSA ADLRIWLLQK KLLPKESIEC TALLTLPSKS IKNSLDKKLN GVTDSQNTSD QPQWSGLPLQ AGEPVTKQCE WWPWQVEGIA IKPSEAASWL ANLPLTKKDP ELSEEILWWS HLERWSLSLI ARGLWLPQVE LNTIDNIGAR ARWSPLLNNE NERKRLEEFS IRLPLVATCA IKREETSEEN QNHILKTTPR ETLDEYGLAV CRPINSRLQV AYLLEELVDG QLRKDFEESS EDLDPLLKAW QEALGSHNGV IRLPLEDCER LAKASKNWKE NLSGNVKGAR ACLELFAPLE GEDLWDLQFS LQAEADPSLK VAAEAVWNAD SAVLQIGDIQ IAQPGEILLE GLGRALNIFQ PIERGLENAT PNNMQLTPAE AFVLVRTASK QLRDIGIGVI LPRSLSGGLA SRLGIAIKAE LATSARGLTL RENLEWSWEL MIGGSILSLK DLEQLASKRS PLVRYKDSWL ELRPNDLKIA EKFCSNNPEL SLDDALRLTA TKGETLMKLP VHQFNAGPKL QGVLEQYHQH TSPEPLAAPD GFYGQLRPYQ ERGIGWLAFL HRFNQGACLA DDMGLGKTIQ VLAFIQHLKS NKDLKKPVLL IAPTSVLTNW KREAYSFTPE LSVLEHYGPN RSSTSTLLKK ILKKVDILIT SYGLLHRDKQ LLKTIDWQGV IIDEAQAIKN PNSKQSQTTR EIVKGGKIIP FRIALTGTPI ENRVSELWSL MDFLNPSVLG EKEFFDQRYK LPIERYGDIS SLTDLKARVS PFILRRLKSD KSIISDLPSK VELKEWITLS QEQRALYNKT VDNTLQEIAR SPIGQRHAKT LGLLTRLKQI CNHPALALKE KNISDDFGIR STKLQRLEEL LDVIFATEDR ALLFTQFAEW GHLLQAYLEK KWGHSILFLH GGTRKIDRQS MVDQFQEDPR GPKLFLLSLK AGGIGLNLTR ANHVLHIDRW WNPAVENQAT DRAYRIGQKN SVMVHKFIAT GSVEEKIDQM ITEKSKLAEN IIGAGEDWLG KLGINELREL VSLEKES
|
| |