Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_28381 |
Symbol | hepA |
ID | 4777403 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 2507041 |
End bp | 2510340 |
Gene Length | 3300 bp |
Protein Length | 1099 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640088361 |
Product | SNF2 family DNA/RNA helicase |
Protein accession | YP_001018833 |
Protein GI | 124024526 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0553] Superfamily II DNA/RNA helicases, SNF2 family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.590649 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTGGTT GTGGAACTCC TGCGTGGATG GTTGCCGTTG ATCGGCAGTG CACTCCTGCT CCAAGAAACC CAACACATAC TTTTTGCGTC GCGGCCATGA GCCTGCTGCA CGCCACCTGG CTTCCAGCCA TCCGTACTCC GACCAGCTCC GGTCGCCCTG CGCTCCTTGT GTGGGCAGAT ACCTGGCGAG TCGCTACCCC AGCAGGACCA GCAGCAACTC CCGCACTCCA CCCCTTCACA CTCAACCCAG ACGATCTACG TGCCTGGCTG ATTGAGCGCG ATCTACTGCC CGATGAAATC ATCGACGCCA CAGCATGTCT GACCCTGCCT AGCCGAACAG TCAAACCGCG CAGCAAAGCC AAGAACGTAT CCACTGAATC CGACGAAGAC AAAGACCACA AAACAAGTTG GACAGGACTG CCCTTACAAG CAGGCGAACC CATTCCCAAA CAGACTGAAT GGTGGCCCTG GCAGGTGCAA GGCCTGGCAG TGGAGCCTGC TGCTGCAACG GCCTGGCTTT CGAAACTGCC TCTTTCAGGA GATCATCCTG ATCTCGCCGA TGAATTGCGC TGGTGGAGCC ATCTACAGCG CTGGGCCCTG AGCATGATTG CTCGCGGACG TTGGCTACCC CAGGTGGAAC TCAGCAAGGG AGAGGGCTAT CCCCACCGAG CACGCTGGAC ACCGCTACTC AACCGTGAAG ATGATCGCCG CCGCCTCGAA GACCTTGCCG CTCAGCTCCC CTTAGTGGCC ACCTGCGCCC TCCCCTGGCG GGAGCCCACC GGAAGGCGTA GCAACCGAAT GACCCGCCTA AGACCAGAGG CGATGCGAGC CGCTAACCCT GTGGCTTCAT GCCGACCCCG CAGCGGTCGC CTTCGCGTAG CCAGCCTGCT GGAAGAACTC TTGGATGCCC AACTGCGCAC CGGATTTGAA GCGAGTGAGC AAGGCCTAGA CCCATTGCTC ACAGCCTGGC AGGAAGCACT GGGGTCGGAC AGCGGCGTGA TCAACCTCCC CGATGAGGAA GCCGAACGTC TAGCGACAGC AAGCAACCAT TGGCGAGAAG GCGTGGCTGG CAACGTCGCA CCAGCCAGGG CCTGCTTAGA ACTCTTCACT CCCGGCGAAG GGGAAGACCT CTGGGAGCTG CGCTTCGCCT TACAGGCTGA GGCTGATCCC ACGATCAAAG TACCGGCCGC AGCAGCCTGG GCAGCGGGTC CCAAGGTCCT GCAACTAGGC GAAATCCGTG TGGAACATCC AGGCGAGGTG CTACTGGAAG GCATGGGGCG AGCCCTCACG GTGTTTGCAC CGATCGAACG AGGCCTCGAC AGCGCCACAC CAGAAGCAAT GCAGCTCACC CCTGCTGAAG CCTTTGTATT GGTGCGCACT GCAGCGGCCC AACTGCGTGA TGTTGGCGTT GGCGTGGAAT TGCCTGCCAG CCTCTCGGGA GGGCTGGCCA GTCGCCTAGG CCTAGCGATC AAGGCGGAGC TATCGGAGAG ATCTAGAGGT TTCACTTTGG GCGAAACCCT CGACTGGAGT TGGGAGCTCA TGATCGGTGG CGTCACCCTG ACGCTTCGCG AGCTGGAGCG ACTAGCAAGC AAGCGCAGCC CGCTTGTCAA CCACAAGGGC GCCTGGATCG AATTACGCCC CAACGATCTC AAAAATGCGG AACACTTCTG CAGCGTCAAT CCAGGCATCA GCCTCGACGA TGCCTTGCGC CTTACCGCAA CCGATGGCGA CACGCTGATG AGACTGCCCG TTCACCGCTT TGAGGCCGGT CCACGACTAC AGGCGGTGTT GGAGCAGTAC CACCAGCAAA AAGCTCCCGA CCCCCTACCT GCTCCCGAAG GCTTCTGCGG TCAGCTAAGG CCTTATCAGG AAAGGGGTCT GGGTTGGCTG GCCTTCCTGC ATCGCTTCGA TCAAGGGGCA TGCCTGGCCG ACGACATGGG CCTGGGCAAA ACGATCCAGC TACTGGCATT CCTGCAACAT CTCAAGGCGG AACAGGAACT CAAACGGCCG GTATTGCTTA TCGCTCCCAC ATCCGTACTT ACCAACTGGA AGAGAGAGGC ATTGGCCTTC ACACCAGAGT TAAACGTCCG AGAACACTAT GGGCCGCGTC GGCCCTCTAC CCCCGCCGCC TTAAAGAAAG CACTCAAAGG CTTAGACCTC GTTCTCACCA GTTACGGGCT CCTGCAGCGA GATAGTGAGC TCCTGGAAAC GGTCGACTGG CAAGGAGTGG TCATCGATGA AGCCCAAGCC ATTAAGAACC CCAACGCCAA ACAGAGCCAA GCAGCACGCG ATATGGGCCG CCCAGACAAA AACAATCGCT TCAGGATTGC TCTTACCGGC ACACCCGTCG AAAACCGAGT CAGTGAACTT TGGGCACTGA TGGACTTCCT CAACCCAAGG GTTCTCGGTG AAGAAGACTT CTTCCGCCAG CGCTACCGGC TGCCAATTGA ACGCTATGGC GACATGTCTT CCCTGCGAGA CCTCAAAGGC CGTGTTGGTC CCTTCATCCT GAGACGACTA AAAACCGACA AGGCAATCAT CTCCGACCTA CCTGAAAAGG TAGAGCTGAG CGAATGGGTG GGTCTGAGCA AAGAACAGGC AGCCCTCTAT CGCAACACAG TGGATGAAAC ACTGGAGGCC ATTGCCCGCG CACCCAGTGG TCAACGTCAT GGCAAGGTGC TCGGCTTGCT TACCCGACTG AAGCAAATCT GCAACCATCC CGCCCTAGCC CTCAAAGAAA AAACCGTTGC AAAAGGCTTC ATGGACCGCT CCGCCAAGCT GCTGCGTTTG GAAGAAATTC TCGAGGAAGT GATCGAGGCA GGAGATCGCG CTCTGTTATT CACCCAATTC GCAGAATGGG GTCATCTCCT TAAGGCCTAC CTGCAACAAC GCTGGCGCTT TGAAGTTCCC TTCCTGCACG GCAGCACAAG CAAAACTGAA CGTCAGGCCA TGGTTGATCG CTTCCAGGAG GATCCACGTG GACCCCAACT GTTCCTGCTG TCACTCAAAG CCGGTGGCGT AGGCCTAAAC CTCACGCGGG CTAGCCATGT GTTTCATGTC GATCGCTGGT GGAATCCTGC CGTAGAAAAC CAGGCCACTG ATCGCGCTTA CAGGATCGGA CAAACCAATC GGGTGATGGT GCACAAATTC ATCACCAGCG GCTCAGTTGA AGAGAAAATT GATCGCATGA TTCGCGAAAA ATCTCGACTT GCCGAAGACA TCATTGGCTC TGGAGAAGAC TGGTTAGGTG GCTTAGGCGT CAGTCAATTG CGCGAACTAG TGGCCCTAGA AGACAGCTGA
|
Protein sequence | MIGCGTPAWM VAVDRQCTPA PRNPTHTFCV AAMSLLHATW LPAIRTPTSS GRPALLVWAD TWRVATPAGP AATPALHPFT LNPDDLRAWL IERDLLPDEI IDATACLTLP SRTVKPRSKA KNVSTESDED KDHKTSWTGL PLQAGEPIPK QTEWWPWQVQ GLAVEPAAAT AWLSKLPLSG DHPDLADELR WWSHLQRWAL SMIARGRWLP QVELSKGEGY PHRARWTPLL NREDDRRRLE DLAAQLPLVA TCALPWREPT GRRSNRMTRL RPEAMRAANP VASCRPRSGR LRVASLLEEL LDAQLRTGFE ASEQGLDPLL TAWQEALGSD SGVINLPDEE AERLATASNH WREGVAGNVA PARACLELFT PGEGEDLWEL RFALQAEADP TIKVPAAAAW AAGPKVLQLG EIRVEHPGEV LLEGMGRALT VFAPIERGLD SATPEAMQLT PAEAFVLVRT AAAQLRDVGV GVELPASLSG GLASRLGLAI KAELSERSRG FTLGETLDWS WELMIGGVTL TLRELERLAS KRSPLVNHKG AWIELRPNDL KNAEHFCSVN PGISLDDALR LTATDGDTLM RLPVHRFEAG PRLQAVLEQY HQQKAPDPLP APEGFCGQLR PYQERGLGWL AFLHRFDQGA CLADDMGLGK TIQLLAFLQH LKAEQELKRP VLLIAPTSVL TNWKREALAF TPELNVREHY GPRRPSTPAA LKKALKGLDL VLTSYGLLQR DSELLETVDW QGVVIDEAQA IKNPNAKQSQ AARDMGRPDK NNRFRIALTG TPVENRVSEL WALMDFLNPR VLGEEDFFRQ RYRLPIERYG DMSSLRDLKG RVGPFILRRL KTDKAIISDL PEKVELSEWV GLSKEQAALY RNTVDETLEA IARAPSGQRH GKVLGLLTRL KQICNHPALA LKEKTVAKGF MDRSAKLLRL EEILEEVIEA GDRALLFTQF AEWGHLLKAY LQQRWRFEVP FLHGSTSKTE RQAMVDRFQE DPRGPQLFLL SLKAGGVGLN LTRASHVFHV DRWWNPAVEN QATDRAYRIG QTNRVMVHKF ITSGSVEEKI DRMIREKSRL AEDIIGSGED WLGGLGVSQL RELVALEDS
|
| |