Gene P9303_28381 details

Gene Information

Locus tag: P9303_28381
Symbol: hepA
ID: 4777403
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 2507041
End bp: 2510340
Gene Length: 3300 bp
Protein Length: 1099 aa
Translation table: 11
GC content: 58%
IMG OID: 640088361
Product: SNF2 family DNA/RNA helicase
Protein accession: YP_001018833
Protein GI: 124024526
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG0553] Superfamily II DNA/RNA helicases, SNF2 family
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.590649
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTGGTT GTGGAACTCC TGCGTGGATG GTTGCCGTTG ATCGGCAGTG CACTCCTGCT 
CCAAGAAACC CAACACATAC TTTTTGCGTC GCGGCCATGA GCCTGCTGCA CGCCACCTGG
CTTCCAGCCA TCCGTACTCC GACCAGCTCC GGTCGCCCTG CGCTCCTTGT GTGGGCAGAT
ACCTGGCGAG TCGCTACCCC AGCAGGACCA GCAGCAACTC CCGCACTCCA CCCCTTCACA
CTCAACCCAG ACGATCTACG TGCCTGGCTG ATTGAGCGCG ATCTACTGCC CGATGAAATC
ATCGACGCCA CAGCATGTCT GACCCTGCCT AGCCGAACAG TCAAACCGCG CAGCAAAGCC
AAGAACGTAT CCACTGAATC CGACGAAGAC AAAGACCACA AAACAAGTTG GACAGGACTG
CCCTTACAAG CAGGCGAACC CATTCCCAAA CAGACTGAAT GGTGGCCCTG GCAGGTGCAA
GGCCTGGCAG TGGAGCCTGC TGCTGCAACG GCCTGGCTTT CGAAACTGCC TCTTTCAGGA
GATCATCCTG ATCTCGCCGA TGAATTGCGC TGGTGGAGCC ATCTACAGCG CTGGGCCCTG
AGCATGATTG CTCGCGGACG TTGGCTACCC CAGGTGGAAC TCAGCAAGGG AGAGGGCTAT
CCCCACCGAG CACGCTGGAC ACCGCTACTC AACCGTGAAG ATGATCGCCG CCGCCTCGAA
GACCTTGCCG CTCAGCTCCC CTTAGTGGCC ACCTGCGCCC TCCCCTGGCG GGAGCCCACC
GGAAGGCGTA GCAACCGAAT GACCCGCCTA AGACCAGAGG CGATGCGAGC CGCTAACCCT
GTGGCTTCAT GCCGACCCCG CAGCGGTCGC CTTCGCGTAG CCAGCCTGCT GGAAGAACTC
TTGGATGCCC AACTGCGCAC CGGATTTGAA GCGAGTGAGC AAGGCCTAGA CCCATTGCTC
ACAGCCTGGC AGGAAGCACT GGGGTCGGAC AGCGGCGTGA TCAACCTCCC CGATGAGGAA
GCCGAACGTC TAGCGACAGC AAGCAACCAT TGGCGAGAAG GCGTGGCTGG CAACGTCGCA
CCAGCCAGGG CCTGCTTAGA ACTCTTCACT CCCGGCGAAG GGGAAGACCT CTGGGAGCTG
CGCTTCGCCT TACAGGCTGA GGCTGATCCC ACGATCAAAG TACCGGCCGC AGCAGCCTGG
GCAGCGGGTC CCAAGGTCCT GCAACTAGGC GAAATCCGTG TGGAACATCC AGGCGAGGTG
CTACTGGAAG GCATGGGGCG AGCCCTCACG GTGTTTGCAC CGATCGAACG AGGCCTCGAC
AGCGCCACAC CAGAAGCAAT GCAGCTCACC CCTGCTGAAG CCTTTGTATT GGTGCGCACT
GCAGCGGCCC AACTGCGTGA TGTTGGCGTT GGCGTGGAAT TGCCTGCCAG CCTCTCGGGA
GGGCTGGCCA GTCGCCTAGG CCTAGCGATC AAGGCGGAGC TATCGGAGAG ATCTAGAGGT
TTCACTTTGG GCGAAACCCT CGACTGGAGT TGGGAGCTCA TGATCGGTGG CGTCACCCTG
ACGCTTCGCG AGCTGGAGCG ACTAGCAAGC AAGCGCAGCC CGCTTGTCAA CCACAAGGGC
GCCTGGATCG AATTACGCCC CAACGATCTC AAAAATGCGG AACACTTCTG CAGCGTCAAT
CCAGGCATCA GCCTCGACGA TGCCTTGCGC CTTACCGCAA CCGATGGCGA CACGCTGATG
AGACTGCCCG TTCACCGCTT TGAGGCCGGT CCACGACTAC AGGCGGTGTT GGAGCAGTAC
CACCAGCAAA AAGCTCCCGA CCCCCTACCT GCTCCCGAAG GCTTCTGCGG TCAGCTAAGG
CCTTATCAGG AAAGGGGTCT GGGTTGGCTG GCCTTCCTGC ATCGCTTCGA TCAAGGGGCA
TGCCTGGCCG ACGACATGGG CCTGGGCAAA ACGATCCAGC TACTGGCATT CCTGCAACAT
CTCAAGGCGG AACAGGAACT CAAACGGCCG GTATTGCTTA TCGCTCCCAC ATCCGTACTT
ACCAACTGGA AGAGAGAGGC ATTGGCCTTC ACACCAGAGT TAAACGTCCG AGAACACTAT
GGGCCGCGTC GGCCCTCTAC CCCCGCCGCC TTAAAGAAAG CACTCAAAGG CTTAGACCTC
GTTCTCACCA GTTACGGGCT CCTGCAGCGA GATAGTGAGC TCCTGGAAAC GGTCGACTGG
CAAGGAGTGG TCATCGATGA AGCCCAAGCC ATTAAGAACC CCAACGCCAA ACAGAGCCAA
GCAGCACGCG ATATGGGCCG CCCAGACAAA AACAATCGCT TCAGGATTGC TCTTACCGGC
ACACCCGTCG AAAACCGAGT CAGTGAACTT TGGGCACTGA TGGACTTCCT CAACCCAAGG
GTTCTCGGTG AAGAAGACTT CTTCCGCCAG CGCTACCGGC TGCCAATTGA ACGCTATGGC
GACATGTCTT CCCTGCGAGA CCTCAAAGGC CGTGTTGGTC CCTTCATCCT GAGACGACTA
AAAACCGACA AGGCAATCAT CTCCGACCTA CCTGAAAAGG TAGAGCTGAG CGAATGGGTG
GGTCTGAGCA AAGAACAGGC AGCCCTCTAT CGCAACACAG TGGATGAAAC ACTGGAGGCC
ATTGCCCGCG CACCCAGTGG TCAACGTCAT GGCAAGGTGC TCGGCTTGCT TACCCGACTG
AAGCAAATCT GCAACCATCC CGCCCTAGCC CTCAAAGAAA AAACCGTTGC AAAAGGCTTC
ATGGACCGCT CCGCCAAGCT GCTGCGTTTG GAAGAAATTC TCGAGGAAGT GATCGAGGCA
GGAGATCGCG CTCTGTTATT CACCCAATTC GCAGAATGGG GTCATCTCCT TAAGGCCTAC
CTGCAACAAC GCTGGCGCTT TGAAGTTCCC TTCCTGCACG GCAGCACAAG CAAAACTGAA
CGTCAGGCCA TGGTTGATCG CTTCCAGGAG GATCCACGTG GACCCCAACT GTTCCTGCTG
TCACTCAAAG CCGGTGGCGT AGGCCTAAAC CTCACGCGGG CTAGCCATGT GTTTCATGTC
GATCGCTGGT GGAATCCTGC CGTAGAAAAC CAGGCCACTG ATCGCGCTTA CAGGATCGGA
CAAACCAATC GGGTGATGGT GCACAAATTC ATCACCAGCG GCTCAGTTGA AGAGAAAATT
GATCGCATGA TTCGCGAAAA ATCTCGACTT GCCGAAGACA TCATTGGCTC TGGAGAAGAC
TGGTTAGGTG GCTTAGGCGT CAGTCAATTG CGCGAACTAG TGGCCCTAGA AGACAGCTGA
 
Protein sequence
MIGCGTPAWM VAVDRQCTPA PRNPTHTFCV AAMSLLHATW LPAIRTPTSS GRPALLVWAD 
TWRVATPAGP AATPALHPFT LNPDDLRAWL IERDLLPDEI IDATACLTLP SRTVKPRSKA
KNVSTESDED KDHKTSWTGL PLQAGEPIPK QTEWWPWQVQ GLAVEPAAAT AWLSKLPLSG
DHPDLADELR WWSHLQRWAL SMIARGRWLP QVELSKGEGY PHRARWTPLL NREDDRRRLE
DLAAQLPLVA TCALPWREPT GRRSNRMTRL RPEAMRAANP VASCRPRSGR LRVASLLEEL
LDAQLRTGFE ASEQGLDPLL TAWQEALGSD SGVINLPDEE AERLATASNH WREGVAGNVA
PARACLELFT PGEGEDLWEL RFALQAEADP TIKVPAAAAW AAGPKVLQLG EIRVEHPGEV
LLEGMGRALT VFAPIERGLD SATPEAMQLT PAEAFVLVRT AAAQLRDVGV GVELPASLSG
GLASRLGLAI KAELSERSRG FTLGETLDWS WELMIGGVTL TLRELERLAS KRSPLVNHKG
AWIELRPNDL KNAEHFCSVN PGISLDDALR LTATDGDTLM RLPVHRFEAG PRLQAVLEQY
HQQKAPDPLP APEGFCGQLR PYQERGLGWL AFLHRFDQGA CLADDMGLGK TIQLLAFLQH
LKAEQELKRP VLLIAPTSVL TNWKREALAF TPELNVREHY GPRRPSTPAA LKKALKGLDL
VLTSYGLLQR DSELLETVDW QGVVIDEAQA IKNPNAKQSQ AARDMGRPDK NNRFRIALTG
TPVENRVSEL WALMDFLNPR VLGEEDFFRQ RYRLPIERYG DMSSLRDLKG RVGPFILRRL
KTDKAIISDL PEKVELSEWV GLSKEQAALY RNTVDETLEA IARAPSGQRH GKVLGLLTRL
KQICNHPALA LKEKTVAKGF MDRSAKLLRL EEILEEVIEA GDRALLFTQF AEWGHLLKAY
LQQRWRFEVP FLHGSTSKTE RQAMVDRFQE DPRGPQLFLL SLKAGGVGLN LTRASHVFHV
DRWWNPAVEN QATDRAYRIG QTNRVMVHKF ITSGSVEEKI DRMIREKSRL AEDIIGSGED
WLGGLGVSQL RELVALEDS
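
The coordinates and lengths listed under Gene Information can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the original record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The helper names (span_length, expected_protein_length, strip_blocks) are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2507041
END_BP = 2510340
GENE_LENGTH_BP = 3300
PROTEIN_LENGTH_AA = 1099


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def strip_blocks(seq_text: str) -> str:
    """Remove the spaces and line breaks used to print a sequence in blocks of 10."""
    return "".join(seq_text.split())
```

Under these assumptions, span_length(START_BP, END_BP) agrees with the listed Gene Length, and expected_protein_length(GENE_LENGTH_BP) agrees with the listed Protein Length. Passing the pasted Gene sequence or Protein sequence text through strip_blocks gives a contiguous string whose length can be compared with the same fields.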