Gene Haur_1958 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1958 
Symbol 
ID5733847 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2383832 
End bp2387833 
Gene Length4002 bp 
Protein Length1333 aa 
Translation table11 
GC content52% 
IMG OID641279102 
Producthypothetical protein 
Protein accessionYP_001544729 
Protein GI159898482 
COG category[V] Defense mechanisms 
COG ID[COG0286] Type I restriction-modification system methyltransferase subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATATA CGGCGATTAT CACCGAGGGC GGGTTGTTGC CTGCCGATAT GCTTCAAGCG 
ATTGCTGAGG GTGAGCAGGG CAGCCTCGCT GGTCAGCGCC CAGCCGATTT TGGCTTGCCT
GCCAATCGCC GTATGAGCGA TGATATTGCC GCTGCTTGGG GCCAGGTGCG GGCGCAATGG
CAGATTTTTC AGGCGGCAAT CGAGCGCCGC CCCCAAGATT CGCACACAAC CTTGACCCGC
CGTTATTGGG TCGAGCCGTT TTTGCAGTTG ATCGGCTATG AGCCAACCAG CACCAAAAGC
GCTCGCCGCG TTGATAATCG CACCTATGCT ATCAGTCATT CCGCCGATGA GCACGATGAT
TCGCCGCCTA TTCATATCGA GGGGATTAAG ACTGATATCG ACCGCCGCCC CGAAAGTGGT
CGCCCACGGA TTTCGCCCCA CGCTTTGATG CAGGAATATC TCAATAGCAC TGAACATACT
TGGGGGATTG TGACCAACGG CAGGCGTTTG CGTTTGCTGC GTGATTCGTC GCAAACTACC
CGCCCAAGTT TTGTCGAGTT CGATTTGGAG TCGCTGGTGA CTGGCCAGTT GTTCAATGAG
TTTGCGCTGC TTTATCGGAT TTTGCATCGC ACGCGCTTGC CAATTACCAG CGCCGATACC
GCCCAATCGT TGCTCGAACA GTATCATCAG CAGGCGCGTG AGGCTGGCGG TCGGGTGCGC
GAGGGCTTGC GCGAAGGGGT TGAACGGGCG CTCAAGTTGC TGGGCCAAGG CTTGTTGCGC
CATCCACGCA ACAGCGATTT GCGCCAGCGC TTTGCCACCA ATCAATTAAC TCCGCTGGAA
TATTATCGCC AGTTGCTCAA GTTGGTCTAT CGCTTGCTGT TTTTGATGGT CGCCGAGGAT
CGCGGGTTGA TCGAGGCCGA AACTGCTAGC GATAAATTAT CGGAGTTGGC GCAGCGTGGC
ACGCCCAGCG AACGGCTGAA ATTGTATTAT GAGCATTATA GCGTTGGCCG TTTGCGGCGT
TTGGCCGAGG TGCGCGGCGC TGGTCGCGGC CCTTACGATG ATATTTGGAT GGCGCTGCAA
CAAACGTTTC GGATTTTTGA GGGTACTGAT CTTAAAGCCA ATCGCTTGGG CATCGCAGCG
CTCGATGGCG ATTTGTTTGG CGAAGGCGCG ATTGGGGCGC TCGAAACTGC TCATTTGCGT
AATGCCGATG TTTTAGCGGC GCTGCGGGCG CTCTCGATCT ATGCCGATCC GCAGTCGCGG
GCTTTGCGGC GGGTCAATTA TGCGGCGCTC GATGTCGAGG AGCTGGGTAG TGTCTATGAG
TCGTTGCTCG ATTATCGTCC GGTGGTCGCT GGCACAAGCT TCGATTTGGT CGCTGGCACC
GAGCGCAAAA CCACAGGCTC GTATTACACT CGCCCCGAAT TGGTGCAGGA GCTGATCAAG
AGTGCGCTTG AGCCAATTAT TGCTGAACGC TTGCGCGATA AAAACCCGGA ACAGGCACTG
CTCTCAATTA CGGTCTGCGA CCCCGCTTGT GGTTCGGGCC ACTTTTTGCT GGCCGCTGCT
CGCCGCATTG GGCGCGAACT GGCACGGGTG CGCTCCGGCG AGGATCAGCC AACGCCCGAT
CAGTTTCGCC ATGCGGTGCG CGATGTGATT ACTCACTGTA TTTATGGAGT CGATTTCAAT
CCGTTGGCGG TCGATTTGTG TAAATTGGCG CTGTGGATCG AGGGCCATTG CGCGGGCATG
CCGCTTTCGT TTATTGATTA TCATATTCGT TGGGGCAATA GTTTGGTTGG CGCAACTGAA
GAACTGGTTA ATCAAGGGAT TCCCGATGAT GCGTTTAAAC CGGTGACTGG CGACGATAAA
ACGATCGCGA GCAATTTGCG CAAACGCAAC AAGCGCGAAC GTGAGGATAT TGCCAGCGGC
CAAATTACCA TGAATCTTGC GCCCAGCCAG CTTGATCATG CGACGCTCGG TCGGGCCACA
CGCCAACTTG AGGCCTTGCC CGATGATAGT GTGGCGGCAG TGCGGGCCAA AGCGGCTCGC
TACGCCCGCA TGCGCGAGCA AGAACGCCCA AACTGGACGC GCTACAATCT TTGGACGGCG
GCTTTTTTCC AGCCGATTAC CAAGGATACG CTGCCGCTGA TTCCAACTAG CGCTACCTTG
CACGCCTTTG ATACGGCCCG CCAAAGCGTC AGCGCTGGCC TGCTGGCTTG GGTCGATGGC
CTTGCCGACC AGCCTGAAAT GCGCTTTTTT CATTGGGAGT TGGAGTTTCC GCATATTTGT
GGCGAAGGTA GCCCGCGTGG TTTTGATGTG ATTTTGGGCA ACCCGCCGTG GGAGCGGATT
AAGCTGCAAG AGCAAGAGCA TTGGGTCGAT GTAGCCGAGA TTCGCGAGGC GGCCAATAAA
GCGGCGCGTG AGAAGTTGCT CAAGGCGTGG GCCAGCAGCA GCGAACCAAG CAAGCAACAG
CGTTATGCCA AATTTGAGCA TGCCAAATAT ATTGCCGAGG CTGCTAGCCG CTTTATTCGG
GTTTCGCAGC GCTACCCACT GACGGCGGTT GGCGATGTTA ATACCTATGC CTTGTTTTCC
GAGCTTGATC GCGATTTGAT CAATCGTAAA GGCCGCGCAG GCATTATTGT GCCAACTGGC
ATCGCCACCG ATGATACAAC TAAAGCCTTT TTTGGCGATT TAATCAAGAA ACAATCATTA
GAAAGGTTGA TTGGTTTTGA AAATGAAGCA TTTATTTTTC CTGAAGTACA TAACGCTTTC
AAATTCTGTG CACTTACAAT GGTAGGAAAT GATATTTCTA GCGAAACCCC TGACTTCATT
TTCTTATGTA GGTATTTTAG TGATATAGAA CAGGATGCTA GACATTTTAA TATGACTAGC
GATGAATTCG CCTTAATTAA TCCCAATACT TTGAATTGCC CTATATTTCG TACTAAAACT
GATGCACAGT TAACGAAAAA AATTTATCGG ATTGCCCCAA TCTTAGATAA CCAAAAAACG
AAACGAAATC CTTGGAATAT ATCGTTTGGT ACAATGTTTC ATATGGCAAA TGATAGTGGT
TTATTTAAAA ACGAATCCTC ACGCGATAGA ATGCCTTTAT ATGAAGCAAA AATGATATGG
CAATTTGATC ATCGATTTGC ATCACTCATA GGCAAAGAAA ATGCAGGCAA CAGATTATCC
AGAAAATATG AAGGCTGGTA TGGTGCAGAT TATGGCAACC CAGAAGATCT TCCAATTCCT
ACATACTGGA TTGATAGAGA GAGTATAGAG GATCGTATTC CAAGTAAGCA TCAAAATAAG
TGGTTATTGG TATTTCGTGA TATTACTAGC AGTGTTGTTG AACGAACGGC GATTTTTAGC
CTGATTCCAC GAGTGGCGGT AGGCCATACC GCACCTTTGA TTTTCCTAAC AGATATTAAT
TCTAGTTTGT TTTCCTGCTT CTTAAGCATA GTTAATAGCC TTTGCTTTGA CTACATAGTA
CGACAGAAGA TTGGTGGCAC ACATTTAACC TTTGGCTATG TCAAACAACT GCCCGTGCTG
CCACCCGAAC GCTTTGATGC AGCCCAGCTA GCCTTCATCG TGCCACGGGT TTTAGAGCTG
GTCTATACCG CGTGGGATCT GCAACCGTTC GCCGCAGATG TTTGGGCCGA ACTTGATGAA
ACGGGGCGGC AGGCACTTTT AGCCCAAAAC GCCGAGTGCA ACCGAGATGC GCCGCCGGAG
TGGTTCAGCC CACGTGATGG TTTTGCTTTG CCACCCTTCC GCTGGAGCGA CGAACGGCGG
GCGGTGTTGC GAGCCGAGCT TGATGCGCGG ATTGCGCGAT TGTATGGGCT AAGCCGCGAC
GAACTGCGCT ACATCCTCGA TCCGGCTGAA GTCTATGGCC CCGACTTCCC GGGCGAAACC
TTCCGCGTGT TGAAAGAAAA AGAGCTGAAA CAGTATGGTG AGTATCGCAC GCGGCGTTTA
GTGCTCGAAG CGTGGGATGG GGAGGATGCC CTCACCCCCT AG
 
Protein sequence
MKYTAIITEG GLLPADMLQA IAEGEQGSLA GQRPADFGLP ANRRMSDDIA AAWGQVRAQW 
QIFQAAIERR PQDSHTTLTR RYWVEPFLQL IGYEPTSTKS ARRVDNRTYA ISHSADEHDD
SPPIHIEGIK TDIDRRPESG RPRISPHALM QEYLNSTEHT WGIVTNGRRL RLLRDSSQTT
RPSFVEFDLE SLVTGQLFNE FALLYRILHR TRLPITSADT AQSLLEQYHQ QAREAGGRVR
EGLREGVERA LKLLGQGLLR HPRNSDLRQR FATNQLTPLE YYRQLLKLVY RLLFLMVAED
RGLIEAETAS DKLSELAQRG TPSERLKLYY EHYSVGRLRR LAEVRGAGRG PYDDIWMALQ
QTFRIFEGTD LKANRLGIAA LDGDLFGEGA IGALETAHLR NADVLAALRA LSIYADPQSR
ALRRVNYAAL DVEELGSVYE SLLDYRPVVA GTSFDLVAGT ERKTTGSYYT RPELVQELIK
SALEPIIAER LRDKNPEQAL LSITVCDPAC GSGHFLLAAA RRIGRELARV RSGEDQPTPD
QFRHAVRDVI THCIYGVDFN PLAVDLCKLA LWIEGHCAGM PLSFIDYHIR WGNSLVGATE
ELVNQGIPDD AFKPVTGDDK TIASNLRKRN KREREDIASG QITMNLAPSQ LDHATLGRAT
RQLEALPDDS VAAVRAKAAR YARMREQERP NWTRYNLWTA AFFQPITKDT LPLIPTSATL
HAFDTARQSV SAGLLAWVDG LADQPEMRFF HWELEFPHIC GEGSPRGFDV ILGNPPWERI
KLQEQEHWVD VAEIREAANK AAREKLLKAW ASSSEPSKQQ RYAKFEHAKY IAEAASRFIR
VSQRYPLTAV GDVNTYALFS ELDRDLINRK GRAGIIVPTG IATDDTTKAF FGDLIKKQSL
ERLIGFENEA FIFPEVHNAF KFCALTMVGN DISSETPDFI FLCRYFSDIE QDARHFNMTS
DEFALINPNT LNCPIFRTKT DAQLTKKIYR IAPILDNQKT KRNPWNISFG TMFHMANDSG
LFKNESSRDR MPLYEAKMIW QFDHRFASLI GKENAGNRLS RKYEGWYGAD YGNPEDLPIP
TYWIDRESIE DRIPSKHQNK WLLVFRDITS SVVERTAIFS LIPRVAVGHT APLIFLTDIN
SSLFSCFLSI VNSLCFDYIV RQKIGGTHLT FGYVKQLPVL PPERFDAAQL AFIVPRVLEL
VYTAWDLQPF AADVWAELDE TGRQALLAQN AECNRDAPPE WFSPRDGFAL PPFRWSDERR
AVLRAELDAR IARLYGLSRD ELRYILDPAE VYGPDFPGET FRVLKEKELK QYGEYRTRRL
VLEAWDGEDA LTP