Gene Haur_5126 details

Gene Information

Locus tag: Haur_5126
Symbol:
ID: 5737084
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009973
Strand:
Start bp: 173372
End bp: 177967
Gene Length: 4596 bp
Protein Length: 1531 aa
Translation table: 11
GC content: 50%
IMG OID: 641282291
Product: hypothetical protein
Protein accession: YP_001547882
Protein GI: 159901636
COG category:
COG ID:
TIGRFAM ID:
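
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the stated gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 173372
end_bp = 177967

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A coding sequence is read in codons of three bases.
codons = gene_length // 3

# Assumption: the last codon is a stop codon and encodes no amino acid.
protein_length = codons - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the listed Gene Length (4596 bp) and Protein Length (1531 aa).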


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATGATC GCGCCCTGCG TCTTGCCCTG GAAGCCTTGC TGGATGAACC CACCCTGAAA 
TCAACGCATC GCGCGTCGAT TGAGCGTGGC CTTACCGATC TCAAGACGCA GCGCATAACT
CCCGATGAGT GGGAATATCT TTGCTATCTC ATCGAGCACT ATGGAGCATC AACGCCGCTA
CCTGCCGATC CCGTGGCCGC ATTGATCGCC GCCTTTCAAG ACAAATTACC GTCTGTTGAT
CCCATAGCCT TGAGTACGGC GATCCACGCC CTCGTGGGGG GTGATACCGC CCAGCTTGAT
GGCAAGTCGC TCGCGCTTAC GATTGAACGC CGCGGGCAAA CCCAGATTGC GAACTTGGCG
GAACGCAATG TCGTGATGGG GAATCTGAAT CAGCTTGATA TTTCCGTACA TATCCCAATC
CCTGATGCCC GCTTTGATAG CCTGATGCAG GCCGCGTTCC AGCGCCAAAC GCATGAACGC
ATCACCAACG CTGCCCAACA AACCGAGAGC GATCGCCTGC TGAATCTGCG CCTTCAGGGC
TTTGTTGGTC GGGACAATGA ACTAGCAGCC ATTCGTGCGC AGATTGCAAC GGTACGACCC
ACGGGCGGCT ATGTATTAAT CAAATCAGAA GCAGGTGAAG GTAAGAGCAG CATCATTGCT
AAACTGATTC AGGATGCGGG GTTTGCGCAA ACCCCGCATC ACTTTATTGC CCTGACTACG
GGCCGAGACT ATCAATTGAG CTTGCTCCGC GCGGTGGTGG CGCAATTGAT TCTGAAGCAT
AGGCTGCCTG TTTCATACTT CCCCGAAGAA AGTTATTCCA CAATGAAGGG GGAATTTGCC
CGTATTCTTG ACGAGTTATC CAAACACGGC ATTCAGGAAA CAATTTATCT CGATGGCCTC
GATCAACTTC CACCGGAGGG GGATAGGTTG ATCGATCTAT CGTTTTTGCC ATCACAGCCG
CCACCAGGGA TCGTGATTGT ACTTGGGTCG CGACCGGATG AAGCCTTCAA ACCGTTACAC
CACTTGAACA AAGCTGCGTA TCATTTACCA CCAATGAGCG AAATAGATGC ATTCACCGTA
TGGCGATCAG TCCAGTCTGG CGTGGCAGAT ACTTTGTTTC ATGACCTCTA TACCGCATTA
AAAGGCAATG CATTGTTTGT CTATTTAGCG GCGGATACGA TACGCAATCA ATCTGTGGTC
GATATGGCCA GTTTAATTCG ACAAATTGAG CAGAATCCAA AGGACATATT TGGTATTAAA
CTGGAGCGGA TCAAAAACCG TTCTGAGCAT CGTTGGCCAA CGATATGGAG GCCCATGTTG
GCGCTCTTAG TCGTCACACA AGAGCCATTA CCTATGAGGG TTATTGGGCG GTTATTGGGG
CAACCACGGG CCGTGATTCA GGATGCCGTA TGGGTAATGG GTGGCTTGGT CAGCCAAGGC
AGCGATCAAC GGGTGGCGCT GCATCATCTG CTGTTTCGTG AGTATATCCA AGCACACGAG
TTTGATAGTG AGGAACTTCA TGACATACAT CAACGGCTGG CCGACTGGTG TGCGCAGGAT
GGGGATGCGA TTTGGGCCGA TCATCGTGAT GCATTAGAGC AAGCGCGTCG GGTCTATGCG
CGGCATCACT ACATCACCCA TCTTGCACTG GCCGAGAATT GGCCAACACT CTGGCGGGTT
TTGGATGCAG GTGACTATGG CGCACAGAAA ACGCGCTTTG ATCCCAGTAT GCGGCTCTAT
GCGCTGGATT TGGATCGGGG GCGTGAGAGT GCCATTAAGG CTGGGCAATC GACCGAAGAA
CATATCCAGC ATTTACCACG CCTGTGGAAG TATAGTTTAT TGCGGACAAG TTTAACCAAT
CGCGTCGATC AGTGGCCAAA TAAAGTGTTC GCGATTTTGG CAACGATTGG GCAAACACAC
GAGGTATTAG AACGAATAGA GCTCCTTTCG AGCACGAGGA AGCAGGTTCA GTGTTGGGAT
ACGATTCTTC CATGGTGTGA TACACAACAA CAGAAGACAA TGCTTGTCCG ATTGTATCAA
GTAGCTGGAC ATCTTTCGGG TATCGATGAG ATTTTCACCC TCAGTATCAT TGCGAAAATA
ACCATCATGC TTGGAGAAAA TGAACAGGCG ATAGGGATTC TTGACCATGC CCTTGCCATT
GCCCATACCC TTAAAACTCC AGAACAATGT AACGAGGCAA TCCGTGCCAT TACGAAAATA
ACGACAATGC TTGCTACGAA TGAACATGTA CTAGGGATTC TTGACCGATC CTTCACTATC
GCTCAATGCA TTGACCATGC AAACTATCGT TCAGAGGCCC TTAGTGTCAT TGCAGCAGCT
ATTGCTGTTC ATGGTGATAG AGGCAGAGTT CTGACACGCA ACCAGATGAT TGATACTTCA
GGACGACGCA ACGAAACCAT CAGTGTCATT GCAGCAGCTA TTGCTGTTCA TGGCGATATC
GACTATGCTT TGACCATCAC CCAATCTATT GATAATCCAG TACAATATAT CTTGGCCATT
GCTTTCATCG CAAACATACT GACCATATTG GATAAACCTG AACACGTACT AGGGATTCTT
GACCGTGCCC TGTTTATCGC CCAGTCCATT GATAATCCGG AACAACGTGA CAAAGCCATG
GGTGCCATTG CAGCAGTGAT TGTGACCCAT GGCGATATCG ACCGTGCCCT GTCTATCGCC
CAATCCATTG ATAATGTATG GCAGCGTGAC AGCAGCCTTG CTGAAATTGC GGTAGTCATC
GCTACTCATG GCGATATCGA CCGTGCCCTA TCTATCGCCC AATCCATTGA TCCTCCAAAG
GGGCATACAT TGACTCTCGC TGCCATTGCG GTAGCCATCG CTACCCATGG CAATATCGAC
CGTGCCCTGT CTATCACCCA ATCTATTGAA CACACAAAGG AGCGGGCTGA GACCATGAGT
AGCGTTGCAG TAGCCGCTGC TACCCATGGC GATATCGACC GTGCCGTTAT TATTGCTCAA
TCTATTAACT CATGGCAATA TTCTGAGACC ATGAGTAGCG TTGCAGTAGC CGCTGCCACC
CATGGCGATA TCAACCGGGC CCTCTCCATC ACGCAATCTA TTGAGTCCAC AAAGGAGCGG
GCTGAGACCA TGAGTAGAAT TGCAGAAGTT GCTGTTACCT ATGGACAACA TGAACAAGCA
CTGGCGATTC TCAACATGGC CCTATCATCC ATCACCCAAT CCGTTAACGA GGCATGGCAA
CAACGTGATG AAGCTATCAA GGCTATTGCC GTAGCCATCG CTACCAATGG TGATATCGAC
CGTGCCCTGA CCATCGCTCA ATCCATTGAT AATATATGGG AACGTTATCA GACACTTGGT
ATCCTTGCTG AAGCTATTGC TGGCTATGGC GATATCGACC ATGCCCTGAC CATCGCTCAA
TCCATTGACG ATCCTGTGCA ATGTACCTGG GCTCTCGCTG CCATTGCAGA AGTCACTATT
ACTCGTGGGC AGGATCAACA AGCATTAGCG ATTCTCAACA TGGCCCAGAC CATTACACAC
GCCATTGACA ACGCAAAGGG ACGGGCCATG GCATTCCCTG CCATCGCAGA AGTCACGGCC
ACCCATGGCG ATATCGACCA TGCCTTGTCC ATCACTCAAT CTATCGATGG GTCATGGGAA
CGTAATAGGA CTCTTGCTGC CATTGTGGAA GCCATCGCTT CCCATGGCAA TATCGACCGT
GCCCTGTCTA TCGCCCAGTC CATTGATAAT CCGGAACAAC GTAACAAAGC CATGGGTGCC
ATTGCAGCAG TGATCGTGAC CCGTGGCGAT ATCGACCGTG CCCTGTCTAT CGCCCAATCC
ATTGATAATG TATGGCAGCG TGACAGCACC CTTGCTGAAA TTGTGGTAGT CATCGCTTCC
CATGGTGATA TTGATCGTGC CCTCACCATC GCCCAATCCA TTGATCCTCA AAAGGAGCGG
GCTGAGGCCC TTGCTGCCAT CGCAGAATTC ACTGTTACCC CTGGGCAGCG GAAACAAGCG
TTAGAAATAC TCAACACGAC CCTAACTATT TTCCACACTC TTGACGATCA ACTAGGATTA
CACGATGAGG TTATCGGTGC CATTGCGGTA GCCATCGCTT CCCATGGCGA TATCGACCGT
GCCCTGTCCA TCACCCAATT TATTGGCAAT ACATGGCCAC GACGTGACAA TACCGTGGGT
GCCATTGCAA AAGTCGCAGC CACCCATGGC GATATCGACC GTGCCCTGTC TATCGCCCAA
TTCATCGATA ATCTCGATCG ACGTGATAAT GCCGTGGGTG CCATTGCAGA GGCTACTGCC
ACCCGTGGCG ATACCGATCG TGCCCTGACA ATCGCTCAAT CCATTGATGA TCCAACTCTA
CGGGCTCAAA CCTTTCACGT GATTCTTCAA AAAGCTCAAT CAGTCATAGG GATTTTAAGG
ATACTTCATC ATGAATGGTT TCGGAGCCTA AGATCTACCG ATGTATGGGC AATGACAGTG
ATTTCGACTC CATTAATAAA CGAATATCCA TGGCTTGGTG TAACCATGCT TGATGCAGAG
GAATGGGTGA ATGTACAGTT GAAACGGCTG GAGTAA
 
Protein sequence
MDDRALRLAL EALLDEPTLK STHRASIERG LTDLKTQRIT PDEWEYLCYL IEHYGASTPL 
PADPVAALIA AFQDKLPSVD PIALSTAIHA LVGGDTAQLD GKSLALTIER RGQTQIANLA
ERNVVMGNLN QLDISVHIPI PDARFDSLMQ AAFQRQTHER ITNAAQQTES DRLLNLRLQG
FVGRDNELAA IRAQIATVRP TGGYVLIKSE AGEGKSSIIA KLIQDAGFAQ TPHHFIALTT
GRDYQLSLLR AVVAQLILKH RLPVSYFPEE SYSTMKGEFA RILDELSKHG IQETIYLDGL
DQLPPEGDRL IDLSFLPSQP PPGIVIVLGS RPDEAFKPLH HLNKAAYHLP PMSEIDAFTV
WRSVQSGVAD TLFHDLYTAL KGNALFVYLA ADTIRNQSVV DMASLIRQIE QNPKDIFGIK
LERIKNRSEH RWPTIWRPML ALLVVTQEPL PMRVIGRLLG QPRAVIQDAV WVMGGLVSQG
SDQRVALHHL LFREYIQAHE FDSEELHDIH QRLADWCAQD GDAIWADHRD ALEQARRVYA
RHHYITHLAL AENWPTLWRV LDAGDYGAQK TRFDPSMRLY ALDLDRGRES AIKAGQSTEE
HIQHLPRLWK YSLLRTSLTN RVDQWPNKVF AILATIGQTH EVLERIELLS STRKQVQCWD
TILPWCDTQQ QKTMLVRLYQ VAGHLSGIDE IFTLSIIAKI TIMLGENEQA IGILDHALAI
AHTLKTPEQC NEAIRAITKI TTMLATNEHV LGILDRSFTI AQCIDHANYR SEALSVIAAA
IAVHGDRGRV LTRNQMIDTS GRRNETISVI AAAIAVHGDI DYALTITQSI DNPVQYILAI
AFIANILTIL DKPEHVLGIL DRALFIAQSI DNPEQRDKAM GAIAAVIVTH GDIDRALSIA
QSIDNVWQRD SSLAEIAVVI ATHGDIDRAL SIAQSIDPPK GHTLTLAAIA VAIATHGNID
RALSITQSIE HTKERAETMS SVAVAAATHG DIDRAVIIAQ SINSWQYSET MSSVAVAAAT
HGDINRALSI TQSIESTKER AETMSRIAEV AVTYGQHEQA LAILNMALSS ITQSVNEAWQ
QRDEAIKAIA VAIATNGDID RALTIAQSID NIWERYQTLG ILAEAIAGYG DIDHALTIAQ
SIDDPVQCTW ALAAIAEVTI TRGQDQQALA ILNMAQTITH AIDNAKGRAM AFPAIAEVTA
THGDIDHALS ITQSIDGSWE RNRTLAAIVE AIASHGNIDR ALSIAQSIDN PEQRNKAMGA
IAAVIVTRGD IDRALSIAQS IDNVWQRDST LAEIVVVIAS HGDIDRALTI AQSIDPQKER
AEALAAIAEF TVTPGQRKQA LEILNTTLTI FHTLDDQLGL HDEVIGAIAV AIASHGDIDR
ALSITQFIGN TWPRRDNTVG AIAKVAATHG DIDRALSIAQ FIDNLDRRDN AVGAIAEATA
TRGDTDRALT IAQSIDDPTL RAQTFHVILQ KAQSVIGILR ILHHEWFRSL RSTDVWAMTV
ISTPLINEYP WLGVTMLDAE EWVNVQLKRL E
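
Both sequences above are printed in space-separated blocks of ten characters, so they need to be joined before use. The sketch below is one possible way to do that and to translate the gene sequence. It builds the codon table by hand rather than relying on a library, and it assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which start codons are allowed, which this sketch does not model). The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments of the standard code,
# assumed here to match translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(blocks: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGGATGATC GCGCCCTGCG TCTTGCCCTG GAAGCCTTGC TGGATGAACC CACCCTGAAA"
last_line = "GAATGGGTGA ATGTACAGTT GAAACGGCTG GAGTAA"

print(translate(first_line))
print(translate(last_line))
```

The first line translates to the opening twenty residues of the protein sequence, and the last line ends at the TAA stop codon. Translating `last_line` on its own works because each full line holds 60 bases, a multiple of three, so every line begins in frame.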