Gene Haur_2224 details


Gene Information

Locus tag: Haur_2224
Symbol:
ID: 5734111
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2821969
End bp: 2829453
Gene Length: 7485 bp
Protein Length: 2494 aa
Translation table: 11
GC content: 50%
IMG OID: 641279365
Product: YD repeat-containing protein
Protein accession: YP_001544992
Protein GI: 159898745
COG category:
COG ID:
TIGRFAM ID:
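
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that adds no amino acid. Neither is stated in the record, but both agree with the listed lengths. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2821969
end_bp = 2829453

# Assumption: coordinates are 1-based and inclusive, so the span
# covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# A non-spliced CDS is read in codons of 3 bp. Assumption: the last
# codon is a stop codon and does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 7485 0 2494
```

The zero remainder shows that the gene length is a whole number of codons, as expected for a gene that is neither spliced nor a pseudogene.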


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCGTAA ACCGCCCCTA TCGACGATTG ATTCACGTTG GAATGCTCTT GAGCATGCTC 
TGGTCGCTCT TCCCAGCGAG CCAGCCAGCC ACCGCCCAAA CTGATCAGTC CAACAAAACG
GTTGAGCAAG CGTTTGCTCC CGATACTGGC CTAACACGCG ATCCGCAAAC CCCGCGTTTA
GCAACTGCGC TACCAAAAAA AGCCAGCAGC AGGGCTGATT TTATGCCTGT GGCGGCCCAA
AACTTAAGCA TTCAGCCGCA ACTTAATCAG GTGGTCACTG CCCAACTTGA TAGCGCCAAA
ACCACCGCCG TACAATTCGA GCAAGCGCCG TTGAGCTTGT TGGTTGAGGC CGATACCTTC
GCGACCGCTA CGCGCTTGGA GTTTCAAGCT CAAGCCTTGC CAAACTTGAC CCAACAGCTC
CAACGCAGCA GCAAAAGTGA AGGCTACCTG CGCGACCAAG CTGAGCAAGT TACGTTCTAT
CGCTTTAACA TCGAAGCCCA AACCAACGAT CAAGCTGCCA ACTTCAAAAA ACCAGTGCGC
ATGGTGCTCG ATTTGCGCCA ACTCATGCGC GATGTTCCCA GCAATTATCA ACAATTTTAT
TTGGCCTACC AAGATCAAGC CGATCCCAAC CACTGGATCG AAGTGCCAAT TACCTTACAC
GATGCCAAGG GCTTGATTAG CGCCGATGTT GACCATTTTT CGACATGGGC AGCTGGCACA
CGGCCAGAGC GTTGGAATCC CAGCTGGGTT CCAGCAGCAG TTGCCAGTTT TAGCGGAGCA
GCCACCTATG CCTACCCAAT TGAAGGCCCA ATGGGTCGTG GTGGCCTGCA ACCAAAAGTC
GAATTAAGCT ACAACAGCCG GAGCCTTGAT GGCCGGATTC GCGATGACAC TGGTGCAGGC
CCACTGGGCG ATGCTTGGTC AATCAGCGAT ATTAGCATTG CACGGGTTGG GGTAAAAACT
GAATTCGTTG CGGGCTTTCC ATCCAACAAA CATCCCGACA ACTTCCGTTT GACGATCAAC
GGCGCTGGAC ATGAACTCTT CCCTGAGTTT CCCAGCCAAA CTGCCACTGC CAGCAGCATG
CGCTATTTTG CCAAAGATGC TCCAGGCTAC TACATCAAAC GGGTCTATAA CACGGCCACC
CCCAATACCG ATGGAATATA TTGGATTGTG GTAACGCCAA CGGGTGTGAC CTATCGTTTG
GGTTATTATC CTCACGCCGA AGAACACCAA ATTTGGGATA TTGGCTATTG GAATGTGCAA
GGCCATCAAG GCCGCCCCAA TAACGAACGC TCAGCCTTGG CTTGGCATGT CGATACGGTC
ACCGATCGCG CTGGCAACCA GATGACCTAT CAATATGTCA ACTGGACAGT CACCGAGCCA
ATTGAGTGGT ATCTCCAAGG CTCAAGCCAT AAATCGACCT TGCAATTAAC TACGTGGAAA
AGCCGCATCG GCAGCATTAG CTACAATTAC CCAAATCGGG TGACGGCCTT GCCAGTCAGC
GATACCGTTG CTCAATTGAG CACTACTCCT GCCAGCCGCT TGGTATTTAC CACCAAAACC
CAATTTGCCT ACCTGATCGA CACAATTTAT GTCTATCATG GCTCGTTGAG CACCCCAATT
AAAGAATATC GGATTAACTT ATCAGGCCAT TTCGTTGATA GTCCGGCCTG TATGAACCAA
GATACTCAGC CGAATATTCC GCGCTCAACT CATACTCGGG TGGTCAATGC GATTACGGTC
GCTAGCGGGG TTGATGCCGA CCCAACAACC GAAGATGGTT GGACTTTGCC CGCCACCAGC
TTTACCTACG AAGCCAAACC ACATTACAAC AACAATTGTT TCTCTTTCTA CTATCTCAAG
AGCATGCGCA GCAATTATGG TGGTGAAATC AGCTTCAATT ACGCCTCGGA TAATCGTTGG
ATTGGCGATT ATACCTACCT AGGCTATGAT CGGTATGTTT GGCCAAGTTT GGGCCAAAGC
TATTACGTGG TCGAAACTTT AGCCAACGAC GGGCGTAATC CTGCCGTCAA AACCACCTAC
AGCTATAGCC AGCCCTGTTA TGGCCAATGG TCAAGCAACG TTCCCGCAGG CGCAATCACT
TGTGGCGCAA GCGATGCTCC CGAGTTTGGC ACAATCACTG GCTTTGCCAC GGTCAATCAA
CGCAGCTACG ATTTTAACGG CACAACGCTG CTCAAACGCA ACGAAACAAT TTTCTCGCAA
AATAGTGCTA CCACCAGTGG CAAGCCCATG ATTCAGCGCA ATTTTGCTGG CGATGGAACG
TTGTTGGATG CAACCCACAG CACCTACAAC ACCGATAGCC TGCATGGCTT GCCCAATATG
TTCACCTATT TGAGCGAAGT TAAGAGCTAT CAATATAGCA ACGGCCTTGA AATTTCGACC
AAGCGCACGT TTGGTTACGA TGTGGCCAAA CAAGCTGGCG TGCAATATGG CAACCTCACC
GACACTTGGC TCTATGCTAG TGCTGCTGCA ACGACACCTT ACGAAAAACA AGTTACCTAT
TACACGCCCA ACAATGGCCG AGCCACTGGA GGCCAAGCCT GGATGGTCAA CGCGCCCACA
GCCAGCGGTC GTTACGATCA ACAAGGCACA TTTTTAAACG GCACATGGAT CTATTATGAT
GGGGCCACAA CCAACGCTAC TGCCCCAACC CAAGGCTTGG TCACCCGCAG CCGCCAAACT
CGGCCTATTA CTTGTGCTGA GATTCCAAAT CCAAGCGGCT TGCCACAACC AGCTGACCCT
AATTGTGTCC ATGCCTTCCA AACCATCGAT AGCGATATGA CTTACGATAG CTTTGGCAAC
CCCAAAACGA CCACAATCTA CAGTGGCTTT GGCTATCGTT CGCTCAAAGC CAACTTTAGC
GATTCGCTCG ATTGGAAGCC CACCGAAACT GGCCAACCAA ACCTGAGCCA AGTTGCCAGT
TTGTGGTACG ACAGCGATTA CAATCTATAT CCAGTCAAAA CCACTAACGC ATTAAATCAA
GCTACAACCT ACGAAATTTA TGGTTTCCGC AACGATGCTG GCAACATCGC CGCCGTTGAT
GGCTTCCAAA TTCAGACTGG CTTGCTCAAA AGTGTCACCA ATCCCGATGC CACGATTGTA
CGCTACGAGT ACGATCCATT TGGTCGCTTA GTCAATACTT TCGATAGCTA TAGCTTCACG
GGCTTTGGCG ATAGCACCAA GTGGAACGGC AATCCAGTTA TTCGCTACCG CTATTGGGAT
AATTATTGGA ACGACAGCGC GGTTTTTGCC AATCCTGCTG CCAATCAACC ATTCCTGATC
AGCGATGAAA AGCGGCCTGG CAGCTATGCC AACCCCAGCA GCACTGGCAA TTTCGCCTAC
AACGACCAAA CCATGTACGA TGGTTTTGGC CGCGCTATCC AAAGCCGCCA TATTTGGGCC
GATGTTGATG GCGAAGCCAA GCGCCAAGAA ATTTATAGCA CAACCGCCTA CAATGCACTC
GGACAGACAA TCTGCCAAAC TGCGCCGTTC AACCTGCCGT TCTACATCGA TCGCGGTTTG
GTTTGGCCAG CGTCACCCTT CGTCACCACG CCATGCAGCG ATAGCAGTAT GGCCAAAACC
CTGACCAGCT ACGATAATTT TGGGCGGGTC AAGCAAACCA CTGCCGCCGA TGGCAGCCTT
AACAAAGCCA ATTGGAGCTT AGTTAACAAC ATCACGGTTG CTGGCCAAAA TCTCTTCTGG
CAGCATCAAC AAATTAATCC CAAGAACCAA CTAGAAATGC GCTTGATCAA CAATCAAGAA
CAGTTGGTCT TGCGGCGCGA ATATCGCGGT ACTGCCGATA GCCCAATCGT TTATAGCGAT
ACCCAGTTCC AATACGATAC CCTCGGCAAT ATCAACAAAA TCAGCCGCCG CCAACCTAGC
AACGCTGGCA ACGGAGCGTT GATCGCACCC GAAGCAACCA TGGTTTACAA CGGCTTTGGC
CATAAGTTGC AAATCAACGA CCCCGATATG GGCACAATCA AGTATCGCTA TAACGCCAAT
GCGCGAATTA TTGAGCAACG CACGCTCAAC GATAGTCTGC TTACGAACGA TGACGATGTT
GTGTGTTTCT ATTTCGATGC GTTGCAGCGC AATACCAGCA AAAATACCAC CAATGCTGGC
GCTAATTGTT CTAACACGCC AATTTTGAAC GGGGCGTTGT GGCTAGCCAA CTCAAGCTAC
TACAGCAGTG GCGCAGGCAA AATCGGCAAA CTCCAATCAG TTAAATGGAA TCGTGATGGT
AATGGCGCGG TTGATGGCGA AAGTTTCAAC TACAATAGCC TTGGCTTGCT CACCAGCCAT
ACGCGCACGC TCAATGGCGT AAGCTTTAGC ATGCAATTTG GCGACTTCGA TGCCCTCAAT
CGCGCCACCA CAATCACCTA TCCCGATGGC GAAGTCGCAA CCATCACCCA CGATTTAGAA
GGCGAAAATA GCCTGAGTTT GGGCAGCCAT GGTGCGTTGG TAAGCAATAT CGAATACAAC
GCCCGTGGCC ACATCAGTCT AATCGATCGT ACAAATGGCG GGCATAACAC GGTCTTTAAT
TATTATGGCG CAACTGGCAC GGCCAACACA GGCAACAGCA ATTTCCGCCT AGCTAGCATC
AATCATCAAC ATAGCCTGTT GCCGAGCTAT ACCTACGAGT ACGATCAAAT CGGCAATATC
AGCCTCCTCT ACGAAAGTGG CTCGCTCTCG GGCAATACCT ATTTCAATTA TGATGAATTG
GATCGTTTGA CCAGCACCAG CGGCATTTAC AGCCATATCT ATGCTTATGA CAAGCTGGGT
AACTTAACCA ACAATAATGG CATTGCTCAA ACCTACAACG GCATAGGTAC TCAGCCGCAT
GCGCTGCGTT CAACCAGCCA AGGCAATTTC TTCGAGTACG ATCAAGCAGG CAATATGATC
GTGCGCAACG ATGCTAGCGG CCTGTATCAA CAAGCCTTCG ATGTTGAGCA ACGCTTGTAT
GAAGTGATCG ATCAACACGA TCAAACCACG CGCTTCCGCT ATGATCCCAG CGGCCAGCGC
ACTACCACCT TCGCCGCTGA TGGCACAGTT ACCTACGATC CTTTCCCGAA TTATCAACGC
ACGACCGTCA GCAGCAGCAA CTCAGCCGTG GATAGCCTCA ACGCTGGAGT TCTGTGTAGC
GATTACAACC CTGAAACAAA AAGCTATGGC GGCGCTGGCT ATATTATGTA CAGTGAAATC
CCAGTCAAAC AGCGCTTTGG CAATTTGCCA GCGGCCAACA TCAGCGACCA CTTCATCTGT
GTGCGCAACA ATACTGGCGT TTGGGAATAC GATAACGATG CTGGCTTCTA TGCATTCACT
CCGATTGCCA GCGACTTGTT GGTCGCCAGT TTCAACTACA ATGCAACCAC AGTCAGCCCA
TATCTCAATC AATCAGGGGC GATTTATGGC CTGCGCTATG GCTATACCAC CAGCAACCTA
GCCTTCAGCA AAGATGTCTT TGGTGGAACC AACAATCCAG GCGAGTTCGA AATTGCTGGA
ACCCACTTCA AAACCAATGC CTTTGCCCAA AGCGTCGCCA ACCATGGCTA TGGTGTGGCT
TGCCAAGAAG ATGCAACTGG CACAGGCTAC CTGATGTACA GCGCTGAATC GGTGCATAGC
CGCTTCGCCG AGCAAGCGCC CGATATCAAC AATGCTGCGC ACTTCATCTG TGTACGCCAC
AACGGCCAAA CCTGGCAATA CGATAATAAT TCGGCCTATT TCGCCTTTAC TCCACGCCAA
AGCGACCGCT TGATCGGCGC AATCGATTTC AGCAACGATA GCTACACCAG CTATGTCGGC
CAAACTGGCA CGATCTTGGG CATGCAAAAA GGCCTAAGCA GCAGCAACTT AACCATCACG
GTCAATCAAT GGAATGGCGA GAGCAACCCA GGCGAATTTG GGATTGCTGG CTTGAATTTC
ACGCCGCAAG CCTACGAAGT AACGATCACA TCGGCAGGCA TGGGCATCAA TTGTTTGGAT
ACCGCGACAG GCACTGGCTA TATCATGCAC AGTCGCCAAG CCCTGAACCA ACGCTTTAGC
CAATCGATCC CAGCGCAACT TGCCAGCAAA CACTTTGTTT GTGTGCGCTA TAACAGCACG
CTCAGCACTT GGCAATACGA TGATGGCAGC AATTACTATG GCTTCACGCC ACGCGCAAGC
GACACCTTGG TTGCCAGCGT CAACTTCAGC ACCGACCAAG TAACGAGCCT CGCGGGAGCA
AGCGACAGCG AATTTGGCAT AACCAAAGGC TTTGTCAGCG GTATTAGCAT CGTCGCCAAT
CAATGGGGCG GCAATAGCAA TGCTGGTGAG TTTCAAGTTA TTGGCAACAA TTTAACCACG
CATACGATTG ATATCGGCAG CAAAACTGTG GCAATCGCTG GCACGCCAAT CGCTACCCGC
CGCAAACATA GTGTCGCCAC AAGCTTGGTT GATCAATCAT TAGTGTTTGT GTATGTCGAT
AAGCTCGGCA GCGCCAACAC TTTGATGGAT CAAACGGGCA CAGCGATTTT GAATAATGTA
CGCTATCTGC CATTTGGCGA GGAACGCCTA GGCCTCAACT CAGCCTATAG CGATCGCGGC
TTTACTGGCC ATCAAGAGAA TCGTGAGCTT GGCCTAACCT ATATGAATGC CCGCTTCTAT
CTGCCAAGCA CTGGCCGCTT TATCAGTGCC GACAGCATGA TTCCTGAGCC GAGTAATCCC
CAAAGTTTCA ATCGCTATAG CTATGTCTAC AACAACCCAA TCAACGCGAC TGATCCTTCG
GGTCACTTGC CAGGTGATGA TGAACCTGAA ATTCCAAATC CGTTCCCTGA ATCGAATCCA
ATCCCTAATA TGGAGTACAG CGCCTATAGA AAATGGCTTA ACTTCTGGCA AGCCTATACC
GACAACGATA ATCCCTACAT CATGATTAAA CAAGGAGATC AACTTGTTGA AAGCTCAGTT
GCAATCAAAG CGGGGTCTGT CAGTCTAAGC CCAGATAGCG TTAGTGTTGC TGGAAATTGG
GGATTGGTAG GTGCTGAAGT ATCAATGCCA GGAGCATATA AGAAAGGTGA AGGCGATAAT
TTCTTAGAAA CACTGTGGGA AGGAACATCA GCAAAAATAT TAATAGGTCC ACAGATCGAC
TTAATTGTTG TCGAAGTGAT GCCTTTAGCA TTGGGGATCG ATCCATTTAC TGGTAATATT
ACCCTCGAAA CAAGCGCAGG AATTGGACTA GTCGAGGGAA GTACTACAAT CAATCCGTTT
GCCAAAACAG ATGCGATCTA TGTTTTGCAA ATCAGCGATG AATTGCATGA TAAGCTGATG
GGTCGAGATC CATGTGCCTT GAGTGTTTCG TTCCAAGATC AACAAGCAGC TTGGACGGAA
TTAATCAATG TAGTCAACAG CTATGGGTTT AATGGCACTG ACCCTCGCTG GACACTCCAT
TCAATTCCAA ATTACGTCTA CAATGGCGAG GAAAACACAC CCTGA
 
Protein sequence
MSVNRPYRRL IHVGMLLSML WSLFPASQPA TAQTDQSNKT VEQAFAPDTG LTRDPQTPRL 
ATALPKKASS RADFMPVAAQ NLSIQPQLNQ VVTAQLDSAK TTAVQFEQAP LSLLVEADTF
ATATRLEFQA QALPNLTQQL QRSSKSEGYL RDQAEQVTFY RFNIEAQTND QAANFKKPVR
MVLDLRQLMR DVPSNYQQFY LAYQDQADPN HWIEVPITLH DAKGLISADV DHFSTWAAGT
RPERWNPSWV PAAVASFSGA ATYAYPIEGP MGRGGLQPKV ELSYNSRSLD GRIRDDTGAG
PLGDAWSISD ISIARVGVKT EFVAGFPSNK HPDNFRLTIN GAGHELFPEF PSQTATASSM
RYFAKDAPGY YIKRVYNTAT PNTDGIYWIV VTPTGVTYRL GYYPHAEEHQ IWDIGYWNVQ
GHQGRPNNER SALAWHVDTV TDRAGNQMTY QYVNWTVTEP IEWYLQGSSH KSTLQLTTWK
SRIGSISYNY PNRVTALPVS DTVAQLSTTP ASRLVFTTKT QFAYLIDTIY VYHGSLSTPI
KEYRINLSGH FVDSPACMNQ DTQPNIPRST HTRVVNAITV ASGVDADPTT EDGWTLPATS
FTYEAKPHYN NNCFSFYYLK SMRSNYGGEI SFNYASDNRW IGDYTYLGYD RYVWPSLGQS
YYVVETLAND GRNPAVKTTY SYSQPCYGQW SSNVPAGAIT CGASDAPEFG TITGFATVNQ
RSYDFNGTTL LKRNETIFSQ NSATTSGKPM IQRNFAGDGT LLDATHSTYN TDSLHGLPNM
FTYLSEVKSY QYSNGLEIST KRTFGYDVAK QAGVQYGNLT DTWLYASAAA TTPYEKQVTY
YTPNNGRATG GQAWMVNAPT ASGRYDQQGT FLNGTWIYYD GATTNATAPT QGLVTRSRQT
RPITCAEIPN PSGLPQPADP NCVHAFQTID SDMTYDSFGN PKTTTIYSGF GYRSLKANFS
DSLDWKPTET GQPNLSQVAS LWYDSDYNLY PVKTTNALNQ ATTYEIYGFR NDAGNIAAVD
GFQIQTGLLK SVTNPDATIV RYEYDPFGRL VNTFDSYSFT GFGDSTKWNG NPVIRYRYWD
NYWNDSAVFA NPAANQPFLI SDEKRPGSYA NPSSTGNFAY NDQTMYDGFG RAIQSRHIWA
DVDGEAKRQE IYSTTAYNAL GQTICQTAPF NLPFYIDRGL VWPASPFVTT PCSDSSMAKT
LTSYDNFGRV KQTTAADGSL NKANWSLVNN ITVAGQNLFW QHQQINPKNQ LEMRLINNQE
QLVLRREYRG TADSPIVYSD TQFQYDTLGN INKISRRQPS NAGNGALIAP EATMVYNGFG
HKLQINDPDM GTIKYRYNAN ARIIEQRTLN DSLLTNDDDV VCFYFDALQR NTSKNTTNAG
ANCSNTPILN GALWLANSSY YSSGAGKIGK LQSVKWNRDG NGAVDGESFN YNSLGLLTSH
TRTLNGVSFS MQFGDFDALN RATTITYPDG EVATITHDLE GENSLSLGSH GALVSNIEYN
ARGHISLIDR TNGGHNTVFN YYGATGTANT GNSNFRLASI NHQHSLLPSY TYEYDQIGNI
SLLYESGSLS GNTYFNYDEL DRLTSTSGIY SHIYAYDKLG NLTNNNGIAQ TYNGIGTQPH
ALRSTSQGNF FEYDQAGNMI VRNDASGLYQ QAFDVEQRLY EVIDQHDQTT RFRYDPSGQR
TTTFAADGTV TYDPFPNYQR TTVSSSNSAV DSLNAGVLCS DYNPETKSYG GAGYIMYSEI
PVKQRFGNLP AANISDHFIC VRNNTGVWEY DNDAGFYAFT PIASDLLVAS FNYNATTVSP
YLNQSGAIYG LRYGYTTSNL AFSKDVFGGT NNPGEFEIAG THFKTNAFAQ SVANHGYGVA
CQEDATGTGY LMYSAESVHS RFAEQAPDIN NAAHFICVRH NGQTWQYDNN SAYFAFTPRQ
SDRLIGAIDF SNDSYTSYVG QTGTILGMQK GLSSSNLTIT VNQWNGESNP GEFGIAGLNF
TPQAYEVTIT SAGMGINCLD TATGTGYIMH SRQALNQRFS QSIPAQLASK HFVCVRYNST
LSTWQYDDGS NYYGFTPRAS DTLVASVNFS TDQVTSLAGA SDSEFGITKG FVSGISIVAN
QWGGNSNAGE FQVIGNNLTT HTIDIGSKTV AIAGTPIATR RKHSVATSLV DQSLVFVYVD
KLGSANTLMD QTGTAILNNV RYLPFGEERL GLNSAYSDRG FTGHQENREL GLTYMNARFY
LPSTGRFISA DSMIPEPSNP QSFNRYSYVY NNPINATDPS GHLPGDDEPE IPNPFPESNP
IPNMEYSAYR KWLNFWQAYT DNDNPYIMIK QGDQLVESSV AIKAGSVSLS PDSVSVAGNW
GLVGAEVSMP GAYKKGEGDN FLETLWEGTS AKILIGPQID LIVVEVMPLA LGIDPFTGNI
TLETSAGIGL VEGSTTINPF AKTDAIYVLQ ISDELHDKLM GRDPCALSVS FQDQQAAWTE
LINVVNSYGF NGTDPRWTLH SIPNYVYNGE ENTP
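
The gene and protein listings above are printed in blocks of ten characters, so the spaces have to be removed before the two can be compared. The following is a minimal sketch, not the tool that produced this record. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in the permitted start codons. The helper names `unblock` and `translate` are hypothetical. Only the first and last lines of the gene sequence are used. The last line can be translated on its own because the 7440 bases before it are a multiple of three.

```python
from itertools import product

# Codon assignments shared by the standard code and translation table 11,
# in the conventional T, C, A, G ordering of all three codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def unblock(text):
    """Join a blocked sequence listing into one uppercase string."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate from the first base; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence, copied from the listing above.
first_line = "ATGTCCGTAA ACCGCCCCTA TCGACGATTG ATTCACGTTG GAATGCTCTT GAGCATGCTC"
last_line = "TCAATTCCAA ATTACGTCTA CAATGGCGAG GAAAACACAC CCTGA"

print(translate(unblock(first_line)))  # MSVNRPYRRLIHVGMLLSML
print(translate(unblock(last_line)))   # SIPNYVYNGEENTP*
```

The two outputs match the start (MSVNRPYRRL IHVGMLLSML) and end (SIPNYVYNGE ENTP) of the protein sequence, with the trailing `*` corresponding to the TGA stop codon.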