Gene Haur_2043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2043 
Symbol 
ID5733932 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2550200 
End bp2554693 
Gene Length4494 bp 
Protein Length1497 aa 
Translation table11 
GC content51% 
IMG OID641279187 
Producttetratricopeptide domain-containing protein 
Protein accessionYP_001544814 
Protein GI159898567 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGATC GCGCCCTGCG TCTTGCCCTT GAATCCCTGC TGGATGAACC CACCCTCAAA 
TCAGCGCATC GTGCGTCACT TGAGCGTGGC CTTGCCGATC TCAGAGCGCA GCGCCTCACC
TCCGATGAGC GTGACTATCT TTGCTATCTC ATCGAGCACT ATGGGCAGTC AAACCCATCC
CCTGATCCCG TGGCCGCCTT AATCGCCGCG TTCCAAGACA AGCTACCAAC CTACGATCCC
GCAGCCGTGG CCACGGCAGT TCACCATGCC CTCGCCCAGC AACCCGCCAC GCTCAACGGC
GCTCCGGTCA CAATCACGAT GAACGGCGAT ACCTATGCGC ACCAGGCAAA CTTGGCCATT
GGGAATCAGG TGGGCGGCAA TCTCGTCCAG GTCACGGTCA CGATTCCGAT TCCTGATGTC
CGCTTTGATG CCCTCTTGGC CACCGCCCTC AGAACCCATG AGCACGAAAA AACACGCGAA
CGCCTGATCA ACGCCGCCCA ACAAATCGAA AGTGATCGGC TCCTGAATCT CCGGCTCAAA
GGCTTCGTCG GTCGCGTCAC CGAACTGGCG GCGATCCGCG AGCATATTGA GACCATGCGC
TCCACGGGTG GCTATGTGTT GATCAAGGCA GCGGCAGGTG AAGGCAAGAG CAGTAGCATT
GCCAAGCTGA TTCAGGAAGC GGGGATTGCG CAAACACCCC ATCACTTTAT TGCCCTGACG
ACGGGTCGTG AATATCAATT AGGGTTGTTG CGGGCGGTGG CGGCCCAATT GATTTTGAAA
CACAATCTTA CCGTTTCGTA CTTCCCCGAA GAAAGCTATC CAGCGATGAA GGGTGAGTTT
GTGCGGATTC TTGACGAGCT TTCCAAGCAA GGCATTGAGG AAACGATCTA TCTGGATGGC
CTTGATCAAC TCCAACCAGA GATTGATGGA TCACGCGATT TATCATTCCT CCCGCCGCAA
CCACCCCCAG GCATCGTGAT GGTGCTTGGT TCGAGACCGG ATGAAACCTT AAAACCGCTC
GAGATTCTGC ATCGGGTCGA TTATGACCTG CCGCCACTCA GTGAAGCCGA TGCGCTGGCA
TTGTGGCGAT CCGTCCAGCC TGCTGTGGCG GATGGCCTCC TGCATGACCT CTATGCCGCA
CTGAAGGGCA ATGCGTTGTT CGTCCACTTG GCTGCCAATA CCATGCGTGA TCAATCAGTG
GTCGATGCGA CCAGCCTGAT TCGCCAGATC GAGCAGAATC CCAGCAATCT CTTTGGGATT
ACGCTCGATC GGATTAAGGG TCGATCACGG TCGCAATGGG ATGTGGTCTG GAAGCCAATG
CTGGTGCTCT TGCTCGTGGC CCAAGAACCA TTGCGGCTGG ATGTGGTAGG CGATTTGCTG
GAACATGATC ACGACACGAT GCAGGATGCC GTGAGAGTTT TAGGTGGCTT GGTGAGTCAA
GGGATTGATC AACGAGTCGC GCTGCATCAC CTACTGTTTC GCGATTATTT GGCAGCAGTG
GTCTTTAATG CCCGTGAGGT GAAACGTTGG CAGCAACGGC TAGCAGACTG GTGTTCCGTT
GATTTGGAGA TGATTTGGGG TGATCATTCC GATCCTATTG AACAAGCACG ACGGGTCTAT
GCGCGGTATC ATTACATCAC GCATCTTTCA TTGGCGGAAA ATTGGCCAAC GCTGTGGCAA
GTTTTGGATG CGGGTGACTA TGGCGAACAG AAAACCCGCT TCGATCCGAG TACCCGACTC
TATGCCTTGG ATTTGGATCG CGGGCGTGAG AGTGCCATCA ACGCGGGACA ATCGACCGAG
GAACATCTTC AGAACCTGCC ACGCCTGTGG AAGTATAGTT TGCTGCGAAC AAGTTTAACC
AGTCGCGTTG ATCAGTGGCC GGATGATCTG TTTGTGGTCT TAGTGATGCT TGGACGCACA
CAGGAGGCTT TAGATCGGAT TGATTTATGT ACTGATCATG GGCGGCAAAT ACAGCTTTGG
AAGAAGATTT TGGTATTGTG CGAACCACAA TACCAAAGTA CGATTATTAT ACTTATGCAT
CAGGTAGCCA GTCACCTTTC AGGCATCGAG AAATTTCGAC TCCTCGGATT AATTGCGACA
ACGATTGCAA CTCTTGGCGC TATCGACCAA GCCCATCAAA TTGCCCAATC CATTACCGAT
CTAACCCATC GTTTCGGTGC ATTCAGGGAT ATCGCCATAA CGGCTGACAA ACTTGGCAAT
AGGGAACAAG CGTTAGAGAT ACTCAATCAT GCACTTATCA TTGCCCAATC CATTGACGAT
CCTAAGTCAC GAGCTGAACC TCTCTACGCA ATTGCGACAA CGATTGCAAC CCTTGGCGCT
ATCGACCAAG CCCACCAAAT TACTTGCCAA ATTGATCTTC CTGAGTTGCG TGATGATGCT
CTCGCTGTTG TTGTAACGGT CACTGCCACC TTTGGTGATA TCGACCGTGC TTTAGCCATG
GCCCAACCCA TTGAGGGTCT ATGGCAACAA GCCCATGCCC TCAGATCTAT TGCGATAACC
ACCACCCATG GTAGTATTAG CCGTGCTCTC GTCATTGCCC AATCTATTAA TGACCCTATT
TTCCGTGCTG ATGCATTCGG AACTATTGCG ACAATCGTTG CCACCCTTGG CGATATCAAC
CAAGCCCTCG CCATTGCCCA ATCTATTGAT AGTTGGTCTC ACCGCGTTGA TGCACTCGGA
ACTATTGCGA CAACCGTTGC CACCCTTGGC GATATCAACC AAGCCCTCGC CATTGCCCAA
TCTATTGATA GTTGGTCTCA CCGCAGTCAT GTCTTCAGGA CCATTGCGGC AACCACCAAT
ATCCTTGGCA ATAGGGCAAA AGCAATAAGT ATACTCGACC ATGCCCGCCA ACTTGCCCAA
TCCATTAACG ATCGATGGCA AAAGGCCGAT GCCCTCAAGG CCATTGCAAC AACGGCTGCT
ACCTTTAATG ACCAGGAACA AGCGATAAGG ATACTAGACC AAGCCCTCGC CATTGCCCAA
TCTATCAAAA GTCTTCGTCA GCGTGCCGAT ACCCTCAGGG ATATTGCGAT AATCGCCGCC
ACGATTGGCA GTATCGATCA AGCCTTTGCC ATTGCCCAAT CTATTGACCA TCCTAAACAA
TGTGTTGATA CTCTTGGTGT CATCACTATA ACGGCTACGG CTGCTTATGG TAATGGCCAA
CCTGCAATAA CAATTTTTAA CAAGACCAGA TCCATTATCC AATCCATTAA CTATCCTGAA
CAACGAGCCG ATGCCCTTGG TGTTATTGCA ACCACAACTG CCACCTTCGG CGAAATCAAC
CGTGCGCTAG CCATTGCTCA ATCCATTGAT GATTCTGAAC GACATGCCGA TACACTGAGG
GCTATTGCGA CAACTATCGC CACCTTCGGC GAAATCAACC GTGCGCTAGC CATCGCCCAA
TCCATTAACA ATCGATGGCA ACGTGCTGAT GCCCTCAAGG CCATTGCAAC AACGGCTGCA
GCCTTTGGGG ATATCGACCA TGCACTTGTT ATTACTAAAT TCATTAATGA ATTTTTCCAG
CAGACCAATG CCCTAAGGTC TATTGCAATA ACCACCGCCA CCTTCGGTGA TATCAACCGT
GCGCTAACCA TCGCTCAATC TATTAATGAC CTTGGTCAAC AGGCTGACAC GCTCAAGGTT
ATTGCAACCA TCACTTCAAC CCTTGGTGAT ACTAACCGTG CACTAGCCAT CGCTCAATCT
ATTAATGATC TTGTGCATCG TGTGGATGTT CTCTGTTTCC TCGCGTTAAT CGCCACTAAA
CTCGGCAACA GAGAAAAAGG CACAGATCTA CTCAATTATG CTATTGTTAT TGCTCAATCC
ATTGTTCGCC CCGAGCGATG TGCCGATGCA TTCAAAGTTA TTGCGATAAC CACCGCTACT
CTTGGCGATA TCAGTCAAGC CCTTGCGATT GCTCAATCCA TTGTTCGCCC TGAGCGGCGT
GCCGATGCAT TCAAAGTTAT TGCGATAACC ACCGCTACTC TTGGCGATAT CAGTCAAGCC
CTTGCGATTG CCCAATCCAT TACCCAGCCC GAGCAATGTG TTGATGCACT CGGAACTATT
GCTGCGAGGG CTGCCGATAC TTACGGTAAC GATCAACTTC CAATGGCAAT TCTCGAGAAG
GCTCACCAAA TTGCCCAATC TATTGCCCAG CCTGAGCGAC GTGCCGATGC GCTCGGGACT
ATTGCGACAA CCACCGCTAC CCTTGGCGAT ATCGATCGTG CCATTGCCAT TGTCCAATCC
ATTGTCAGTC CTGATAAACA TGACCATTCT CTTAGGATGA TTGTAAAAAC AGTACGATCA
ATAACAAGAA TTCTCGCTAT CATTCAGAGA ATATGGTTTC ATAGCAAAAC ATCCGGTAGC
ATATGGGGAA TGACCCCAAT TATCGCTCCA CTATTAAATG ACTATCCATG GCTTGGAACA
GCAATTCTGA AGGAAGAGGC ATGGGTCAAT GAGCAACTCA AACGACTGGG GTAA
 
Protein sequence
MDDRALRLAL ESLLDEPTLK SAHRASLERG LADLRAQRLT SDERDYLCYL IEHYGQSNPS 
PDPVAALIAA FQDKLPTYDP AAVATAVHHA LAQQPATLNG APVTITMNGD TYAHQANLAI
GNQVGGNLVQ VTVTIPIPDV RFDALLATAL RTHEHEKTRE RLINAAQQIE SDRLLNLRLK
GFVGRVTELA AIREHIETMR STGGYVLIKA AAGEGKSSSI AKLIQEAGIA QTPHHFIALT
TGREYQLGLL RAVAAQLILK HNLTVSYFPE ESYPAMKGEF VRILDELSKQ GIEETIYLDG
LDQLQPEIDG SRDLSFLPPQ PPPGIVMVLG SRPDETLKPL EILHRVDYDL PPLSEADALA
LWRSVQPAVA DGLLHDLYAA LKGNALFVHL AANTMRDQSV VDATSLIRQI EQNPSNLFGI
TLDRIKGRSR SQWDVVWKPM LVLLLVAQEP LRLDVVGDLL EHDHDTMQDA VRVLGGLVSQ
GIDQRVALHH LLFRDYLAAV VFNAREVKRW QQRLADWCSV DLEMIWGDHS DPIEQARRVY
ARYHYITHLS LAENWPTLWQ VLDAGDYGEQ KTRFDPSTRL YALDLDRGRE SAINAGQSTE
EHLQNLPRLW KYSLLRTSLT SRVDQWPDDL FVVLVMLGRT QEALDRIDLC TDHGRQIQLW
KKILVLCEPQ YQSTIIILMH QVASHLSGIE KFRLLGLIAT TIATLGAIDQ AHQIAQSITD
LTHRFGAFRD IAITADKLGN REQALEILNH ALIIAQSIDD PKSRAEPLYA IATTIATLGA
IDQAHQITCQ IDLPELRDDA LAVVVTVTAT FGDIDRALAM AQPIEGLWQQ AHALRSIAIT
TTHGSISRAL VIAQSINDPI FRADAFGTIA TIVATLGDIN QALAIAQSID SWSHRVDALG
TIATTVATLG DINQALAIAQ SIDSWSHRSH VFRTIAATTN ILGNRAKAIS ILDHARQLAQ
SINDRWQKAD ALKAIATTAA TFNDQEQAIR ILDQALAIAQ SIKSLRQRAD TLRDIAIIAA
TIGSIDQAFA IAQSIDHPKQ CVDTLGVITI TATAAYGNGQ PAITIFNKTR SIIQSINYPE
QRADALGVIA TTTATFGEIN RALAIAQSID DSERHADTLR AIATTIATFG EINRALAIAQ
SINNRWQRAD ALKAIATTAA AFGDIDHALV ITKFINEFFQ QTNALRSIAI TTATFGDINR
ALTIAQSIND LGQQADTLKV IATITSTLGD TNRALAIAQS INDLVHRVDV LCFLALIATK
LGNREKGTDL LNYAIVIAQS IVRPERCADA FKVIAITTAT LGDISQALAI AQSIVRPERR
ADAFKVIAIT TATLGDISQA LAIAQSITQP EQCVDALGTI AARAADTYGN DQLPMAILEK
AHQIAQSIAQ PERRADALGT IATTTATLGD IDRAIAIVQS IVSPDKHDHS LRMIVKTVRS
ITRILAIIQR IWFHSKTSGS IWGMTPIIAP LLNDYPWLGT AILKEEAWVN EQLKRLG