Gene Information |
Locus tag | Haur_2043 |
Symbol | |
ID | 5733932 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2550200 |
End bp | 2554693 |
Gene Length | 4494 bp |
Protein Length | 1497 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641279187 |
Product | tetratricopeptide domain-containing protein |
Protein accession | YP_001544814 |
Protein GI | 159898567 |
COG category | |
COG ID | |
TIGRFAM ID | |
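The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the record uses 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumption: the CDS length is a multiple of 3 and, when has_stop_codon
    is True, the final codon is a stop codon that encodes no residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Haur_2043.
length_bp = gene_length(2550200, 2554693)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values from this record, the computed lengths match the listed Gene Length (4494 bp) and Protein Length (1497 aa).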
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGATGATC GCGCCCTGCG TCTTGCCCTT GAATCCCTGC TGGATGAACC CACCCTCAAA TCAGCGCATC GTGCGTCACT TGAGCGTGGC CTTGCCGATC TCAGAGCGCA GCGCCTCACC TCCGATGAGC GTGACTATCT TTGCTATCTC ATCGAGCACT ATGGGCAGTC AAACCCATCC CCTGATCCCG TGGCCGCCTT AATCGCCGCG TTCCAAGACA AGCTACCAAC CTACGATCCC GCAGCCGTGG CCACGGCAGT TCACCATGCC CTCGCCCAGC AACCCGCCAC GCTCAACGGC GCTCCGGTCA CAATCACGAT GAACGGCGAT ACCTATGCGC ACCAGGCAAA CTTGGCCATT GGGAATCAGG TGGGCGGCAA TCTCGTCCAG GTCACGGTCA CGATTCCGAT TCCTGATGTC CGCTTTGATG CCCTCTTGGC CACCGCCCTC AGAACCCATG AGCACGAAAA AACACGCGAA CGCCTGATCA ACGCCGCCCA ACAAATCGAA AGTGATCGGC TCCTGAATCT CCGGCTCAAA GGCTTCGTCG GTCGCGTCAC CGAACTGGCG GCGATCCGCG AGCATATTGA GACCATGCGC TCCACGGGTG GCTATGTGTT GATCAAGGCA GCGGCAGGTG AAGGCAAGAG CAGTAGCATT GCCAAGCTGA TTCAGGAAGC GGGGATTGCG CAAACACCCC ATCACTTTAT TGCCCTGACG ACGGGTCGTG AATATCAATT AGGGTTGTTG CGGGCGGTGG CGGCCCAATT GATTTTGAAA CACAATCTTA CCGTTTCGTA CTTCCCCGAA GAAAGCTATC CAGCGATGAA GGGTGAGTTT GTGCGGATTC TTGACGAGCT TTCCAAGCAA GGCATTGAGG AAACGATCTA TCTGGATGGC CTTGATCAAC TCCAACCAGA GATTGATGGA TCACGCGATT TATCATTCCT CCCGCCGCAA CCACCCCCAG GCATCGTGAT GGTGCTTGGT TCGAGACCGG ATGAAACCTT AAAACCGCTC GAGATTCTGC ATCGGGTCGA TTATGACCTG CCGCCACTCA GTGAAGCCGA TGCGCTGGCA TTGTGGCGAT CCGTCCAGCC TGCTGTGGCG GATGGCCTCC TGCATGACCT CTATGCCGCA CTGAAGGGCA ATGCGTTGTT CGTCCACTTG GCTGCCAATA CCATGCGTGA TCAATCAGTG GTCGATGCGA CCAGCCTGAT TCGCCAGATC GAGCAGAATC CCAGCAATCT CTTTGGGATT ACGCTCGATC GGATTAAGGG TCGATCACGG TCGCAATGGG ATGTGGTCTG GAAGCCAATG CTGGTGCTCT TGCTCGTGGC CCAAGAACCA TTGCGGCTGG ATGTGGTAGG CGATTTGCTG GAACATGATC ACGACACGAT GCAGGATGCC GTGAGAGTTT TAGGTGGCTT GGTGAGTCAA GGGATTGATC AACGAGTCGC GCTGCATCAC CTACTGTTTC GCGATTATTT GGCAGCAGTG GTCTTTAATG CCCGTGAGGT GAAACGTTGG CAGCAACGGC TAGCAGACTG GTGTTCCGTT GATTTGGAGA TGATTTGGGG TGATCATTCC GATCCTATTG AACAAGCACG ACGGGTCTAT GCGCGGTATC ATTACATCAC GCATCTTTCA TTGGCGGAAA ATTGGCCAAC GCTGTGGCAA GTTTTGGATG CGGGTGACTA TGGCGAACAG AAAACCCGCT TCGATCCGAG TACCCGACTC TATGCCTTGG ATTTGGATCG CGGGCGTGAG AGTGCCATCA ACGCGGGACA ATCGACCGAG 
GAACATCTTC AGAACCTGCC ACGCCTGTGG AAGTATAGTT TGCTGCGAAC AAGTTTAACC AGTCGCGTTG ATCAGTGGCC GGATGATCTG TTTGTGGTCT TAGTGATGCT TGGACGCACA CAGGAGGCTT TAGATCGGAT TGATTTATGT ACTGATCATG GGCGGCAAAT ACAGCTTTGG AAGAAGATTT TGGTATTGTG CGAACCACAA TACCAAAGTA CGATTATTAT ACTTATGCAT CAGGTAGCCA GTCACCTTTC AGGCATCGAG AAATTTCGAC TCCTCGGATT AATTGCGACA ACGATTGCAA CTCTTGGCGC TATCGACCAA GCCCATCAAA TTGCCCAATC CATTACCGAT CTAACCCATC GTTTCGGTGC ATTCAGGGAT ATCGCCATAA CGGCTGACAA ACTTGGCAAT AGGGAACAAG CGTTAGAGAT ACTCAATCAT GCACTTATCA TTGCCCAATC CATTGACGAT CCTAAGTCAC GAGCTGAACC TCTCTACGCA ATTGCGACAA CGATTGCAAC CCTTGGCGCT ATCGACCAAG CCCACCAAAT TACTTGCCAA ATTGATCTTC CTGAGTTGCG TGATGATGCT CTCGCTGTTG TTGTAACGGT CACTGCCACC TTTGGTGATA TCGACCGTGC TTTAGCCATG GCCCAACCCA TTGAGGGTCT ATGGCAACAA GCCCATGCCC TCAGATCTAT TGCGATAACC ACCACCCATG GTAGTATTAG CCGTGCTCTC GTCATTGCCC AATCTATTAA TGACCCTATT TTCCGTGCTG ATGCATTCGG AACTATTGCG ACAATCGTTG CCACCCTTGG CGATATCAAC CAAGCCCTCG CCATTGCCCA ATCTATTGAT AGTTGGTCTC ACCGCGTTGA TGCACTCGGA ACTATTGCGA CAACCGTTGC CACCCTTGGC GATATCAACC AAGCCCTCGC CATTGCCCAA TCTATTGATA GTTGGTCTCA CCGCAGTCAT GTCTTCAGGA CCATTGCGGC AACCACCAAT ATCCTTGGCA ATAGGGCAAA AGCAATAAGT ATACTCGACC ATGCCCGCCA ACTTGCCCAA TCCATTAACG ATCGATGGCA AAAGGCCGAT GCCCTCAAGG CCATTGCAAC AACGGCTGCT ACCTTTAATG ACCAGGAACA AGCGATAAGG ATACTAGACC AAGCCCTCGC CATTGCCCAA TCTATCAAAA GTCTTCGTCA GCGTGCCGAT ACCCTCAGGG ATATTGCGAT AATCGCCGCC ACGATTGGCA GTATCGATCA AGCCTTTGCC ATTGCCCAAT CTATTGACCA TCCTAAACAA TGTGTTGATA CTCTTGGTGT CATCACTATA ACGGCTACGG CTGCTTATGG TAATGGCCAA CCTGCAATAA CAATTTTTAA CAAGACCAGA TCCATTATCC AATCCATTAA CTATCCTGAA CAACGAGCCG ATGCCCTTGG TGTTATTGCA ACCACAACTG CCACCTTCGG CGAAATCAAC CGTGCGCTAG CCATTGCTCA ATCCATTGAT GATTCTGAAC GACATGCCGA TACACTGAGG GCTATTGCGA CAACTATCGC CACCTTCGGC GAAATCAACC GTGCGCTAGC CATCGCCCAA TCCATTAACA ATCGATGGCA ACGTGCTGAT GCCCTCAAGG CCATTGCAAC AACGGCTGCA GCCTTTGGGG ATATCGACCA TGCACTTGTT ATTACTAAAT TCATTAATGA ATTTTTCCAG CAGACCAATG CCCTAAGGTC TATTGCAATA ACCACCGCCA CCTTCGGTGA TATCAACCGT GCGCTAACCA 
TCGCTCAATC TATTAATGAC CTTGGTCAAC AGGCTGACAC GCTCAAGGTT ATTGCAACCA TCACTTCAAC CCTTGGTGAT ACTAACCGTG CACTAGCCAT CGCTCAATCT ATTAATGATC TTGTGCATCG TGTGGATGTT CTCTGTTTCC TCGCGTTAAT CGCCACTAAA CTCGGCAACA GAGAAAAAGG CACAGATCTA CTCAATTATG CTATTGTTAT TGCTCAATCC ATTGTTCGCC CCGAGCGATG TGCCGATGCA TTCAAAGTTA TTGCGATAAC CACCGCTACT CTTGGCGATA TCAGTCAAGC CCTTGCGATT GCTCAATCCA TTGTTCGCCC TGAGCGGCGT GCCGATGCAT TCAAAGTTAT TGCGATAACC ACCGCTACTC TTGGCGATAT CAGTCAAGCC CTTGCGATTG CCCAATCCAT TACCCAGCCC GAGCAATGTG TTGATGCACT CGGAACTATT GCTGCGAGGG CTGCCGATAC TTACGGTAAC GATCAACTTC CAATGGCAAT TCTCGAGAAG GCTCACCAAA TTGCCCAATC TATTGCCCAG CCTGAGCGAC GTGCCGATGC GCTCGGGACT ATTGCGACAA CCACCGCTAC CCTTGGCGAT ATCGATCGTG CCATTGCCAT TGTCCAATCC ATTGTCAGTC CTGATAAACA TGACCATTCT CTTAGGATGA TTGTAAAAAC AGTACGATCA ATAACAAGAA TTCTCGCTAT CATTCAGAGA ATATGGTTTC ATAGCAAAAC ATCCGGTAGC ATATGGGGAA TGACCCCAAT TATCGCTCCA CTATTAAATG ACTATCCATG GCTTGGAACA GCAATTCTGA AGGAAGAGGC ATGGGTCAAT GAGCAACTCA AACGACTGGG GTAA
Protein sequence | MDDRALRLAL ESLLDEPTLK SAHRASLERG LADLRAQRLT SDERDYLCYL IEHYGQSNPS PDPVAALIAA FQDKLPTYDP AAVATAVHHA LAQQPATLNG APVTITMNGD TYAHQANLAI GNQVGGNLVQ VTVTIPIPDV RFDALLATAL RTHEHEKTRE RLINAAQQIE SDRLLNLRLK GFVGRVTELA AIREHIETMR STGGYVLIKA AAGEGKSSSI AKLIQEAGIA QTPHHFIALT TGREYQLGLL RAVAAQLILK HNLTVSYFPE ESYPAMKGEF VRILDELSKQ GIEETIYLDG LDQLQPEIDG SRDLSFLPPQ PPPGIVMVLG SRPDETLKPL EILHRVDYDL PPLSEADALA LWRSVQPAVA DGLLHDLYAA LKGNALFVHL AANTMRDQSV VDATSLIRQI EQNPSNLFGI TLDRIKGRSR SQWDVVWKPM LVLLLVAQEP LRLDVVGDLL EHDHDTMQDA VRVLGGLVSQ GIDQRVALHH LLFRDYLAAV VFNAREVKRW QQRLADWCSV DLEMIWGDHS DPIEQARRVY ARYHYITHLS LAENWPTLWQ VLDAGDYGEQ KTRFDPSTRL YALDLDRGRE SAINAGQSTE EHLQNLPRLW KYSLLRTSLT SRVDQWPDDL FVVLVMLGRT QEALDRIDLC TDHGRQIQLW KKILVLCEPQ YQSTIIILMH QVASHLSGIE KFRLLGLIAT TIATLGAIDQ AHQIAQSITD LTHRFGAFRD IAITADKLGN REQALEILNH ALIIAQSIDD PKSRAEPLYA IATTIATLGA IDQAHQITCQ IDLPELRDDA LAVVVTVTAT FGDIDRALAM AQPIEGLWQQ AHALRSIAIT TTHGSISRAL VIAQSINDPI FRADAFGTIA TIVATLGDIN QALAIAQSID SWSHRVDALG TIATTVATLG DINQALAIAQ SIDSWSHRSH VFRTIAATTN ILGNRAKAIS ILDHARQLAQ SINDRWQKAD ALKAIATTAA TFNDQEQAIR ILDQALAIAQ SIKSLRQRAD TLRDIAIIAA TIGSIDQAFA IAQSIDHPKQ CVDTLGVITI TATAAYGNGQ PAITIFNKTR SIIQSINYPE QRADALGVIA TTTATFGEIN RALAIAQSID DSERHADTLR AIATTIATFG EINRALAIAQ SINNRWQRAD ALKAIATTAA AFGDIDHALV ITKFINEFFQ QTNALRSIAI TTATFGDINR ALTIAQSIND LGQQADTLKV IATITSTLGD TNRALAIAQS INDLVHRVDV LCFLALIATK LGNREKGTDL LNYAIVIAQS IVRPERCADA FKVIAITTAT LGDISQALAI AQSIVRPERR ADAFKVIAIT TATLGDISQA LAIAQSITQP EQCVDALGTI AARAADTYGN DQLPMAILEK AHQIAQSIAQ PERRADALGT IATTTATLGD IDRAIAIVQS IVSPDKHDHS LRMIVKTVRS ITRILAIIQR IWFHSKTSGS IWGMTPIIAP LLNDYPWLGT AILKEEAWVN EQLKRLG