Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2254 |
Symbol | |
ID | 5734141 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2876027 |
End bp | 2880355 |
Gene Length | 4329 bp |
Protein Length | 1442 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641279395 |
Product | hypothetical protein |
Protein accession | YP_001545022 |
Protein GI | 159898775 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR00756] pentatricopeptide repeat domain (PPR motif) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGACGA CCCCAAACAC TCAGGTTTTG CATGAAGCCG CCAACCAGCG TGAAAGTGCC CGTTTGTTGC AATTGCGGCT CACTGGCTTC GTTGGTCGTC AAGCCGAACA AACAGCGATT CGCGGATTAA TCGACCAAAC CCGCCCAAGC GGTGGCTATG TGTTGGTGAC GGGTGAGGCT GGGGCGGGCA AAAGTAGTCT GCTCGCCCAA CTAATTGTGA ATGCTGGGCT AGACCAAACC CCGCAGCATT TTATTGCGCT GACTCCAGGC CGCGCCTATC AACTCGACGT GTTGCGTAGC ATCGTTGCTC AACTGCTGCT TAAACATGAT CTGGCGAGCA ACTATTTTCC TGCCGATAGC TACCCCGCCT TGCGCCTCGA ATTTGGTCAG TTGTTACAAA CCCTCTCGGC GCGTGGCATC AGTGAAACGA TCTATCTCGA TGGACTGGAT CAATTGCAGC CTGAGGTGGA TGGAACCCGC GATCTCAGCT TTTTACCCTT GCAGCTGCCG CCTGGCATCG TGATAGTGCT TGGCTCGCGT CCCAACGAGA CGATCGATAG TTTAGCACTT GAGCATGGGG TCGTTTATCA GGTTCCACCG TTGCACGAAC AGGATGCGAT TGGGCGTTGG CAGCAGGTGC AGCCGACGCT GGAGCCAGCG TTGTTGCACG GTTTAGCGCA AGCCGTCAAG GGCAATGCCT TGTTGATCGA ACTGGCGGCC AATGTGCTGC GCCACACTTC GACTAGTGAA ATGCTTGCAT TACTCGACCA CGCCAGCGCT GACGCAACCA ATCTCTTTCG GCTGAGCCTT GGGCGGATCG AACAAGCAGC GCCGCGCCAC TGGCAACCGC TGATTCGCCC GTTGTTGGCG GTGTTGTTGA TAACCCAAGA ACCGCTTGAG CCAGCGGTGC TTGCGGCGAT TATTGAACGA CCAACCGCTA CAGTGGTCGA GGCGCTGACC CTGATGAGCG ATTGGGTGAG TGTTGCCGCC GATCAGCGCG TGGCGCTACG CCATTTGTTG TTTCACGATT TTCTGATCCA ACACGAATTT ACCCAGCCAG AATTGCAAGT GTGGCATGGA CGCATGACGC AATGGTGTGG CGCAGCACTT GACCAGATTT GGCACGATAG TACAGAATCC GTTGAACAGG CACGGCGCTG GTATGCACGC CAGCATTACA TCACCCATTT GGATTGCGCA GAACAATGGG AAGCATTGTG GCAAGTGATC GATGCGGGCG ATTATGGCGA GCACAAAGTG CGGTTTGAGC CAAGCACGCG CTTGTATGGC TTGGATTTGG ATCGGGCCCG CGAGAGCGTA ATTGCCGCTG GCCAGAGCAT CGAACAGCAG CTTGAATTAT TGCCACGCTT GTGGCGCTAT AGCCTGCTGC GCACCAGCCT CACGGCCCAT GCCGATCAAT GGCATGATGA TGTATTTGTG ATTTTGGCGA TGCTTGGGCG GGTATCTGAG GCGCTTGCCC AGATTGAAAT TTGCTCGGAT CAAGTACGTC AAGTGCTGTT ATGGTCACGT GTAGTTGCTT ATACAGAGCC TGAGCTGCGT TTGCATATTT TTCAGCGCAT GGAACAAGTT GCGCGAAGCT TGCATGAGTC TGAAGAGCGT GATTATGCGT TGCATCTCGT TGCAATGGCC TATGCTGACC ATGGCTTGCT AAATATGGCC TACCCGATTG CGATTAGCCT GGGGAATACT CGTGATGAAA CGCTTGCTTA TTTGGTTGAC GTGGTGATTA AACAGCATGA TTTGGCAAGA GTTAAGCTGA TCATTGGACA AATTCAAACT CCAAAATATC GAATTAAGAG TTCTATGCTG CTAGCTAATG CCCTGATTGA AGAAACTGAA TTTATCGAGG CTCGACAACT ATTAATTGAA ACGTTACCAT TTGCTCAAAA CGAGCAAGTA GTTGAAATTA AAAGCTTACT TGCAACAATT GCATGGCGGT TGGGTGATCA TCAACAGGCC GATATATTAC TAGCTGAAGC TCGATCCATG CACAAATATT TTGCTGATGA TGCAAAAATT GCAGCACTTT TGGCAATAAT TAAAGGGTAC TTGGCACAAG GGAATTTAGC ACAAGCCTAT AATTTGCATG ATGAGATCAA ATTAAATCGA TTTCGTCGGG AATTAGTTGA TATCTATATT AATCGTGATG ATATTGCAAT CGCGGTTGAG CTTGCGGCTA CCATTACCCA CTGGCACTCC AGTGATCTGG CTTATGCCGC ACTAGTTGCC TGGTATTGTA AGGAAGCTGA TTTCTCGAAG GCCGAGCAAG CGCTTGGATT AATCAAAGCG CCTGATCAAC AGATTAAAAG CTATTGTTTA TTGGCAAATA GCCATGCCGA TCGATTTCAA TGGATGCGGC TGCTGGAGTC GGCCCAGCTG TCTTTGAGTT CGGTTATAAG CTCGATCTCT TTAGCAAAAT GCTGGCTGCA ACTTGCCGAT GCATATGCTT TTCAGCATGC GCATGACCGT GCTCAGTCTA TGTTTGAGCA TGCGTTGACA GCGATTTTGG CCACATCAAA CTCGTTTTTG AATGACCAAA TGTATGATTT ATTGCAGCTT GCTCATTTTG CTAAACGCTA TAACTATGAT GATCTCTGTC AGCGAGTGAT CTACACCACG TTTTTAGTTG GTAAACACGA GGATTTTGAG TATTCTTTGC CGTTTGAGCA TGCTATGCTC CATCTTAATC ATGGTGAGAT AGACCAAGTT CGCCAGATTA TTGACACAAG TGTTGGACCA TATGTGGCAG TTAACTTATT ACAAATACTT ATAGCTGAAT CAATTAAGCA ACAGGATCAT CCTCAAGCTC AACTATACTT GTTCGAGGCG TTGAACCATG CACGAAAGCT TGAAAATCCT AGCTATCGGG TGAGTCTCCT GGGTGAACTT GCTGATATAG CCTTAGCTAG TGGGTTTGAG CTACTCGCTA AAACAATTCT CGGCGAAGCT ACGCAACTGC TGCCCGTAAT CACTGCTGAA AACGAGCAGC GTTGGGCTGG GGTGGGCCTT GTTCGTCGCT ATTATAGCCA TGGGATGCTA GCGAGTGCGG CTACGATAAC CCAGGTAATG ACCGTTTCGC AAACACACGA TCACATTATG GTAGAAATAA GTCTTTGTTA TGCTGATAGT GGACAGCTAG CCCAAGCTTA TGCAACGCTG AAGGTTATCA ATACTCAAAC TGAAGTGTAT GCACGGAATT TGTGCCAGAT TATTATTAAA GCGCATGAGC ATGGTTTAGC TACGCTTGCA GCAGAGTATT ATGATGAGTT GATTGAAGCA TGGAGTGTAA TTGCCGATCC GATTCGCTTG CTTAAGGATC TAAAGGATCT GGCGATTGCC CAGATTAACT ATGGCTCCAA TCAGTATCTC CCCAGCCTAT TGGAGGCCAT CCGCACTATT CGACATCCAG CCTTGGCTGA GTATCAGTAT GTGGAAGTGC TTTGTGAAAT TGCCAGAGCC TATATCAAAC AAGCCAATTA CCCAGATTTT GCCGATTGGT TAGCGTATGC CCATTCCATT GCTCAATCGA TTGTGATTGA GTCGCGTCCC AAAGTTGCTG CTTATCATTA TGTTGCGATC ACATATCTTT GCCATGCTAC CGATTCAGAT ACAGAGATAT TTTTGGCTGA TATGCTTCGT TTAGCCAATG GTATTGCACC GAGTAGCTAT GCAAATGATC TATTTAATGC ATTAGCCAAT AGCTGTGCGT CCTATGCGGT GCGTGGGCAT CCAGATTTTT TTGCCAAAGC CTATCAGTTT GCTATGGCTA TTTCAGTGCC GTGGCAGCGT GCCCAAGCCC TAAGAAGCGT CGCCAATGGC TATGCTAAAG TTGATGACCG CGTGATGGTA GAAATGATTA TTGCTGAAAT AAGCCGACTT TCGCCTAATT ACTTAAGTCT TGATGCTGTC GCTTTGATCT ATGCGCAACG AGGCGATTTG GCTTTTGCTC AAACGCTGAT TGCCAATGAT GAAGCATCTG AAGAACGAGA TGCTGTCTTG GATTATCTGA TTCCAGCTTT GCTGCAAACC GATGCTGTCG TTGCTGCATA TCAGATCTCG CATGGGTTTA CTAGGCTGAC GAAACGGATT AAGTTCTTGC ACCAAATCGT TAACTACTAC GTTGAGCGTG GGCAGATTGC CGAAAGTATT CAGATTATTC AAGCCGCATG GCGTAATTGT GGTGCTGCTG CTGATCTATG GGAGCTGCGG ACAATCGTCT TGCCGTTTGA TTCAACCCAT CCTTGGCTTG GCACTGCCGT GCTCGATAGC GTGCCATGGG TTGAGCAGCA ATTAGCCCGA TTGAATTAA
|
Protein sequence | MMTTPNTQVL HEAANQRESA RLLQLRLTGF VGRQAEQTAI RGLIDQTRPS GGYVLVTGEA GAGKSSLLAQ LIVNAGLDQT PQHFIALTPG RAYQLDVLRS IVAQLLLKHD LASNYFPADS YPALRLEFGQ LLQTLSARGI SETIYLDGLD QLQPEVDGTR DLSFLPLQLP PGIVIVLGSR PNETIDSLAL EHGVVYQVPP LHEQDAIGRW QQVQPTLEPA LLHGLAQAVK GNALLIELAA NVLRHTSTSE MLALLDHASA DATNLFRLSL GRIEQAAPRH WQPLIRPLLA VLLITQEPLE PAVLAAIIER PTATVVEALT LMSDWVSVAA DQRVALRHLL FHDFLIQHEF TQPELQVWHG RMTQWCGAAL DQIWHDSTES VEQARRWYAR QHYITHLDCA EQWEALWQVI DAGDYGEHKV RFEPSTRLYG LDLDRARESV IAAGQSIEQQ LELLPRLWRY SLLRTSLTAH ADQWHDDVFV ILAMLGRVSE ALAQIEICSD QVRQVLLWSR VVAYTEPELR LHIFQRMEQV ARSLHESEER DYALHLVAMA YADHGLLNMA YPIAISLGNT RDETLAYLVD VVIKQHDLAR VKLIIGQIQT PKYRIKSSML LANALIEETE FIEARQLLIE TLPFAQNEQV VEIKSLLATI AWRLGDHQQA DILLAEARSM HKYFADDAKI AALLAIIKGY LAQGNLAQAY NLHDEIKLNR FRRELVDIYI NRDDIAIAVE LAATITHWHS SDLAYAALVA WYCKEADFSK AEQALGLIKA PDQQIKSYCL LANSHADRFQ WMRLLESAQL SLSSVISSIS LAKCWLQLAD AYAFQHAHDR AQSMFEHALT AILATSNSFL NDQMYDLLQL AHFAKRYNYD DLCQRVIYTT FLVGKHEDFE YSLPFEHAML HLNHGEIDQV RQIIDTSVGP YVAVNLLQIL IAESIKQQDH PQAQLYLFEA LNHARKLENP SYRVSLLGEL ADIALASGFE LLAKTILGEA TQLLPVITAE NEQRWAGVGL VRRYYSHGML ASAATITQVM TVSQTHDHIM VEISLCYADS GQLAQAYATL KVINTQTEVY ARNLCQIIIK AHEHGLATLA AEYYDELIEA WSVIADPIRL LKDLKDLAIA QINYGSNQYL PSLLEAIRTI RHPALAEYQY VEVLCEIARA YIKQANYPDF ADWLAYAHSI AQSIVIESRP KVAAYHYVAI TYLCHATDSD TEIFLADMLR LANGIAPSSY ANDLFNALAN SCASYAVRGH PDFFAKAYQF AMAISVPWQR AQALRSVANG YAKVDDRVMV EMIIAEISRL SPNYLSLDAV ALIYAQRGDL AFAQTLIAND EASEERDAVL DYLIPALLQT DAVVAAYQIS HGFTRLTKRI KFLHQIVNYY VERGQIAESI QIIQAAWRNC GAAADLWELR TIVLPFDSTH PWLGTAVLDS VPWVEQQLAR LN
|
| |