Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2400 |
Symbol | |
ID | 5734281 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3058052 |
End bp | 3059386 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641279541 |
Product | ThiJ/PfpI domain-containing protein |
Protein accession | YP_001545168 |
Protein GI | 159898921 |
COG category | [K] Transcription |
COG ID | [COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.188328 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTAAAAC GTCAGCTTAC GCGCCTCGGT TATTTGCTGC TAGCTTGCCT ACCGTTATTG TTAGCCGCCA GCATTGGCAG CTATTATTCA ATGCAGGTGG CGATGAGTAT TCGCAGCGAT AAACCCAACA TTCAATCACA GCCAATAGTA TATGATGCCC AAAAACCAAC CGCCGTAATT CTGCTTGGCA ATACGGTTAG CGAAATTACC GATGTGCTCG CACCCTATGC CTTATTAGCC AAAACAGGCT TGTATAATGT CTATACAGTG GCCGAAACAA GCAGCGTGCG CAGCCTCAGC GGTGGGCTTG ATTTGCTGCC TGATTATTCT TTTGCAGGCC TTGCAACGTT GCTCAAGCAA CCACCAGCGC TAGTGATTGT GCCTGCAATT ACTGAAATTC AGGCCAGCCA AAATCAACCA GTTTTGGCAT GGCTGCGCCA ACAATCCCAA GCAGCCAGCA CGGTGATGTC ATGGTGTACT GGGGCTGAGG TTTTAGCTGA AAGCGGATTG CTCGATGGCT TGCCAGCCAC CGCCCACTGG GCCGATCTGA GTAGTTTACA AAAGCGCTAT CCCAAGGTTA AATGGCAAAA TAATCAACGT TATGTCGATA TCAATCAACA GATAATCACC ACAGCCGGAT TAACCTCGGG GATTGATGCA ACCCTGTATT TCTTGCAAAA AATGCATGGT GCAGATGTAA GTCAGCAGTT GGCCGACATG ATCAATTATT CAGACCAAAG CTACCTTGAA CATGCTACAA TGCAGCCTTT CAGCATAACC CCAAGCGATA GTGTTTATCT GCTGAATGCG GCATTCTATT GGCCCAAGCA AACGCTGGGC ATTTGGCTCA GCCAAGGGGT TGACGAGCTA GCGTTAGCAG CCTTTTTCGA TGTTTATACA GGTTCGTGGG TTTACGATTT TCGGACAATT GGCGCAGAGC CAAACATTCG CTCAGCCCAT GGCTTACAAC TGATCCCACG CTATCAAGCA GCTACGCACA TCGATCGTTT GGTTGGGTTT GGCCCGAATC AACAGGCTCA AACCTGGGCC GAGCAGCAGC AAATGACCTA TCACGAACTT GATTTAGCAC AACAAGGCAA TATGTTCGAG CAGGCGCTTG TCCGATTTGC GATAGATCAA GACCAAGCTA GTGCTCAATT TGCCGCCAAA CGCATGGAAT ATCGCCAACC ATTAACTTTG CATGGGGCAA GTTGGCCGTG GCGTACATTG ATAGGAATTG GCGTTTGGTT AGGGGTTGGA ATTGGGGCGT GCTATGGGTT GCGGCGGATC AGCAACAAGC GCAAATCCGC CGCTACGAAC GAAAACTTAG GCTAA
|
Protein sequence | MLKRQLTRLG YLLLACLPLL LAASIGSYYS MQVAMSIRSD KPNIQSQPIV YDAQKPTAVI LLGNTVSEIT DVLAPYALLA KTGLYNVYTV AETSSVRSLS GGLDLLPDYS FAGLATLLKQ PPALVIVPAI TEIQASQNQP VLAWLRQQSQ AASTVMSWCT GAEVLAESGL LDGLPATAHW ADLSSLQKRY PKVKWQNNQR YVDINQQIIT TAGLTSGIDA TLYFLQKMHG ADVSQQLADM INYSDQSYLE HATMQPFSIT PSDSVYLLNA AFYWPKQTLG IWLSQGVDEL ALAAFFDVYT GSWVYDFRTI GAEPNIRSAH GLQLIPRYQA ATHIDRLVGF GPNQQAQTWA EQQQMTYHEL DLAQQGNMFE QALVRFAIDQ DQASAQFAAK RMEYRQPLTL HGASWPWRTL IGIGVWLGVG IGACYGLRRI SNKRKSAATN ENLG
|
| |