Gene Haur_2400 details


Gene Information

Locus tag: Haur_2400
Symbol:
ID: 5734281
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3058052
End bp: 3059386
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 49%
IMG OID: 641279541
Product: ThiJ/PfpI domain-containing protein
Protein accession: YP_001545168
Protein GI: 159898921
COG category: [K] Transcription
COG ID: [COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain
TIGRFAM ID:
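
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the gene length includes one terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields.
START_BP = 3058052
END_BP = 3059386
GENE_LENGTH_BP = 1335
PROTEIN_LENGTH_AA = 444


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming the CDS ends in a single stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons hold for this record under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3059386 - 3058052 + 1 gives 1335 bp, and 1335 / 3 - 1 gives 444 aa, matching the listed lengths.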


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.188328
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTAAAAC GTCAGCTTAC GCGCCTCGGT TATTTGCTGC TAGCTTGCCT ACCGTTATTG 
TTAGCCGCCA GCATTGGCAG CTATTATTCA ATGCAGGTGG CGATGAGTAT TCGCAGCGAT
AAACCCAACA TTCAATCACA GCCAATAGTA TATGATGCCC AAAAACCAAC CGCCGTAATT
CTGCTTGGCA ATACGGTTAG CGAAATTACC GATGTGCTCG CACCCTATGC CTTATTAGCC
AAAACAGGCT TGTATAATGT CTATACAGTG GCCGAAACAA GCAGCGTGCG CAGCCTCAGC
GGTGGGCTTG ATTTGCTGCC TGATTATTCT TTTGCAGGCC TTGCAACGTT GCTCAAGCAA
CCACCAGCGC TAGTGATTGT GCCTGCAATT ACTGAAATTC AGGCCAGCCA AAATCAACCA
GTTTTGGCAT GGCTGCGCCA ACAATCCCAA GCAGCCAGCA CGGTGATGTC ATGGTGTACT
GGGGCTGAGG TTTTAGCTGA AAGCGGATTG CTCGATGGCT TGCCAGCCAC CGCCCACTGG
GCCGATCTGA GTAGTTTACA AAAGCGCTAT CCCAAGGTTA AATGGCAAAA TAATCAACGT
TATGTCGATA TCAATCAACA GATAATCACC ACAGCCGGAT TAACCTCGGG GATTGATGCA
ACCCTGTATT TCTTGCAAAA AATGCATGGT GCAGATGTAA GTCAGCAGTT GGCCGACATG
ATCAATTATT CAGACCAAAG CTACCTTGAA CATGCTACAA TGCAGCCTTT CAGCATAACC
CCAAGCGATA GTGTTTATCT GCTGAATGCG GCATTCTATT GGCCCAAGCA AACGCTGGGC
ATTTGGCTCA GCCAAGGGGT TGACGAGCTA GCGTTAGCAG CCTTTTTCGA TGTTTATACA
GGTTCGTGGG TTTACGATTT TCGGACAATT GGCGCAGAGC CAAACATTCG CTCAGCCCAT
GGCTTACAAC TGATCCCACG CTATCAAGCA GCTACGCACA TCGATCGTTT GGTTGGGTTT
GGCCCGAATC AACAGGCTCA AACCTGGGCC GAGCAGCAGC AAATGACCTA TCACGAACTT
GATTTAGCAC AACAAGGCAA TATGTTCGAG CAGGCGCTTG TCCGATTTGC GATAGATCAA
GACCAAGCTA GTGCTCAATT TGCCGCCAAA CGCATGGAAT ATCGCCAACC ATTAACTTTG
CATGGGGCAA GTTGGCCGTG GCGTACATTG ATAGGAATTG GCGTTTGGTT AGGGGTTGGA
ATTGGGGCGT GCTATGGGTT GCGGCGGATC AGCAACAAGC GCAAATCCGC CGCTACGAAC
GAAAACTTAG GCTAA
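
A GC content figure like the one listed in the Gene Information fields is conventionally the fraction of G and C bases in the sequence. The following is a rough sketch of that calculation, assuming the input uses the space-grouped layout shown above; `gc_content` is an illustrative name, and the record does not say how its own value was computed or rounded.

```python
def gc_content(dna: str) -> float:
    """Fraction of G/C bases; whitespace and line breaks are ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Example on the first two 10-base groups of the gene sequence.
fragment = "ATGCTAAAAC GTCAGCTTAC"
print(f"{gc_content(fragment):.2f}")
```

Passing the full gene sequence, line breaks included, would give the whole-gene fraction for comparison with the listed value.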
 
Protein sequence
MLKRQLTRLG YLLLACLPLL LAASIGSYYS MQVAMSIRSD KPNIQSQPIV YDAQKPTAVI 
LLGNTVSEIT DVLAPYALLA KTGLYNVYTV AETSSVRSLS GGLDLLPDYS FAGLATLLKQ
PPALVIVPAI TEIQASQNQP VLAWLRQQSQ AASTVMSWCT GAEVLAESGL LDGLPATAHW
ADLSSLQKRY PKVKWQNNQR YVDINQQIIT TAGLTSGIDA TLYFLQKMHG ADVSQQLADM
INYSDQSYLE HATMQPFSIT PSDSVYLLNA AFYWPKQTLG IWLSQGVDEL ALAAFFDVYT
GSWVYDFRTI GAEPNIRSAH GLQLIPRYQA ATHIDRLVGF GPNQQAQTWA EQQQMTYHEL
DLAQQGNMFE QALVRFAIDQ DQASAQFAAK RMEYRQPLTL HGASWPWRTL IGIGVWLGVG
IGACYGLRRI SNKRKSAATN ENLG
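
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The following is a minimal sketch of that translation, assuming the standard codon-to-amino-acid assignments (which table 11 shares with the standard code) and ignoring the alternative start codons that table 11 permits, since this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Codon order follows the usual T, C, A, G layout, third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Whitespace is ignored; codons with unknown bases become 'X'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence and the first 10 residues of the protein.
print(translate("ATGCTAAAAC GTCAGCTTAC GCGCCTCGGT"))  # MLKRQLTRLG
```

The final 15 bases, GAAAACTTAG GCTAA, translate to ENLG followed by the TAA stop codon, which agrees with the end of the listed protein.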