Gene Haur_1887 details


Gene Information

Locus tag: Haur_1887
Symbol:
ID: 5733776
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2275514
End bp: 2276722
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 50%
IMG OID: 641279031
Product: GntR family transcriptional regulator
Protein accession: YP_001544658
Protein GI: 159898411
COG category: [E] Amino acid transport and metabolism; [K] Transcription
COG ID: [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs
TIGRFAM ID:
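
The coordinates and lengths listed above agree with each other if the start and end positions are 1-based and inclusive and the protein length excludes the stop codon. A minimal sketch of that arithmetic, assuming those conventions (the variable names are illustrative and not part of the record):

```python
# Values copied from the gene record.
start_bp = 2275514
end_bp = 2276722

# Assumption: coordinates are 1-based and inclusive, so the span
# covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1209
print(protein_length)  # 402
```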


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTGCGA TTGCAACAAC TCCTAGCCAA TTTGAACTCG CTGCTTGGGC TAAGACCATC 
ACACCCTCGG CGCTGCAAGA TATGCTTTCG GCAACTGCCA ATCCTGAGGT TATTTCGTTT
GCTCTGGGGC TACCCGCCCC AGAGCTGTTT CCTCGCCACC AATTTAGCCA ATTGGCCAGC
ACGTTACTCG AAGCCGAACC TTTGGCGTTG CAATATGGCC CGCCAAGTAC CACCCTCAAA
ACGGCAATTG TTTCATTGAT GGCGCAACGA GGGGTACGCT GTCGGCCCGA GCAAATCTTT
CTGACCAATG GTGCGCAACA GGGCATGAAC CTGTTGGTAC GCTTGCTTTT GGCCGATGGC
GGCAGCGTTT TGTTGGAAGA TTGTATTTAT ACGGGCTTTC AGCAAGTGCT TGATCCATTT
CAAGCTAAGT TGCTAACCGT GCCAACCAAC CCTGAAACTG GTATGGATGT AGCAGCAGTC
GAAGCTCATT TAGCTGCAGG CCAACGCCCA AGCTTAATCT ATGCGATCAG CGATGGACAC
AACCCGCTTG GCGTGAGCAT GAGCCTCGCC CAACGCCAGC AGCTCGTTGA ACTAGCCCAA
CAGTATCAAA TTCCCATTAT TGAAGATGAT GCTTATGGCT TTTTAAGCTA TCAGGCTGAT
ACGATTGCCC CAATGCGAGC CTTAAGTGAC GACTGGGTTT TATATATTGG CTCATTTTCG
AAAATTCTAG CCCCATCGTT GCGGGTTGGT TGGTTAGTCG TACCCGAGTG GTTAATCGAA
CGCTTGTCGA TCGTCAAAGA GGCGAGCGAT ATTGGTACAG CCACGCTGAG CCAACGTTTA
GTCGCAGCCT ATACCCAAAC CCATCAATTA ACTACGCATA TCGACCAATT ATGTCAAATA
TATACAACTC GCCGCGATAC AATGTTTAGC GCGTTGGAGC AGCATTTTCC GTCCCAAACC
CGTTGGTATC AGCCTAGCCA TGGCATGTTT ATTTGGGTTG AACTGCCTAC AACAGTTGAT
CCCTTTAAAC TCCTAGACCG AGCGATCAAC CAAGCGAAGG TGGCGTTTAT CCCAGGCAGT
GTGTTTGGTG TGGCGGGCAA ATCGATGAGT ACCAATGGAA TTCGCCTGAA TTTTTCGAAT
GCCGATATTG ACCAGATTAA TGCGGGAATT GAGCGTTTAG CCACAATCAT GCAAACCCTC
AAAGCCTGA
 
Protein sequence
MAAIATTPSQ FELAAWAKTI TPSALQDMLS ATANPEVISF ALGLPAPELF PRHQFSQLAS 
TLLEAEPLAL QYGPPSTTLK TAIVSLMAQR GVRCRPEQIF LTNGAQQGMN LLVRLLLADG
GSVLLEDCIY TGFQQVLDPF QAKLLTVPTN PETGMDVAAV EAHLAAGQRP SLIYAISDGH
NPLGVSMSLA QRQQLVELAQ QYQIPIIEDD AYGFLSYQAD TIAPMRALSD DWVLYIGSFS
KILAPSLRVG WLVVPEWLIE RLSIVKEASD IGTATLSQRL VAAYTQTHQL TTHIDQLCQI
YTTRRDTMFS ALEQHFPSQT RWYQPSHGMF IWVELPTTVD PFKLLDRAIN QAKVAFIPGS
VFGVAGKSMS TNGIRLNFSN ADIDQINAGI ERLATIMQTL KA
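
The gene and protein sequences above are laid out in blocks of ten characters, so they need the whitespace removed before they can be compared or analyzed. The sketch below shows one possible way to do that, then to translate the DNA and compute a GC fraction. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical and not part of the record. The codon table is the standard one, which translation table 11 shares for codon-to-amino-acid assignments; alternative start codons are not handled here.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G). Translation table 11
# uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(zip(("".join(c) for c in product(BASES, repeat=3)), AMINO_ACIDS))


def clean_sequence(text):
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and the final block of the gene sequence shown above.
head = clean_sequence("ATGGCTGCGA TTGCAACAAC TCCTAGCCAA")
tail = clean_sequence("AAAGCCTGA")
print(translate(head))  # MAAIATTPSQ
print(translate(tail))  # KA
```

The two printed fragments match the first block and the last two residues of the protein sequence listed above. Passing the full gene sequence through `clean_sequence` and `translate` would allow the same check over the whole protein.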