Gene Haur_3364 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3364 
Symbol 
ID5736906 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4242088 
End bp4243584 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content53% 
IMG OID641280511 
Producttranscriptional regulator 
Protein accessionYP_001546128 
Protein GI159899881 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATTTC AGCTTGATCG TCAGCACGCC AAACCACTCT ATATTCAATT ATCCGAACAA 
CTGCAAGAGC GGATTCGGAG TGGCTCGTTG CCAGCTGGCA CGAAACTGCC GCCAGTGCGC
GATTTAGCTG AATCACTTGG TCTCACCCGT TTGACGGTTC ACAATGCCTA TAGCGAACTG
CAAGCAAGTG GTTGGGTTGA GGCCTACGTT GGTCGCGGCA CCTTCGTCGC CGAGCGGATC
AAGCCGATTA TTCCAGCATA TGAGATTCGC CAACGGGTAG TGGATGAGCT ACAAACGCCA
TGGTTTAGCC AAGGCATGTT GGCCGATATG CTACGCTTGG CCCAACAGCC AAATTTAATC
TCCTTTGCCC AAGCCGCGCC AGCCGAAGAA ACCTTTCCAG TGCGTGAAAT TGGCCGCGCA
ATTCAGCAAG CCCTGCGCGA CCCCAGCGCC CTAGGCTATG GCCCAACCCA AGGCGAATTA
TGTTTGCGCG AAGCGATTGC CACATGGCTG CTCGACCGCA ATGTTGTAAC CTCGCCCGAC
CATGTGCTGG TGACAACTGG TGCTCAGCAG GGCGTAGCCT TGGCATTAAA GGCCTTTGTT
CGCCAAGGCG ATGTGGTTTT GGTCGAGGAG CCAACCTATT TGGGCTTTAT CGAGCAGGCT
ACGGCCTTGG GTGTGCGCTT AATCGGCATT CCATTGGATG ATCAAGGCTT GCGGTTGGAT
ATTTTGCAAC GGGTATTGTG TGAATACAAA CCACGGTTGC TCTATACCGT GCCAACCTTC
CACAACCCAA CCGGCGTTTG CCTTTCGACC GAGCGCCAAG AAGCCCTATT GCAATTGGCC
CAAGAACATA ACTTAATTAT TTTAGAAGAT GATGTCTATG GGCCGCTGAG CTACGATGCT
CAAGCACCAC ACCCAATCAA AGCCCGCGAT ACTAATGGGC AGGTGGTCTA TCTTGGCAGC
TTCTCCAAAA TTCTAACTCC GGGCTTACGC CTAGGTTATT TGGTTGCCCG TGACGAATTT
TTGCACCCGT TGCTGACTGC CAAGCGTGGC AACGATCTCC ACTGCTCGCC ATTATTGCAA
CGAGCTTTGG CCGATTATCT TGGCCGTGGT CAGTTGGCGG CGCATTTGCG CTATGTGCGT
GAACTCTATC GTGAGCGTCG CGATGCCATG GAACGAGCGT TGAACCGCTA TTGTCCCCGT
GATATTCAAT GGACGCATCC ACGTGGCGGG TTATGCTACT GGCTAACCTT GCCCTCTGGA
TTAAATGGCA CCGATATTTA TACCGAGGCG ATTGAAGCAG GCGTTGGCGT GACCCTTGGC
AATGTCTTTT TTCCACAACC GCCACGCAAC GCCCACTTAC GGCTCTGTTT TGCCACCCAA
TCACCAGAAT TAATTGATCG TGGAATTCGC ATCCTTGGCG ATGTGCTAAC CCGCCATGTC
TTGCGTTGTG GTCAACTTGC TGCCCGTGCT TGGCGCGAAA CCACCCCACT GATGTAA
 
Protein sequence
MEFQLDRQHA KPLYIQLSEQ LQERIRSGSL PAGTKLPPVR DLAESLGLTR LTVHNAYSEL 
QASGWVEAYV GRGTFVAERI KPIIPAYEIR QRVVDELQTP WFSQGMLADM LRLAQQPNLI
SFAQAAPAEE TFPVREIGRA IQQALRDPSA LGYGPTQGEL CLREAIATWL LDRNVVTSPD
HVLVTTGAQQ GVALALKAFV RQGDVVLVEE PTYLGFIEQA TALGVRLIGI PLDDQGLRLD
ILQRVLCEYK PRLLYTVPTF HNPTGVCLST ERQEALLQLA QEHNLIILED DVYGPLSYDA
QAPHPIKARD TNGQVVYLGS FSKILTPGLR LGYLVARDEF LHPLLTAKRG NDLHCSPLLQ
RALADYLGRG QLAAHLRYVR ELYRERRDAM ERALNRYCPR DIQWTHPRGG LCYWLTLPSG
LNGTDIYTEA IEAGVGVTLG NVFFPQPPRN AHLRLCFATQ SPELIDRGIR ILGDVLTRHV
LRCGQLAARA WRETTPLM