Gene Haur_3775 details


Gene Information

Locus tag: Haur_3775
Symbol:
ID: 5735639
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4745007
End bp: 4746653
Gene Length: 1647 bp
Protein Length: 548 aa
Translation table: 11
GC content: 50%
IMG OID: 641280927
Product: GAF sensor signal transduction histidine kinase
Protein accession: YP_001546539
Protein GI: 159900292
COG category: [T] Signal transduction mechanisms
COG ID: [COG2203] FOG: GAF domain
        [COG4585] Signal transduction histidine kinase
TIGRFAM ID:
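
The start, end, and length values listed above can be cross-checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates (a common convention for such records, but not stated on this page) and an unspliced CDS that ends in a single stop codon. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information table for Haur_3775.
length_bp = gene_length(4745007, 4746653)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

Under these assumptions the computed values agree with the listed gene length (1647 bp) and protein length (548 aa).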


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCATCAA TAACTCTCGA TGCTCAACGC TGGCAAACAT TCAGTCAGTT GCTCACGGCA 
ACCGCCTACC AAAACCCACA GGATCTGCTT GATTCGTTTT TGCAACGTTT AATCGAGTTT
TGGCCGGCCC AAGCAGGGGC ATTGCTCTAT ATTGATCCAT TAAATGTGCC GTTTTCGCTG
GAGCATGGCG AATTGCCTGC CGATGGTCAC GAAATGATCG AGCAAGCTCG TTCGGTATTC
GAGCGCACAA CTGGCGGCGA GCAAGATTTA GGGGTCTATG GGATTAACGA TGAGCTAGCC
TTGGTGGAAT TAACTCTGCG TTCTGGTGAT GATGAAGTTG GCCTGTTGCA CCTGATTGTT
TCTGCCGACC GCGTATCACC TGAGAATGAG GGCGAATTGG CCTTGCTTTT GGTGGGCGTG
GCTGGTGGAG CCGCCGATCG AGTGGCCTTG CTCAACACGA CGCGCAGCGA ACTTGATGAG
ATGAATTTGC TCTATCAAGT CAGCCAATCG ATTGCCAGTG ATATTGATTT GCGCAGCCTT
TTGCGCACGA TTATTGAAAA AGTTACCCAA ATTGTCGATA GCGAAAGCGC TTCGTTGCTG
TTGGTCGATG AAGTGCATAA AGAGCTGTAT TTTGAAACGC CTTCGAACAA TAGCCAAGAA
TTGCGCTCTT ATCGCATGCC AATGGATCAA GGCTTCGCTG GCTGGGTTGT CACCCATGGC
CAAGGCTTGA TTGTTGATGA TCCACAAAAT GATGCGCGGT TCTATCGCCA AGTTGATAGC
GATATTTCGC ATCAAACCCG CAATATTCTG ACCGTGCCAG TGCGTTCACG CGAACGGACA
GTTGGGGTAA TTCAGGCAGT CAATAAGCGT AATGGCCCGT TTACCGAGCA CGATTTGCGC
ACGCTCTCGA TGCTCGCCAA CCAAGCGGCG ATCTCGATTG AAAATGCCAA TCTTTACACC
AAACTCAAAG AAGAACGCGA TCGTTTGATT CGCAAAGAAG AAGAAGTGCG TCACCAAATC
AATCGCGATC TGCACGATGG GCCAACCCAA TCGGTCGCAG CGATTGCCAT GAACATCGAG
TTCATCAAAA AATTGATGCA GGCCATGCCT GAGCGCGTCG AAGATGAGCT TAATTCATTA
GCGGCCTTGG TCAAAAAAAC CTACCAAGAA TTGCGGACGT TGCTATTTGA GTTACGCCCG
TTGGGTCTCG AAACCCAAGG CTTGGTGGCA ACTCTACACG AATACGCCAA CAAATTCCGC
GATCCATCGG GGATGCAAGT GCGCTTTACG CCTGGCAACT TTATTGGTCG TTTGGCCCCA
CAAACTGCCG CCGCCACCTT TATGATTATC CAAGAGGCGG TCAATAATGC CCGCAAACAT
GCCAAAGCCT CGATGGTCTG GATTGAGCTT TTTCAAGATA GTGAACGCCA AATGCTGACT
GCGGTTGTGC GTGATAGTGG GATTGGCTTT GATCTTAAGA GCATCACCAC GTCGTATGAG
GAGCGTGGCT CGTTTGGGTT GTTGAATATG GGCGAACGGG CAACCCTCGC GGGTGGCGCT
TGCGAAATTC GCTCGTCGGC AGGCGAAGGA ACCCAAATTA TCATTCGAGT GCCGACCTTG
GTCGAAGAAG CCGAAGATTT TGCGTAA
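
The gene sequence is printed in blocks of ten bases, so the spacing has to be stripped before any computation. As a minimal sketch, the helpers below (hypothetical names, not part of the source database) clean a blocked listing and compute its GC percentage. Only the first row of the listing is used here, so the printed figure describes that row and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, copied as displayed (with block spacing).
first_row = "ATGCCATCAA TAACTCTCGA TGCTCAACGC TGGCAAACAT TCAGTCAGTT GCTCACGGCA"
seq = clean_sequence(first_row)
print(len(seq), "bases;", round(gc_percent(seq), 1), "% GC")
```

Passing the full listing to `clean_sequence` and then to `gc_percent` would give a value to compare against the GC content reported in the Gene Information table.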
 
Protein sequence
MPSITLDAQR WQTFSQLLTA TAYQNPQDLL DSFLQRLIEF WPAQAGALLY IDPLNVPFSL 
EHGELPADGH EMIEQARSVF ERTTGGEQDL GVYGINDELA LVELTLRSGD DEVGLLHLIV
SADRVSPENE GELALLLVGV AGGAADRVAL LNTTRSELDE MNLLYQVSQS IASDIDLRSL
LRTIIEKVTQ IVDSESASLL LVDEVHKELY FETPSNNSQE LRSYRMPMDQ GFAGWVVTHG
QGLIVDDPQN DARFYRQVDS DISHQTRNIL TVPVRSRERT VGVIQAVNKR NGPFTEHDLR
TLSMLANQAA ISIENANLYT KLKEERDRLI RKEEEVRHQI NRDLHDGPTQ SVAAIAMNIE
FIKKLMQAMP ERVEDELNSL AALVKKTYQE LRTLLFELRP LGLETQGLVA TLHEYANKFR
DPSGMQVRFT PGNFIGRLAP QTAAATFMII QEAVNNARKH AKASMVWIEL FQDSERQMLT
AVVRDSGIGF DLKSITTSYE ERGSFGLLNM GERATLAGGA CEIRSSAGEG TQIIIRVPTL
VEEAEDFA
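
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below is one way to reproduce it without external libraries. It relies on table 11 assigning the same amino acids to codons as the standard code (the two differ in which codons may act as starts), and it ignores alternative start codons, which is harmless here because this gene begins with ATG. `CODON_TABLE` and `translate` are illustrative names.

```python
# Codon table in NCBI order: bases T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA listing in frame 1, stopping at the first stop codon.

    Whitespace from the blocked display is removed first. Codons containing
    characters other than A, C, G, T raise KeyError in this simple sketch.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, as displayed above.
print(translate("ATGCCATCAA TAACTCTCGA TGCTCAACGC"))  # MPSITLDAQR
```

The ten residues produced match the first block of the listed protein sequence. Running `translate` on the complete gene listing should yield the full protein if the two listings are consistent.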