Gene Haur_3219 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3219 
Symbol 
ID5735087 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4071225 
End bp4073606 
Gene Length2382 bp 
Protein Length793 aa 
Translation table11 
GC content49% 
IMG OID641280365 
Productputative PAS/PAC sensor protein 
Protein accessionYP_001545984 
Protein GI159899737 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3437] Response regulator containing a CheY-like receiver domain and an HD-GYP domain 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACTGGC GCTCGTTTAT TCCACGGACA CTAGCCGATT CCCAATCTCG TCAACAGTTA 
TTGATTGTCA TGCTCAAAAG CGTTGTTTGG ATCTTTACTC CCGTCCAAGT TATCTTGAGT
TTTACCTTGA CAACCACAAG CTGGCCACGC CTGTTGTTGC TGGCGATTGT CAACCTGCTT
TGTGCTGGAA TCTGGCTCTT AATCAAACGC AATAAGCTGG ATCTGGCGGG CAAGCTGTTT
GTTGGGTTGT TTTGGTTGTT ATTTACGGGT TTGATGCTGA GCACTGGTGG CATTAGCTCG
CCTTCGATTA TTGCCTACTT TTTCGTCATT TTTACCGCCA GCTTTTTGTT GGGAGAACTG
GCAAGTGTGA TTATCGGCTG TCTGAGTTTA GCGGCAACCT TTGTAGCCGT GCTCCTTGAG
CTGAATCAGC TTTTGCCCGC CTCGGTGATT GTTTATACCC CCGTTTCGCG TTGGCTCAGT
TACAGCTTTT ATCTTGGCAT TATGCTGATA TTTCAGATGG TCAGTGGGCG CTTGGTCTAT
AAGGCCATGC AAAAAGCCCA AGCCGAGTTA CACGAACGCC AACGCATCGA AGCTGAATTA
CGCCATTCTG AAGCCCAATA TCGTCTTTTG TTCGATACTA TTCCAATTGG CATTGGCATG
GCTAAACTCG ATGGTACAGT CATCGCCATC AACCCAGCTG GCAGCCAAAT GATGGGCTAT
AGCCATGCCG AATTGATGCA AATCAAACTC GAACAGTTAT ATGCCAATCC CGATCAACGG
GCTAGTTTAA TTCAGCAAGC CCAGCAAACT GGCAAGATTC GCGACCGCGA GATGGCTTTC
CGGCGCAACG ATGGCCAATT GGTTTGGGCG TTGGTCAATC TTGATCTTGG GTTAATTAAT
GGTGAGGCGG TGACGATTGC CACGCTGCGC GATAACACCA CTCAACGCGC AGCAGAACAA
GCCTTGCGCA CCAACGAAGC TCGTTTTCGG GCAATCTTTG AGCATGCAGC AATCGGAATT
GTCTTAATTG ATAATAATGG TGTGGTCTTT AGCGCCAACC CAACTGCTTG TATGTTGCTT
AATTTCAGTG AACATGAATT GCAACAACAA GCCTTTGTTA ATTTCACCCA TCCCGATGAT
CGCCAATTTG AAATGAGCTT AAATCAAGAA CTTCGCGCCG GATTATGTGA TTCGTATCAA
ATTGAGCAGC GCTTTATTCG CTCTGATGGT GGGATTGTTT GGGGCCGTTT GTGTGCCTCA
TTAGTCCAAG ATGCTGCTGA TCAACCTTTA TTTATGATTG CGATGATTGA AGATCTTAGC
TCGTATAAAG ATACCCAGGC CCAACTTGAC TTGCAAATGC AAACTCTGAC AGCGCTCTAC
TACAGCTCAC AACGCCTAAC AACCAAACTC AAAGTCGAGG AATTGGCCCG CGATGGGGCC
GATAGTTGTG TTAGCATTTT TGGCGCACAA CGGGCTTGGC TCACGCTCAA TCAGGCTGAT
CAGCTTAACT ATCACGAACA TAACCTAGAA CAACCCTTGC CCAGTGTTCA CATTGAGATT
GAATCGACTC AACCCATCAC CCGCGATGGC CCAACTCGAA CCATGGCCTT TCCGTTGGTC
AGCCATAATC ACACCTTTGG GATGCTCAAT TTGCAGAGCA ATCAGGCTGA TTTCTTTCAA
GCTGAGCGCA ACGATATGCT ACAAACCTAT GCCAGTCAGG TCGCCGCCGC CTTGGATAAC
GCGCTCATGT TCCAACATTT GCAACAAATC AACCGTGAAG TTACCAGCGC CTACGATATG
ACGATTGAGG GTTGGTCTCG CGCCTTAGAT TTGCGTGATC ACGAGACTGA AGGCCATACC
CAACGTGTTA CCTGGATGAC TGAACGGCTG GCCGCCGCAA TGGGCCAATT TAGCGCCGAA
GAGTTGATCC ATGTGCGACG AGGCGCTTTG CTCCACGATA TTGGAAAAAT GGGTGTGCCC
GATGCCATCT TGCACAAGCC TGGGCCGCTC AACGATGAAG AATGGGTGAT TATGCGGCGG
CATCCGGTCT ATGCGTATCA ACTGCTTGCG CCAATTGGCT ACTTGCAAGG CTCGCTGGAT
ATTCCGCATT ATCATCATGA ACGCTGGGAT GGGGGCGGCT ATCCCAAAGG CTTGCAGGCC
GAAGAAATTC CTTTGGCGGC GCGAGTATTT GCCGTAGTTG ATGTATGGGA TGCCTTGCGT
TCTGATCGAC CATATCGCAA AGGTTGGTCA GATCAGCGGA TTATGGACTA TTTGGCGGGT
GAGGCGGGCA AGCATTTTGA TCCATTAGTC GTTGAAGTAT TTTTGCAATT GCTCATGACC
ATGAGCATTG CCGAGGTGCG TAATCCAGTT AATGAAGCCT AA
 
Protein sequence
MHWRSFIPRT LADSQSRQQL LIVMLKSVVW IFTPVQVILS FTLTTTSWPR LLLLAIVNLL 
CAGIWLLIKR NKLDLAGKLF VGLFWLLFTG LMLSTGGISS PSIIAYFFVI FTASFLLGEL
ASVIIGCLSL AATFVAVLLE LNQLLPASVI VYTPVSRWLS YSFYLGIMLI FQMVSGRLVY
KAMQKAQAEL HERQRIEAEL RHSEAQYRLL FDTIPIGIGM AKLDGTVIAI NPAGSQMMGY
SHAELMQIKL EQLYANPDQR ASLIQQAQQT GKIRDREMAF RRNDGQLVWA LVNLDLGLIN
GEAVTIATLR DNTTQRAAEQ ALRTNEARFR AIFEHAAIGI VLIDNNGVVF SANPTACMLL
NFSEHELQQQ AFVNFTHPDD RQFEMSLNQE LRAGLCDSYQ IEQRFIRSDG GIVWGRLCAS
LVQDAADQPL FMIAMIEDLS SYKDTQAQLD LQMQTLTALY YSSQRLTTKL KVEELARDGA
DSCVSIFGAQ RAWLTLNQAD QLNYHEHNLE QPLPSVHIEI ESTQPITRDG PTRTMAFPLV
SHNHTFGMLN LQSNQADFFQ AERNDMLQTY ASQVAAALDN ALMFQHLQQI NREVTSAYDM
TIEGWSRALD LRDHETEGHT QRVTWMTERL AAAMGQFSAE ELIHVRRGAL LHDIGKMGVP
DAILHKPGPL NDEEWVIMRR HPVYAYQLLA PIGYLQGSLD IPHYHHERWD GGGYPKGLQA
EEIPLAARVF AVVDVWDALR SDRPYRKGWS DQRIMDYLAG EAGKHFDPLV VEVFLQLLMT
MSIAEVRNPV NEA