Gene Haur_3340 details

Gene Information

Locus tag: Haur_3340
Symbol:
ID: 5735210
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4210619
End bp: 4212292
Gene Length: 1674 bp
Protein Length: 557 aa
Translation table: 11
GC content: 55%
IMG OID: 641280487
Product: multi-sensor signal transduction histidine kinase
Protein accession: YP_001546104
Protein GI: 159899857
COG category: [T] Signal transduction mechanisms
COG ID: [COG5002] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
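
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4210619
end_bp = 4212292
gene_length_bp = 1674
protein_length_aa = 557

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein length.
codons = span_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("protein length consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions the three fields agree with each other: the span between the coordinates equals the stated gene length, and one third of it minus the stop codon equals the stated protein length.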


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTGAAA GCGTCGCAAT GACTGAGCAG CCTAGTCAAA CAGCTGCGAT AACTCTCCCA 
CGTTCGCCCT TAAGTCGGTT GTTGTTGCCA ATTGCCTTGG TGGTGTTGGC GCTTGGTGCT
GGGCTGGCAG CGTGGAGCGG ATTTAATTTT GGCCCACAAA CGATGCTGAT CGGGGCGATT
GCAGGCGGAG TTGGCGCACT GGCCGCTTTG GTTGGCCTGA TTGCTGGCTT GCGTCGGCGC
TTGCCCTTGA GTGGTTGGGC CTTGATTGTG GGTACGCTCA GCATCGCTTT GGGTGCAACC
ATGGTCTTCC CTGAACAACG CGAATTGGCG ATTATCAGCG CCGCCCTGCC GGTTTTGTGT
GCCGCGCTGA CCTTGCGCCC CTTGCACACT GCTATTGTTA CGGTTGTTAG TTTAGCGAGT
TTATGGGCGG CGGCCTATTT TGCTGCCGCC GAGCCAAGCA CTAATCAGTT AATTCAATGG
AGCTTGCTCA GTGCCTTTGT GCTCGTGATC AGCATTTTTG CCTTGGTTTG TAGCGCAGCG
CTCCGTGCTT TTGATCGCCA GCAAACTGTC GGACATGAAT CAACCACCCA ACTCAATAAG
CAACTTGGCG AAGAGCAAGA ACGGGTCGAA CGCGCCGCCC AATTGTTGGT GCAAGAGCGT
GATCGACTGG CGGCGGTGCT GGGCGCAGCG AGCGATGGCG TGGTGTTGGC CGATAGCAAT
GGCATCATTT TGCAGGCCAA TGCTGCTGCC CGCCAACTGT TGAATGAAGC GTTTGGCGGC
ACGCTCGAAG GCCAAGCACT CAACCAGTGG AGCCAAGAAA ACACGGGCCG TTTGCGGGTA
ATTAGCAACG AACGCGAAGG CGAACGCCAA CGGGTGGTTT TTGAGCAACA ACAAGGCACG
CGCAAACCCG TAATTGGCCT TAACCAAGTG CCAATTCGCA GTAGCGCCGG CAGTGTCTTG
GGCTATGTTG GGGTTTTCCA CGATAAAACC ACCGAATTAG AAGTTGAAGA AATGCGTTCG
CAATTACTCG ATTTCCTCGT GCAAGATATG CATGATCCGC TCAACTCGGT GTTGGCAGCC
CAAGATACCT TGTTGGCTGG CGATTTAGGC GATGGCAATG AGCGGGTTTT ATCGACCGCC
CGCCGCACAA CCTCGCGCTT GGTTGAGCTA ACCAATACCT TGATGGAAAT GAGCCGTTTG
CATGGCGACC CCAATAGCCT GCATCGTTTG GCCAACCCAT TGCGCCCATT GATCGAAGGC
AGCATCGCCC AATCAACACC ACAAGCCCAA CAACGGGCAA TCAACTTGGT GTTGGAGTAT
GGCGCAGATA GTGGTGGCCT CGCCTTCGAT GCTGATAAGA TGCGGCGGGT GATGAGCCAC
TTGCTTGATA ATGCTTTGCG TCGCAGCCCA GCCTATAGCA CGGTGCGGGT GCAAGTGAGC
AACACGGGCG GAAATGCCCA AGTACGGATC GCCGACCAAG GCCCAAGTAT TCCGGTGGAA
CTCGCAGGCC GCATTTTTGA TCGCTTCAGC AAACAGGTTG GCGAACAACG GATCGGCGGG
GTTGGCCTAG CCTATTGCAA ACAGGTGATT GAGGCTCATG GTGGCCGAAT TTGGGTCGAT
AGCACGCCTG GCAAAGGCAG TACCTTTATC TTTAGCATGC CCTCAGCAGC ATAA
 
Protein sequence
MTESVAMTEQ PSQTAAITLP RSPLSRLLLP IALVVLALGA GLAAWSGFNF GPQTMLIGAI 
AGGVGALAAL VGLIAGLRRR LPLSGWALIV GTLSIALGAT MVFPEQRELA IISAALPVLC
AALTLRPLHT AIVTVVSLAS LWAAAYFAAA EPSTNQLIQW SLLSAFVLVI SIFALVCSAA
LRAFDRQQTV GHESTTQLNK QLGEEQERVE RAAQLLVQER DRLAAVLGAA SDGVVLADSN
GIILQANAAA RQLLNEAFGG TLEGQALNQW SQENTGRLRV ISNEREGERQ RVVFEQQQGT
RKPVIGLNQV PIRSSAGSVL GYVGVFHDKT TELEVEEMRS QLLDFLVQDM HDPLNSVLAA
QDTLLAGDLG DGNERVLSTA RRTTSRLVEL TNTLMEMSRL HGDPNSLHRL ANPLRPLIEG
SIAQSTPQAQ QRAINLVLEY GADSGGLAFD ADKMRRVMSH LLDNALRRSP AYSTVRVQVS
NTGGNAQVRI ADQGPSIPVE LAGRIFDRFS KQVGEQRIGG VGLAYCKQVI EAHGGRIWVD
STPGKGSTFI FSMPSAA
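
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that step, not the database's own method: `normalize_sequence` and `gc_fraction` are hypothetical helper names, and the demo input is only the first line of the gene sequence.

```python
def normalize_sequence(block_text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one uppercase string."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(1 for base in seq if base in "GC") / len(seq)


# First line of the gene sequence above, used only as a short demo input.
first_line = "ATGACTGAAA GCGTCGCAAT GACTGAGCAG CCTAGTCAAA CAGCTGCGAT AACTCTCCCA"
seq = normalize_sequence(first_line)

print("bases:", len(seq))
print("GC fraction:", gc_fraction(seq))
```

Passing the full gene sequence through the same two helpers gives a length and GC fraction that can be compared with the Gene Length and GC content fields of the record.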