Gene Haur_0803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0803 
Symbol 
ID5732703 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp907598 
End bp908797 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content53% 
IMG OID641277934 
ProductPAS/PAC sensor signal transduction histidine kinase 
Protein accessionYP_001543579 
Protein GI159897332 
COG category[T] Signal transduction mechanisms 
COG ID[COG5002] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.944221 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTTGGA TTGTGTTGTT GGCGTTGACA ACGATGATCG CGTTGCTTTG GGGTTGGCGT 
GGTCAACGCA ATCTCCGCCG TGAACTGGCC TATTTACGCA ATCGGCCTGT GGTTGCCCCA
ACCCCATCAG CCGATCCATT TAATCAGTTG TTTCGCACGC TCAGCGCCGC GCTTGATGCA
GGCGTGATTG TCGTCAATGA TGGCCGCACC ATTCGTTATT GTAACGATGT TGCGGCTAAA
TTATTTGGGG TTAGTGCCAA TGCGGTTGTC AATCATAGCG TGATTACGTT GGTACGCGAT
TATCAGGCCG ATACCATGAT CGAGCAATCG ATTAGGCGAC GCGACCCGCA GCAAGTAACC
TTGCAACCAG TGCTCTCGAA TCGTACAATT CGGATTTGGT GTGAGCCATT GCCTGAAAGC
GGCGCGTTAA TTCTAGCCCG CGATTTAACC CAACTGAGTT TGCTGGAGCG GGCGCGGCGC
GATTTGGTGG CCAACGTTTC GCACGAATTA CGCACACCAC TAGCCTCAAT CAAACTCTTG
GTTGAAACTT TAGCAACCCA GCCACCACCC GAATTAGCCC AACGCATGCT GGGCCAAGTC
GATACCGAGC TGGATGCGGT GATGCAGTTG GTTGATGAAT TGCACGAACT TTCGCAGATC
GAATCTGGGC GCACGGCCTT GCAATTGCAG CCAACTCCAG TCGAAGATAT TGTTGAACGG
GCTAATGATC GGATTCAGCC GCAAGCCAAG CGCAAAGATC TGCGAGTGGC GGTGACAATT
GCCCCTGATC TGCCCGAAGT CTATGTTGAT CGCGACCGAA TTAGCCAAGT GCTGCTCAAT
TTGTTGCATA ATGCAGTTAA ATGGACTGAT GCTGGCGGCA CGATTACAAT TGAAGCTGGC
TTGCGTTCGC GTAGCGAACT GGGCCATCAT CTTAGCCGCA TGCTTGAGCC AAGCAATCGT
TGGGTGATTA TGGCAATTCA CGATACTGGC GCGGGCATTC CAGCCGAGGC GATTCCGCGA
ATTTTCGAGC GTTTCTATAA AGTTGATCGG GCGCGGACGC GCGGGGTTGG TGGCACAGGC
TTGGGTTTGG CAATTGTCAA ACACTTGGTC GAAGGTCATG GCGGTGTGGT TTGGGTTGCC
AGCAGCGAAG GTCGCGGTAG CACTTTTACC GTAGCCCTGC CAGTCGTCGA AGATGATTAA
 
Protein sequence
MTWIVLLALT TMIALLWGWR GQRNLRRELA YLRNRPVVAP TPSADPFNQL FRTLSAALDA 
GVIVVNDGRT IRYCNDVAAK LFGVSANAVV NHSVITLVRD YQADTMIEQS IRRRDPQQVT
LQPVLSNRTI RIWCEPLPES GALILARDLT QLSLLERARR DLVANVSHEL RTPLASIKLL
VETLATQPPP ELAQRMLGQV DTELDAVMQL VDELHELSQI ESGRTALQLQ PTPVEDIVER
ANDRIQPQAK RKDLRVAVTI APDLPEVYVD RDRISQVLLN LLHNAVKWTD AGGTITIEAG
LRSRSELGHH LSRMLEPSNR WVIMAIHDTG AGIPAEAIPR IFERFYKVDR ARTRGVGGTG
LGLAIVKHLV EGHGGVVWVA SSEGRGSTFT VALPVVEDD