Gene Haur_4971 details


Gene Information

Locus tag: Haur_4971
Symbol:
ID: 5736807
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 6307614
End bp: 6308909
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 51%
IMG OID: 641282138
Product: integral membrane sensor signal transduction histidine kinase
Protein accession: YP_001547729
Protein GI: 159901482
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
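
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp form a 1-based, inclusive interval and that the final codon of the CDS is a stop codon; neither is stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 6307614
end_bp = 6308909
gene_length_bp = 1296
protein_length_aa = 431


def span_length(start, end):
    """Length of a coordinate interval, assuming 1-based inclusive ends."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons in a CDS, assuming one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(start_bp, end_bp))   # 1296, matches Gene Length
print(coding_codons(gene_length_bp))   # 431, matches Protein Length
```

Under these assumptions the three fields agree with each other: 6308909 - 6307614 + 1 = 1296 bp, and 1296 / 3 - 1 = 431 aa.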


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCAGAGC AACCAATCGT CCCATGGAAA GTGCGTGCTT GGCGCTGGAT CAAGCAATGG 
TGGCAGGATA TTGTCCATGG CGCTGATCCA TTTGCAACGC TCACGGCGAT TTTGGCCTTG
TTTACGGCAT TAATTGGGAT TATGGGCGCA GTCAATTTAG ATTATGGTGA TGTGCCAACC
GCCGTTATTT TGCTCGATTT TGGGGTTACA TTGTGTTTGG GCGGCTTAGC ATGGTGGATG
TTCCGCCAAC CTAATGCACT TTGGCCACGC TGGATTGTGG TTGCTGTGCT CACGATCGCC
TGTTTTTATG TGCTGGCGAC GATTCCCTCG GATGCGGCGG TGGTTGTCTT GACGATGTTG
CCAATTTTGA TGGCAATGTT GGCTAGTCGG CGGTGGTGGC CAATTGCCTT TTACCCGCTC
TCAGTCACGA TTGTTGCTTC GATTCATGCT GGTGGTTGGC GCGTTTTTAC GCCAGCTATC
ATCGCATTCA ACATTTTAGA GTTTTTGATG CTCTGGGGGG TCACGTCGCA GGCGGTGCAA
TTTGCCAAAG CACGGATCAA AACTGAAGCT CAACTGAGCA AAGAGCAGCA AGTCCGCCAA
TTTATGCGCT TCTTCCAGCA CGAATTGCGC TCCTACAGTA ATAGCCTTGA TGTGCTGTTG
CCAATGCTCA ATCAACATCT TGCCAAACAA TCAGTGATCG ACCACGATGC GATTCATCCG
CAGGAGTTGG TGCGTGTATT GCAAACCACC GCCGATTCAA TTCGCCAACT CACCACCGAA
TTGCTGGTGC TGACCCGCGA AGGCCAATTG CTGCCTCAAC AACGCATTCC CATCCAATTA
TTGCCACTGC TGCAAACTGA ACAAGCTGAA TGCCAAACCG AAATGCAGCG CCATGGCCTT
AATTCGACGA TTCAGATCGA TTGCCCGGCT GATTGTATTG TGGTTGGCGA TGCACTTTTT
TTGCGCTTGG CGATTCATAC GATTTTGCAA AACGCGATCG AGGCGCTGTT ACGCCACCCT
GGCGAAGGCC AAATCTTCAT TCAGGTTGAA CAAACGCCTC AATCGACGCA GATTGAGATC
TACGACACGG GCAAGGGCTT TCCTGCCGAT TTGCTTGATC ATTTAAATCA GGCTCAATCG
TTGAACCAAG CCTTGGGCTG GACGACCAAA ATGGGTGGCA GCGGCTTAGG TATGCCCTTG
ATTCGCCATG TGGCACTCTT GCATGGTGGC ACTGCCCAGT TTAGCAATCG GCCAGAAGGC
GGTGCACAAA TTAGGCTCAA TTTGCCGCAG GTTTAG
 
Protein sequence
MPEQPIVPWK VRAWRWIKQW WQDIVHGADP FATLTAILAL FTALIGIMGA VNLDYGDVPT 
AVILLDFGVT LCLGGLAWWM FRQPNALWPR WIVVAVLTIA CFYVLATIPS DAAVVVLTML
PILMAMLASR RWWPIAFYPL SVTIVASIHA GGWRVFTPAI IAFNILEFLM LWGVTSQAVQ
FAKARIKTEA QLSKEQQVRQ FMRFFQHELR SYSNSLDVLL PMLNQHLAKQ SVIDHDAIHP
QELVRVLQTT ADSIRQLTTE LLVLTREGQL LPQQRIPIQL LPLLQTEQAE CQTEMQRHGL
NSTIQIDCPA DCIVVGDALF LRLAIHTILQ NAIEALLRHP GEGQIFIQVE QTPQSTQIEI
YDTGKGFPAD LLDHLNQAQS LNQALGWTTK MGGSGLGMPL IRHVALLHGG TAQFSNRPEG
GAQIRLNLPQ V
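
As a consistency check, the gene sequence can be translated and compared with the protein sequence. The following is a minimal sketch, not the tool used to produce this record: it builds the codon table by hand in the usual NCBI ordering (bases T, C, A, G) rather than calling a library, and the helper names are hypothetical. Translation table 11 shares its codon-to-amino-acid assignments with the standard code and differs only in the permitted start codons, which does not matter here because this gene starts with ATG. Only the first and last lines of the gene sequence are used to keep the example short.

```python
from itertools import product

# Codon table in NCBI order: first base varies slowest, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocks):
    """Remove the spaces and line breaks that split the sequence into 10-mers."""
    return "".join(blocks.split()).upper()


def translate(dna_blocks):
    """Translate an in-frame DNA string codon by codon ('*' marks a stop)."""
    dna = clean(dna_blocks)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First and last lines of the gene sequence in this record. The 21 lines
# before the last one hold 60 bases each (1260, a multiple of 3), so the
# last line starts in frame.
first_line = "ATGCCAGAGC AACCAATCGT CCCATGGAAA GTGCGTGCTT GGCGCTGGAT CAAGCAATGG"
last_line = "GGTGCACAAA TTAGGCTCAA TTTGCCGCAG GTTTAG"

print(translate(first_line))  # MPEQPIVPWKVRAWRWIKQW
print(translate(last_line))   # GAQIRLNLPQV*
```

Both fragments agree with the start (MPEQPIVPWK VRAWRWIKQW) and the end (GAQIRLNLPQ V) of the protein sequence, with the trailing TAG read as the stop codon.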