Gene Haur_1105 details


Gene Information

Locus tag: Haur_1105
Symbol:
ID: 5732996
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 1265195
End bp: 1266835
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table: 11
GC content: 51%
IMG OID: 641278243
Product: integral membrane sensor signal transduction histidine kinase
Protein accession: YP_001543881
Protein GI: 159897634
COG category: [T] Signal transduction mechanisms
COG ID: [COG4585] Signal transduction histidine kinase
TIGRFAM ID:
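
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that adds no residue; neither assumption is stated on this page. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1265195
end_bp = 1266835

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# One codon is three bases. Assumption: the last codon is a stop codon,
# so it does not contribute an amino acid to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1641 0 546
```

Under those assumptions the result matches the reported Gene Length (1641 bp) and Protein Length (546 aa).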


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCAGCAGC ATCGTAAATT AACCCTCGTT CAACGCTTTG GGCAGCGCTT TGGCACGCTA 
CGCTGGAAAC TCACATGGTC GTATGTGTGG ATCAGCGTAG CGTTGACTTT GTTGATTAAT
GGTTTGGCGT TTGGTTTAGT GATGGGTGCT GCTCCACCGC TCAATGCTCC CGCCTTGCTT
AGCGCCGCCC AAAATCATGC CGCTGAGTTT AGCGAGGTTT TGCAAGAGCC AACCGATCAG
CACACAATCC AACAGTTGTT ACGCTATCGC TTTGGTTCGT TGCGCGGCCA ATTTCTGCAA
AATAACCTTG AATTAGCGAT TAGCAATACT ATCAATAATT CTCCCAGCGA TCTTTCGCAG
GAAATGATTT TGATGCCAGC TGAGGTATTT TCGCCTGAAG CGACCAATGT GTTTAGCGAG
AGCTACCACC TCTTATTGCT TGATGCGCAG GGTAATGTGC TCGGCGGGAC ATTTCCCAGC
CGTACTCCGG CGGGCCAGCC CTGGAACGAT GCGATAGTGG GCAATGATCG ACGGGTTGTG
CAGGCAGCAC TTGCTGGCAG CGACGATATC GATCAATTAA CTTGGAAGTA TGAGAATTAT
TTAGTGATTG CCACTCCAGT TCGCGATCAA GCCAACCAAG TGATTGGGGC ACTGTATGTA
CGTTCGCGGC CACTTAGCCA AAATCAAGTG ATTGTTGCAT TGTTGATGTT TGTTATTTTT
ATTGCCAGTT CAATTGTTTC GATTATTGCC AATGCCTTAA TTGGCATGGT CTATGGCTGG
TTTGTGGCCC GTAATTTTGT GCGCCGTTTG GTGCATCTCA CCCAAGCTAC CGATAGTTTG
GCCGCTGGCG ATTTGAGCGT GCGGGTCAAC GATGGATCAA TCGACGAAAT TGGGCATTTG
GCGCGGCGCT TCGACAGCAT GGCCCAACAG CTTGAATCAA ATGTCAAAAT GTTACGTCAA
CTAGCTGATC GCAATGCTGC CTTAGTTGAG CAGGCAGGCC AATTGGCAAT CGTCGAGGAG
CGTAATCGGT TGGCTCGCGA TTTGCACGAT AGCGTTAGCC AAGAGTTATT TAGCGTCACG
ATGTTGGCGG CGGCTGCGCG TAATTTATTG CCAGCCCAGC CCGATAAAGC GCGTAGCCAA
GTTGAACAAC TCAGCCAAAT GGCCCAACGC GCCTTGCACG AAACTCGTGG CTTGATCTTC
GCACTTCGGC CTGCTGCGCT CGGTGATCAA GGTTTAGTCC CAGCATTACG TCAACTTACC
GAAGAGGCAG CGCGTCGCCA AGGCTTGCAG ATTGAACTGA ATACCAACGG CGAACGGCGC
ATTCCCTTAG ATCATGAGCA GGCACTCTAT CGGATTTGCC AAGAAGCCTT GGCCAATGTG
ACCAAGCATA GCGGCGTGAA CAGCGCCAGC GTAAGCCTTG AATATGAAGC CCATCGCACC
ACTTTAGAGG TGCGCGATCG TGGCCGTGGC TTTGATCAAG ATAAACCGCG CAATTCGCAC
TCGCTAGGCT TGATTAGCAT TCAAGAACGC GCCAAAGCAG TTGGCGGCAC AGTTGAATTA
ACTGCTGCGC CAGGCCAAGG CACAAGCCTA CGCATCGTTG TACCACGAAC CCAAACTGGG
CTACTGGTCG AGCCACGTTG A
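
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before computing anything from it. A minimal sketch of how a GC percentage such as the one reported above could be recomputed is given here. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example runs only on the first two blocks of the sequence rather than the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used to group bases into blocks of ten."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of bases in the sequence that are G or C."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence above: 9 of 20 bases are G or C.
first_two_blocks = "GTGCAGCAGC ATCGTAAATT"
print(gc_percent(first_two_blocks))  # 45.0
```

Passing the full 1641 bp sequence to `gc_percent` gives the value to compare against the GC content field.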
 
Protein sequence
MQQHRKLTLV QRFGQRFGTL RWKLTWSYVW ISVALTLLIN GLAFGLVMGA APPLNAPALL 
SAAQNHAAEF SEVLQEPTDQ HTIQQLLRYR FGSLRGQFLQ NNLELAISNT INNSPSDLSQ
EMILMPAEVF SPEATNVFSE SYHLLLLDAQ GNVLGGTFPS RTPAGQPWND AIVGNDRRVV
QAALAGSDDI DQLTWKYENY LVIATPVRDQ ANQVIGALYV RSRPLSQNQV IVALLMFVIF
IASSIVSIIA NALIGMVYGW FVARNFVRRL VHLTQATDSL AAGDLSVRVN DGSIDEIGHL
ARRFDSMAQQ LESNVKMLRQ LADRNAALVE QAGQLAIVEE RNRLARDLHD SVSQELFSVT
MLAAAARNLL PAQPDKARSQ VEQLSQMAQR ALHETRGLIF ALRPAALGDQ GLVPALRQLT
EEAARRQGLQ IELNTNGERR IPLDHEQALY RICQEALANV TKHSGVNSAS VSLEYEAHRT
TLEVRDRGRG FDQDKPRNSH SLGLISIQER AKAVGGTVEL TAAPGQGTSL RIVVPRTQTG
LLVEPR
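
The protein sequence corresponds to the gene sequence translated with translation table 11, in which the initiating GTG codon is read as methionine. The sketch below illustrates this on the first and last blocks of the gene sequence. It is not a full implementation of table 11: the codon table is the standard one, and `START_CODONS` lists only three common bacterial initiation codons as an assumption. The function name `translate` and its `first_is_start` parameter are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); "*" marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only a subset of the initiation codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str, first_is_start: bool = True) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and first_is_start and codon in START_CODONS:
            protein.append("M")  # an initiation codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene: GTG acts as the start codon.
print(translate("GTGCAGCAGC ATCGTAAATT AACCCTCGTT"))  # MQQHRKLTLV
# Last 21 bases of the gene, ending in the TGA stop codon.
print(translate("CTACTGGTCG AGCCACGTTG A", first_is_start=False))  # LLVEPR
```

Both outputs agree with the first and last residues of the protein sequence listed above.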