Gene Haur_3459 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3459 
Symbol 
ID5735320 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4351458 
End bp4353092 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content50% 
IMG OID641280606 
Productmulti-sensor signal transduction histidine kinase 
Protein accessionYP_001546223 
Protein GI159899976 
COG category[T] Signal transduction mechanisms 
COG ID[COG3706] Response regulator containing a CheY-like receiver domain and a GGDEF domain
[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAGTG GTCAAGAAAC GATTCTAGTG ATCGACGATA GCGAAGCAAT TCGGGTCAAA 
CTTCAGGCTC AATTGCGTTC GCTGGGCTAT CAGGTCGTGT TGGCTGAAAC TGGGCGAAAT
GGGCTAAACG CGATTACCCA ACATCATCCG CACCTCATTT TGCTCGATTA TCAGCTGCCT
GATACCACTG GGTTGGATTT GCTGCGTAAG CTGCGCACCG ATGGCAACAC CGTGCCAATT
TTGCTGATGA CTGCCGAAGG TTCCGAACGC ATCGCCGTAA CGGCCTTCAA AATGGGCGTG
CGCGATTATT TGATCAAGCC TTTTGAGCCG CAAGATGTTG CTCAGGCGAT CGATCGGGCG
CTGCGTGAGT GGCGTTTGCA ACGTGAAAAA GAAATTTTGC TTGGTCAATT ACAAGGCCAA
GTGCGCCAAT TAACGGTTTT GCATCGGGTT GGTAAGGCGG TAACTGCCCA ACTTGATGCC
AGCAATTTGC TTGAACGAAT TGTTGAAGCC TCAGTTTTTC TTTCGAATGC CGATGAAGGT
TTTGTGCAAT TAATTGATGA TAATCAGTTG GTTGTGCGAG CCTCACACAA TATTAACCCA
TTGCATTTGC GTGAGCTAAG CAAACATACC GATTATGAAT TGGCAACGCG CACAATTAAA
ACCAACAAAC CAATTCGGAT CAATTCCGAG CGCGATGGAA TTCGGGTGCA AGCCAATTAT
TTGGCCCAAG CTGTGCTCAT GGTGCCGTTG TTGGTGGGGA CTGAAGCCTT GGGCGTGCTG
ACCGTGGCTG CGACAACCCA TCGCCGCAAT TTTGATGAAG GCGATGAGCG CTTGATGCAG
ATGCTGGCCG ACTACGCCTC GATTGCCTTG CACAACGCCC GCACCTACAG CGCACTGCGC
GAAACCCAAG GTCGCTTGGT TGAAGCTGAA AAATTATCGG GCATGGGGCG CATGGCGGCT
TCGCTAGCCC ACGAAATCAA CAATCCACTG GCGATTATTC GCTCAGGCTT GGAGTTGGTG
GCGCAACAAC ACACACCTGG CACGGCGCTT GGCGATTTGG TGCAAGGACT GGATGAAGAA
GTGGCGCGGA TTGCGCGGTT GCTCTATACC TTGGTGAATT TCTATCAGCC CAATAACGAT
GGTGTGCCGC CTGATCTCAA TCATTTGATC ATTTCACTGA TGCACATCAC CAAACCACAA
CTTGATAAAG CCAATGTTAA ACTGTATCAG GAGTTGGCGA CCGATCTGCC AGCGCCAAAC
ATTAGCAGCG ATGCTTGTAA GCAAGTTTTG ATCAATTTGG TGCGCAATGC GATTGATGCG
ATGCCCGATG GCGGCAAATT GACCATTCGC ACGGCCCATC AAAAGGGTCA AATTTTTGTC
AATGTTGAAG ATAGCGGAAT TGGGATTCCG CCCGAACATC GCGAACGTAT TTTTGAGCCA
TTCTTTAGCA CTAAAGGTGT GACGGGAACG GGGCTTGGGC TTTCAGTTGT TTATGGTATT
TTGCAACAAG TTGGCGGCGC AATTTCGGTA GAGAGCATCG TGGATAAAGG CTCGAACTTT
ACCTTGCGCA TTCCGGTTGC AGCCCAACGC AGCCAATCGC CCGATCTTGA TTCCGATGAA
TTGTTGATTG GTTAA
 
Protein sequence
MASGQETILV IDDSEAIRVK LQAQLRSLGY QVVLAETGRN GLNAITQHHP HLILLDYQLP 
DTTGLDLLRK LRTDGNTVPI LLMTAEGSER IAVTAFKMGV RDYLIKPFEP QDVAQAIDRA
LREWRLQREK EILLGQLQGQ VRQLTVLHRV GKAVTAQLDA SNLLERIVEA SVFLSNADEG
FVQLIDDNQL VVRASHNINP LHLRELSKHT DYELATRTIK TNKPIRINSE RDGIRVQANY
LAQAVLMVPL LVGTEALGVL TVAATTHRRN FDEGDERLMQ MLADYASIAL HNARTYSALR
ETQGRLVEAE KLSGMGRMAA SLAHEINNPL AIIRSGLELV AQQHTPGTAL GDLVQGLDEE
VARIARLLYT LVNFYQPNND GVPPDLNHLI ISLMHITKPQ LDKANVKLYQ ELATDLPAPN
ISSDACKQVL INLVRNAIDA MPDGGKLTIR TAHQKGQIFV NVEDSGIGIP PEHRERIFEP
FFSTKGVTGT GLGLSVVYGI LQQVGGAISV ESIVDKGSNF TLRIPVAAQR SQSPDLDSDE
LLIG