Gene Haur_3062 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3062 
Symbol 
ID5734934 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3868307 
End bp3869683 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content52% 
IMG OID641280206 
Productintegral membrane sensor signal transduction histidine kinase 
Protein accessionYP_001545828 
Protein GI159899581 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACTGC AACGAAAATT ATTATTAACC CACATCGCAG TTGCCCTCGT TGCAATTTTG 
CTGATTACGG CGGTGGCCAA CTTCACCGTC AATCGCTATT TTAGCGATCT TGCGGCCAAA
CAGGCCAAAC AAGCAGCCCA AGAGTTTGCG CCAACCTTGG CAACCTGCTA CCAAATTATT
GGCAGCTGGG ATTTTAATGG CCAGCGTTGT ATGCAGCTTG GGCCACGACC GTTCATGCCG
CCGCAATTTC GCCATGTCGT GGTAGTTGAT ACCGCTGGTG AGATTGTTTT TGATAGCCGT
GGACGCGGCC AAATTAATAA ACCAACCAAC ACCATTACCC AACGCGATAT TGAACGTGGC
GAATCAATTA ATGCTGAAGA TGGCACGGTG ATTGGCACGG TGATTGTGCG ACCCAATCAA
GGCCAATTTG GCGCAGATGA AGATTATTTT TTGAGTATGG TGCGGCGTAA TATTTGGTTG
GCCGGAGCAA TTACCGCCCT CTTAGCCTTG GCTATTGGCA TCGGCCTGGC GCGAACCTTG
GCCGCGCCGT TGCGCAGCCT GACTGCCGCC GTGCATCAAC TGGCTCAGGG CGAACGTTCA
GTTCAAGTTG ACGATTCGGG CAACGATGAA ATTGCCGAAT TAAGCCAAGC CTTTAACACC
ATGAGCAGCG AACTGCATCG CTCTGAGCAA GTGCGCCGCC AGATGGTTGC CGATATTGCT
CACGAATTGC GTACCCCTTT GAGCGTGCTG CAAATTGAGC TTGAAAGCAT CGAGGATGGC
GTGAGCAAGC CCACACCGGC GGTGATTAGC TCCTTGGGCG AGGAAGTACA ACAACTTAAT
CATCTGATTG AAGATTTACG CACACTTTCC TTGGCCGATG CAGGCCAGTT GACTCTCAAT
CCAATTGAGC TAGAACCCCA AGATGTGGTC AATCGTGCGG TCAATCGTAT GCAGTTGGCG
GCACGCGAAA AACAATTGGA GCTAGCCAAC GATAGCGCCG AACAGATCGA TTTGGTCCAT
GCTGATCCAT CACGCCTACA ACAAGTGCTG GTTAATCTTT TGCAAAATGC CGTTCGCTAC
ACCCCGCAAG GTGGTAAAAT TCGCGTGACC GCCCGCCAAA GTGCTGGTGA AGTTATTTTG
GGTGTTCACG ACACTGGTGC AGGCTTCGAC CCAACCGAGG CTGCCACGAT CTTCGAGCGC
TTTTATCGCA CCGATAAAGC CCGCGCTCGT GATACGGGCG GCACAGGCTT GGGCTTGGCA
ATCGTCAAAG GTCTCGTGAC CGCAATGGGT GGCCGGGTTT GGGCAACCAG TGTGCCGAAC
CAAGGTTCAA GTTTCTATGT TGCTTTACGA GCAATCAGCA CCAAGGAGGG TGTATGA
 
Protein sequence
MKLQRKLLLT HIAVALVAIL LITAVANFTV NRYFSDLAAK QAKQAAQEFA PTLATCYQII 
GSWDFNGQRC MQLGPRPFMP PQFRHVVVVD TAGEIVFDSR GRGQINKPTN TITQRDIERG
ESINAEDGTV IGTVIVRPNQ GQFGADEDYF LSMVRRNIWL AGAITALLAL AIGIGLARTL
AAPLRSLTAA VHQLAQGERS VQVDDSGNDE IAELSQAFNT MSSELHRSEQ VRRQMVADIA
HELRTPLSVL QIELESIEDG VSKPTPAVIS SLGEEVQQLN HLIEDLRTLS LADAGQLTLN
PIELEPQDVV NRAVNRMQLA AREKQLELAN DSAEQIDLVH ADPSRLQQVL VNLLQNAVRY
TPQGGKIRVT ARQSAGEVIL GVHDTGAGFD PTEAATIFER FYRTDKARAR DTGGTGLGLA
IVKGLVTAMG GRVWATSVPN QGSSFYVALR AISTKEGV