Gene Haur_3020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3020 
Symbol 
ID5734877 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3814273 
End bp3816168 
Gene Length1896 bp 
Protein Length631 aa 
Translation table11 
GC content50% 
IMG OID641280164 
Producthistidine kinase 
Protein accessionYP_001545786 
Protein GI159899539 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000249854 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATTATC TTTTCGATCA TCCAACATGT ATTCGGCGTA CATCGCCTCA ACATTTAGCA 
AGTTATGCCT ATGGCGTGCG GGCAAGCTGC TCGGTGCGCC ATCCGGCTGG CGATGGAGTC
GATGACCCAC GCTTTTTATT ATTTGGCCCC CAACGTGGCA ATTATCGCCA TTGGTTCGAT
ACCTATGAAA TTAAGCCCGA TAATGCGCCC GATTGGTATG CCCTGACCTT TCCCAAGCCT
ACCAAATTAA ATGTGGTTTA TTGGATGCAT GGCCCCATGT TTGATGATCA TGGTTGGTGG
ACTTCGCTCC AGGTCGAATA TCGCGATGCT GAGGGCGCTT GGCATAAAGT TGACGATTTG
CAGATCACGC CGAATTACCA TTTTAGCCCG CAACGCGGCG ATCGTAAGCC CTTCAGTGCT
TATGCTGCTC ATTTCAATCC AGTGCGAACC AGCGCGATTC GACTGATCGG CCAGCCCAGC
GGCAAGCCGC AAATTACGAC CATGGCCTAT CTTGCCGCCG ATTGGAGCAC TTGCGAAGGC
ATCGCCGCCC ACTTGCGCCA TATTCAGCAA CCATTGCCCG CTATTTTTGA TCTGTTACCA
GCCAATACCT TGTGGGATAC CTTGGCCAGC CTGCGTGATT TAACCAAAAT TGCCTTTGAT
CTACAAACCA GCGCGGGCTT GGGACTTGAC CACTTCTTGG AGCCACAGCA TTATCAACGG
TTTCACGAAA GCCAGCGTCA ATGCTTCGCT GATGATTCGC TCTATCAATT AATTGGTCAG
CATGCTGGTT GGCGCAAGCT GGGCCAAACC ATCGAGCAAG CTCGCGAACA AGCCTATGAA
ACCAAACAAC CAGTCATCGC CCAACATCAT GGTGGTTTGG TCTGGCTGGT CGTGCCAGTC
ATTAGCAATG ATCACGTTTT AGGCACAATC GAAAATCGCA ATTTTATCGC CCAAGAGCCA
ATCGATTGGC AATGGCATCG TTGTTATGCC CAAGAATTGG GGCTGGATTG GGAGCGTTAT
CGCACGGCGT TGGAGCAAAT TAGCACATTC AGTCAACAAC AAATCAATGC CACAATCAAT
CATGTGCAGC AGCTGGTACG CCTAGCTCAA CAATTGCTCA AAGATAGCCA AGAAATTCAT
ACGCTGAAAA GTAGCGCCTT GGCTGCCGAG GTGTTTAGTC GCTCCAAAAG CACCTTTATG
AGCATGATGA GCCATGAATT ACGCACCCCG CTGAATGCCA TTATTGGCTA TAGCGAGCTG
CTCATCGACG ATCTGAGCGC AACCAACCAA CAACAATCGA CTGATGATGC CCGCCAAATT
CGCAGCGCAG GTCGCCATTT ACTGCATGTG ATCGAAACAA TTTTGCAGGT ATCCAACTTA
GAATCAGGCG CATCAAATGT ACATTACGAC GATGTTGATC TTGAAATGCT GACCAATTCG
TTGGAACTTA GTTTGCGCAC CCAATTTCAA AAACGCAGCA ATCAACTTGA AATTAAGATC
GATCCTAATG CCTCGTGGGT CTATTCAGAT AGCTCGAAAG TGCGTCAGAT TATCTTCCAA
CTGCTGAATA ATGCCAATAA ATTTACCGAT CATGGTCAAG TGCGGCTAAC GATTGCCCCT
GATAGCCTTG ACCCAGAGAT GCTCGCATTC GAGGTTTGCG ACACAGGTAT TGGCATCGAT
CATGAGCATT TGCCGCTCTT ATTTGCTGAA TTCAGCCAAC TTGATGCCTC AAGCACTCGC
CGCTATGATG GCACAGGTAT GGGCTTGGCG CTTTGTCGCC ACTTGGCGCG GCTGCTTGGC
GGCGATATTC GGGTTTGGAG CGAGCCTGGC ATTGGCTCGA CCTTTACCCT CGCCATCCCA
CGTCAAAGCG TGATGACTCC AGTTAACGAC CTCTGA
 
Protein sequence
MHYLFDHPTC IRRTSPQHLA SYAYGVRASC SVRHPAGDGV DDPRFLLFGP QRGNYRHWFD 
TYEIKPDNAP DWYALTFPKP TKLNVVYWMH GPMFDDHGWW TSLQVEYRDA EGAWHKVDDL
QITPNYHFSP QRGDRKPFSA YAAHFNPVRT SAIRLIGQPS GKPQITTMAY LAADWSTCEG
IAAHLRHIQQ PLPAIFDLLP ANTLWDTLAS LRDLTKIAFD LQTSAGLGLD HFLEPQHYQR
FHESQRQCFA DDSLYQLIGQ HAGWRKLGQT IEQAREQAYE TKQPVIAQHH GGLVWLVVPV
ISNDHVLGTI ENRNFIAQEP IDWQWHRCYA QELGLDWERY RTALEQISTF SQQQINATIN
HVQQLVRLAQ QLLKDSQEIH TLKSSALAAE VFSRSKSTFM SMMSHELRTP LNAIIGYSEL
LIDDLSATNQ QQSTDDARQI RSAGRHLLHV IETILQVSNL ESGASNVHYD DVDLEMLTNS
LELSLRTQFQ KRSNQLEIKI DPNASWVYSD SSKVRQIIFQ LLNNANKFTD HGQVRLTIAP
DSLDPEMLAF EVCDTGIGID HEHLPLLFAE FSQLDASSTR RYDGTGMGLA LCRHLARLLG
GDIRVWSEPG IGSTFTLAIP RQSVMTPVND L