Gene Anae109_4258 details


Gene Information

Locus tag: Anae109_4258
Symbol:
ID: 5375533
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. Fw109-5
Kingdom: Bacteria
Replicon accession: NC_009675
Strand:
Start bp: 4992836
End bp: 4994356
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 76%
IMG OID: 640845786
Product: histidine kinase
Protein accession: YP_001381420
Protein GI: 153007095
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
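
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values copied from the record above.
length = gene_length_bp(4992836, 4994356)
print(length)                     # 1521
print(protein_length_aa(length))  # 506
```

Both results agree with the Gene Length and Protein Length fields listed in the record.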


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 50
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCCGTCG ACCCACCGCC GGGCGTGCGA GCGTGCGCGA CCCGCCGGCG GGCACCGGGC 
GGCCCCGCAG GTAAGATGCG ACACGGATAC GGATCGGGAA CGGCGCGCCG GCTCGCGCTC
GGCTTCGGGG CGCTCGTGCT CCTGTTCGCG GCGGCGTCGG CGGTCGCCAT CGCGGGCTCC
GTCCGCATCC ACCGCGGGCT CGCCGAGATG AAGCCCCGCG AGGAAGGCGT GCGCCTCTCG
CTCGAGCTCG CGAGCGCCGT CCGCGACCAG TACGCCCACC AGGCGCACAC CATCATCATC
GGCGACGCCT CGCACCTCGG GTTCTATGAC GGGGCGCGCG AGACGGTGCT GCGGCTCACG
CGGGCCCTCC GCCAGGCGGC GGAGGAGCCG GAGGAGCGCG CCCTCGTGGA TCGGATCGAG
GCGGAGAGCA GGACGCTGGA CGCCATCTTC CGGGAGCGGA TCGTCCCGGC GGTCCTCGCA
GGCGAGCAGG GGAGCGTCAA GCTCGAGCAC GATCGGGCGC AGCTGGTCGT GACCCGGATC
CAGGACCTCA CCCAGGCGCT GGTCGAGCGG TTCGAGTCGA AGATCCGGGC GTTCCGCCGC
GACGCGGAAG GGGTCCAGCG CCGCACGCTG GCCTTCCTCG TCGGGCTGCT CGTGGCGGCG
CCCGTGGTGG CGGTGATCGT CTCGATCGTG ATCGGCCGTT CCATCGCGGC GCCCGTGGCC
CAGCTCCAGG CCGGCGCGGC GCGCATCGCG GCGGGGGAGC TCGACGCCCG GATCGAGGTG
CACGGCGCGC CGGAGCTGGA GGCGCTCGCC CGCCAGTGGA ACGCGACGAC GGCCGCGCTG
CGCGACCACC AGGAGCGGCT CGTCGAGACG GAGAAGCTCG CCGGGATCGG CCGCCTCGCC
GCGGGAGTGG CGCACGAGAT CAACAACCCG CTCGGCGTCA TCCTCGGATA CGCCAAGCTG
CTCCGGAAGA AGGCCGAGCC CGCTGCGGCC GAGGACCTCG CCGTCATCGA GGAGGAGACC
CTGCGCGCCA AGGAGATCGT GGAGGGCCTG CTCGACCTCT CGCGGCCGCT GCCGGCCGCG
GCGCAGGCGG TGGACCTGCG CGCCCTCGCG GACGACGTGG TGGCGCGGCT GCGCGAGGCG
CGCCTCCTCG ATGGCGTGGC GGTCCACGTC GACGGCGGCG CCACGGCGCC CGGCCACCCG
GACAAGCTGC GCCAGGTGCT CGTGAACCTG GTACGGAACG CCGCCGAGGC GGCCGGGCCG
GGCGGGCGCG TGGCGGTGCG GGTCGGGGCG CTGGATGGGA CGGCCGAGGT CGCGGTGGAG
GACTCCGGGC CCGGGATCGA CGCGGCGACG CGGGGGCGAC TGTTCGAGCC GTTCTTCACC
ACCAAGCCGC GCGGCACCGG GCTCGGCCTG GCGGTCTCGC GCGCCATCGC GCGGGCGCAC
GGGGGCGACC TCGCCGCGGA TCCCGCGGAG CACGGCGGCG CGCGCTTCGC GCTCAGGCTG
CCCGCGCGGG GGGAGGCGTA G
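
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be joined before any computation. As a minimal sketch, the hypothetical helpers below strip the block formatting and compute the G+C fraction, which is how a figure such as the GC content field could be reproduced from the full sequence. Only the first displayed line is used here to keep the example short.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "GTGTCCGTCG ACCCACCGCC GGGCGTGCGA GCGTGCGCGA CCCGCCGGCG GGCACCGGGC"
seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(f"{gc_fraction(seq):.0%}")   # GC percentage of this line only
```

Passing the complete gene sequence through the same two functions gives the gene-wide value. Note that the sequence opens with GTG, an alternative start codon under translation table 11, which is why the protein sequence still begins with M.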
 
Protein sequence
MSVDPPPGVR ACATRRRAPG GPAGKMRHGY GSGTARRLAL GFGALVLLFA AASAVAIAGS 
VRIHRGLAEM KPREEGVRLS LELASAVRDQ YAHQAHTIII GDASHLGFYD GARETVLRLT
RALRQAAEEP EERALVDRIE AESRTLDAIF RERIVPAVLA GEQGSVKLEH DRAQLVVTRI
QDLTQALVER FESKIRAFRR DAEGVQRRTL AFLVGLLVAA PVVAVIVSIV IGRSIAAPVA
QLQAGAARIA AGELDARIEV HGAPELEALA RQWNATTAAL RDHQERLVET EKLAGIGRLA
AGVAHEINNP LGVILGYAKL LRKKAEPAAA EDLAVIEEET LRAKEIVEGL LDLSRPLPAA
AQAVDLRALA DDVVARLREA RLLDGVAVHV DGGATAPGHP DKLRQVLVNL VRNAAEAAGP
GGRVAVRVGA LDGTAEVAVE DSGPGIDAAT RGRLFEPFFT TKPRGTGLGL AVSRAIARAH
GGDLAADPAE HGGARFALRL PARGEA