Gene Anae109_4054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_4054 
Symbol 
ID5375769 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp4741878 
End bp4743533 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content74% 
IMG OID640845581 
Producthistidine kinase 
Protein accessionYP_001381216 
Protein GI153006891 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase
[COG0784] FOG: CheY-like receiver 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCGTC CCGATCGACC CTCCGTGCGG CTCGCGCAGT TGCTGGACAT GTCGATCGTC 
CAGCGGCTCG CGGAGGCGAA CCACAGGGCG TACGGGATGC CGATCGGCAT CGTGGACGCC
TTCGACGGCT CCATCCTCGT CGGCTGCGGG TGGCAGGAGA TCTGCCTCGA CTTCCACCGC
GCCAACCCCG CGTCGCGCGA GCGCTGCCGG GAGAGCGACG ACTTCATCAA GAGCCACCTC
ACGCCGGGCG AGGCCTGCGC GTACACCTGC AAGAACGGGA TGCGCGACAT CGGCGTCCCC
ATCGTCGTCG CCGGGGAGCA CCTCGCCACG CTCTTCCTCG GCCAGTTCTT CTACGAGAAC
GAGTCGCCGG ACCGGGAGTC CTTCGTCCGG CAGGCGAAGG TCTTCGGGTA CGACGAGGCC
GCCTACCTCG CCGCGCTCGA CCGCGTCCCG ACCTTCGACC GCCGGTCGGT CGACGACATC
GTCGCCTACG ACCGCGCCCT CGTTCGGTTC ATCGGCGAGC TCGCCGAGGG CGCGCTGCGG
CAGCGTCAGG CGGAGGAGGC GCTGCGCGAG GCCGACCGGA GGAAGGACGA GTTCCTGAGC
TTGCTCTCGC ACGAGCTGCG GAACCCGCTC GCGCCCATCC GCAACTCCAT CTACGTCCTC
GAGCACGCGG AGCCCGCCGG AGAGCAGGCG CGGCGCGCGC GGGCCGTCAT CGAGCGCCAG
ACCGACCACC TCACGAAGCT GGTGGACGAC CTGCTCGACG TGACCCGCAT CGCGAAGGGG
AAGATCGCGC TGCGCCGCGC GCGGCTGGAC CTCGCGACCG TCGCGCGCCG GACCGGCGAG
GACCTGCGCT CGATCGTGGT CGCGCGGGGC CTCGAGCTCG TGCTCGAGCT CCCTTCCGCG
CCCGTGCTCG TGGACGGGGA CGAGACGCGC CTCGGGCAGG TGCTCGGCAA CCTCCTGCAG
AACGCCGCGA AGTTCACGCC GGCCGGCGGG CGGATCGTCC TGTCGGCGCG CGCCGGCGGC
GGCGTCGCGG AGATCCGCGT CCGCGACACG GGCGTGGGCA TCGCGCCCGG GATGCTGGAG
CGGGTGTTCG AGCCGTTCGT CCAGGCCGAG AGCTCGCTCG CGCGGACCGA CGGCGGGCTG
GGGCTCGGCC TCGCGCTCGT GAAGGGCCTC GTCCAGCTGC ACGGCGGGGA GGTCCGCGCG
GAGAGCGCCG GGCGAGGCGA GGGGACCGAG GTGATCGTCC GCTTCCCCCT CGCGCCCCCC
GCGCTCGCGC GCGACGCGAC CGCCGAGGGT CCGGCCCCGG CGTCGCGCGG GCGGCTCGTC
CTCGTGGTGG ACGACAACGT GGACGCCGCG GACTCGCTCG CCGAGCTCGT GGAGATGCTC
GGCCACTCGG CCATGGTCGC CTACGACGGG CCGAGCGCCA TCGCCAGGGC GGCGGCGAAC
CCGCCCGACG TCGTGCTGTG CGACCTCGGC CTGCCCGGCA TGAGCGGCTA CGAGGTGGCC
CGCGCCCTGC GCGCCGGCAG GGGCGACGAC CTGCGGCTCG TCGCGGTCAG CGGGTACGCG
CAGCCGGAGG ACGTGAAGCG CGCCGCGGAG GCCGGCTTCG ACCGCCACGT CGCGAAGCCG
ACGGATCCCG GCGTCATCGA GCGGATCCTC GGGTGA
 
Protein sequence
MTRPDRPSVR LAQLLDMSIV QRLAEANHRA YGMPIGIVDA FDGSILVGCG WQEICLDFHR 
ANPASRERCR ESDDFIKSHL TPGEACAYTC KNGMRDIGVP IVVAGEHLAT LFLGQFFYEN
ESPDRESFVR QAKVFGYDEA AYLAALDRVP TFDRRSVDDI VAYDRALVRF IGELAEGALR
QRQAEEALRE ADRRKDEFLS LLSHELRNPL APIRNSIYVL EHAEPAGEQA RRARAVIERQ
TDHLTKLVDD LLDVTRIAKG KIALRRARLD LATVARRTGE DLRSIVVARG LELVLELPSA
PVLVDGDETR LGQVLGNLLQ NAAKFTPAGG RIVLSARAGG GVAEIRVRDT GVGIAPGMLE
RVFEPFVQAE SSLARTDGGL GLGLALVKGL VQLHGGEVRA ESAGRGEGTE VIVRFPLAPP
ALARDATAEG PAPASRGRLV LVVDDNVDAA DSLAELVEML GHSAMVAYDG PSAIARAAAN
PPDVVLCDLG LPGMSGYEVA RALRAGRGDD LRLVAVSGYA QPEDVKRAAE AGFDRHVAKP
TDPGVIERIL G