Gene Anae109_2030 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_2030 
Symbol 
ID5376742 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp2300690 
End bp2303473 
Gene Length2784 bp 
Protein Length927 aa 
Translation table11 
GC content71% 
IMG OID640843542 
Producthistidine kinase 
Protein accessionYP_001379217 
Protein GI153004892 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0591] Na+/proline symporter
[COG5002] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.182465 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCCAGG GCTGGGTGAT CGTCGTCGCC TCGTTCGCCT ACCTGGGCGT GCTCTTCGCG 
ATCGCGTACT GGGGCGACAA GCGGGCGGAG GCCGGCCGCT CGATCATCGC CAACCCGTAC
ATCTACGCGC TGTCGCTCGC GGTGTACTGC ACCACCTGGA CGTTCTACGG CAGCGTGGGT
CGCGCCGCGT CGAGCGGCAT CGGCTTCCTT CCGGTGTACC TCGGGCCGAC CCTCATGGTG
CCGCTGTGGT GGTACGTGAT GCGGAAGATC ATCCGCATCA GCAAGGCCTA CCGCATCACC
TCGATCGCCG ACTTCGTCGC GTCGCGCTAC GGCAAGAGCC AGCTGCTCGG CGGGCTCGTC
ACCGTGATCG CGGTGGTCGG GGTCATCCCC TACATCTCGC TGCAGCTCAA GGCGATCTCC
GGCAGCTTCA CCATCCTGCG GCACTACCCG GACGTCGTCA TGCCGGCCAA GGCGCTGGCG
CTCCCGTTCC TGCAGGACAC GGCCTTCTAC ATCGCGCTGA TGCTGGCGGC CTTCACGATC
CTGTTCGGCA CGCGCCACCT CGACGCGACC GAGCGGCACG AGGGGCTCGT CGCGGCGATC
GCCTTCGAGT CGGTCGTGAA GCTCGCCGCC TTCCTGGCGG TCGGGGCGTT CGTGACCTTC
GCGGTCTACC GGGGCTTCGG CGACGTCCTC GGGCAGGCGG CGAAGACGCC GGAGCTGCGC
GGGCTGTTGA CCGTGCCCGC CACGAGCGGC AGCTACGTCA GCTGGACGTT CCTCACGCTG
CTCTCGATGC TCTCGATCCT CTTCCTGCCG CGGCAGTTCC AGATCACCGT CGTGGAGAAC
GTGGACGAGG GCCACCTGGG CAAGGCGATC TGGCTGTTCC CGCTGTACCT GCTGCTCATC
AACGTCTTCG TCCTGCCGAT CGCCATCGGC GGGCTCGCGC TGTTCTCGGG CGCGAGGGTG
GACGCGGACA CGTTCGTGCT CACCCTGCCG ATGTTCCGGC GCGAGGAGGC CCTCACCCTC
TTCGCCTTCA TCGGCGGCCT CTCCGCCGGC ACCGGCATGG TGATCGTCGA GACGATCGCC
CTCTCCACCA TGGTGTGCAA CGACCTGGTC ATGCCGGTGC TGCTGCGCAT GAGGTCGCTC
CGCCTCAACG AGTGGCGCGA CGTCTCCGGC CTGCTCCTCT CCATCCGCAG GCTGGCGATC
GGCGCGATCC TGCTCCTCGG CTACGCGTAC TTCCGCGTCG CCGGCGAGGC GTACGCGCTC
GTCGCGATCG GCCTCATCTC GTTCGCCGCG GTCGCGCAGT TCGCGCCGGC GATCCTCGGC
GGGATCTACT GGCGCGGCGG CACCCGCGCC GGCGCGTTCG CGGGCCTCTC GGCGGGCTTC
GCGGTCTGGG CGTACACGCT GCTCCTGCCC TCCTTCGCCA AGTCGGGCTG GCTGCCCGCG
AGCTTCCTCA GCGAGGGGCT CCTCGGGGTC GCCCTGCTGA AGCCCCAGCA GCTGTTCGGG
CTGACGGGCA TGGACGAGAT CCCGCACGCC CTGTTCTGGA GCATGCTCGC CAACGTCGGC
CTGTACGTCG CGGTGTCCGT CGCGAGCCGG CCGGGGGTCT CGGAGACGAG CCAGGCGGCC
CTGTTCGTGG ACGTGTTCGA GCGCACGCAC GCCTTCGACC GCTCGCGGCT GTGGCGGGGC
AGCGCCTCGG TGCAGGACCT GCTGCCGCTC ACGGGGCGCT TCCTCGGACC GGAGCGGGCG
CGCGAGGCCT TCCTCGCCTA CGCGCGGCGG CGCGGCGTCG GCTCCCTCGA AGCGCTGCCC
GCGGACGCGG ACCTCGTCCA CTTCGCGGAG ACCCAGCTCG CCGGCGCCAT CGGCGGCGCC
TCGGCGCGCG TCCTCGTCGC CTCGGTGGTG CAGGAGGAGC CCCTCGGCCT CGACGAGGTG
ATGGACATCC TCGACGAGGC CTCGCAGGTG CGCGCCTACA GCCGCGAGCT GGAGCAGAAG
TCGCGCGCCC TCGAGACCGC CTCCGCCGAG CTCCGCGCCG CAAACGCCCA GCTGCAGGAG
CTCGATCGGA TGAAGGACGA GTTCATGTCC AGCGTCACGC ACGAGCTCCG CACGCCCCTC
ACCTCGATCC GGGCCTTCTC GGAGATCCTC CGCGACGATC CCAAGGCCCC GATCGCCGAG
CGGGTGAGGT TCCTCGCCAT CATCGTGAAG GAGTCCGAGC GGCTGACCCG CCTCATCGAC
CAGCTGCTCG ACATGGCCAA GATCGAGTCG GGCAACGCGG AGTGGCACGC GGCGGAGCTG
GACGTGCGGG AGGCCATCCA GGACTCGGTC GAGGCGACGA GCCAGCTCTT CCGCGACAGC
GGCGTGGAGC TCGCGGTGGC GCTGCAGCCG GCGCCGCGCG TGCGCGCCGA CCGCGACCGG
CTGGTCCAGG TGATCATGAA CCTGCTCTCG AACGCGGTGA AGTTCTGCCC TCGCGGGGGC
AGGGTGGAGG TCCGGCTCGC CCCGGCGCCG GAGGGGGTCC GGGTGGACGT ACAGGACGAC
GGCCCCGGCA TCAGCCCGGC GGACCAGGAC ATCATCTTCG AGAAGTTCCG CCAGGTGAGC
GACACGCTGA CCGGGAAGCC GCGCGGCACG GGGCTGGGGT TGCCGATCAG CCGCAGGATC
GTGGAGCACT TCGGCGGGCG GCTGTGGGTC GAGAGCGAGC TGGGACGGGG CGCGACGTTC
TCGTTCGTGC TGCCGCTGGA CGCAGCCGCG CAGGCCGCCG AGGAGCCGCG GCGCGCCGCG
CAGGGGACCG GAGCCGGCCG ATGA
 
Protein sequence
MLQGWVIVVA SFAYLGVLFA IAYWGDKRAE AGRSIIANPY IYALSLAVYC TTWTFYGSVG 
RAASSGIGFL PVYLGPTLMV PLWWYVMRKI IRISKAYRIT SIADFVASRY GKSQLLGGLV
TVIAVVGVIP YISLQLKAIS GSFTILRHYP DVVMPAKALA LPFLQDTAFY IALMLAAFTI
LFGTRHLDAT ERHEGLVAAI AFESVVKLAA FLAVGAFVTF AVYRGFGDVL GQAAKTPELR
GLLTVPATSG SYVSWTFLTL LSMLSILFLP RQFQITVVEN VDEGHLGKAI WLFPLYLLLI
NVFVLPIAIG GLALFSGARV DADTFVLTLP MFRREEALTL FAFIGGLSAG TGMVIVETIA
LSTMVCNDLV MPVLLRMRSL RLNEWRDVSG LLLSIRRLAI GAILLLGYAY FRVAGEAYAL
VAIGLISFAA VAQFAPAILG GIYWRGGTRA GAFAGLSAGF AVWAYTLLLP SFAKSGWLPA
SFLSEGLLGV ALLKPQQLFG LTGMDEIPHA LFWSMLANVG LYVAVSVASR PGVSETSQAA
LFVDVFERTH AFDRSRLWRG SASVQDLLPL TGRFLGPERA REAFLAYARR RGVGSLEALP
ADADLVHFAE TQLAGAIGGA SARVLVASVV QEEPLGLDEV MDILDEASQV RAYSRELEQK
SRALETASAE LRAANAQLQE LDRMKDEFMS SVTHELRTPL TSIRAFSEIL RDDPKAPIAE
RVRFLAIIVK ESERLTRLID QLLDMAKIES GNAEWHAAEL DVREAIQDSV EATSQLFRDS
GVELAVALQP APRVRADRDR LVQVIMNLLS NAVKFCPRGG RVEVRLAPAP EGVRVDVQDD
GPGISPADQD IIFEKFRQVS DTLTGKPRGT GLGLPISRRI VEHFGGRLWV ESELGRGATF
SFVLPLDAAA QAAEEPRRAA QGTGAGR