Gene Anae109_3068 details


Gene Information

Locus tag: Anae109_3068
Symbol:
ID: 5375830
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. Fw109-5
Kingdom: Bacteria
Replicon accession: NC_009675
Strand:
Start bp: 3582314
End bp: 3583771
Gene Length: 1458 bp
Protein Length: 485 aa
Translation table: 11
GC content: 73%
IMG OID: 640844592
Product: histidine kinase
Protein accession: YP_001380248
Protein GI: 153005923
COG category: [T] Signal transduction mechanisms
COG ID: [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID:
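
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3582314
end_bp = 3583771

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and contributes no residue.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # 1458 485
```

Both results agree with the "Gene Length" and "Protein Length" values listed in the record.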


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.158411
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value: 0.556433
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCCGCGT CCCAGCCGGC GAGCCTCGAC GTCGCGGCAG CTTGGGAGGC GGCGTCCCTC 
CAGCGGGAGC GCTGGGAGGG GACCACCGCC GTCGTGCTCA CCGGCGTGCT GGTCCCGCTC
TGGATCGCGT TCGACGCGTA CCTCGAGCCG GCGCTCCTGA ACCCCTTCAC GGCCATGCGG
CTCGCCGCGG CGGCCGTGGC GTGGGCCGTG GTGCTCGTCG TCCGCCGGGT CGGGACCACC
CGCGCGCTGC GGACGTGGGT CGCGGCCGAG CTGGCGTACA GCGGCGCGAC CATCGCGCTC
ATGCTCCCGC ACGTCCGGCA CTTCCCGGCC TACGTGTTCG GGTTCTCGCT CTACTTCTGG
GGCGTCGGGG CGCTGTTCTC CTGGCCGACG CGGTGGGCGG CGGGGCTCTT CTCGTGGCTC
GTCGCCGTCA TGTGCGCGGG GTTCGCGCTC TGGCCCGGGA TGCGCGAGGC CGCCGACTAC
GTGGCGGTGG GGTTCTACAT CGGCAGCGCG GGGATGATCG CGACCATCAT GGTCTGGGTG
CGCGGCCGGC TCGTGCGCGA CGCCTTCGAC GCGTCGCACG CGCTGGCCAA GCGCAACGCC
GACGTGGAGC GCACGCTGGA GCAGCTCCGC GACGCGCAGT CACGGCTGGT CGCCTCGGAG
AAGCTGTCCG CGCTGGGGCG GCTCCTCGCC GGCCTGTCGC ACGAGATCAA CAACCCGCTC
AACGTCCTGC ACAACAACCT CGAGCCGCTG CGCGCCTCGA TGGACGGCCT CCTCGGAGTG
GCGCGGGCCG CGGAGGAGGC GACCCCCGCG GACGTGGACG CGCTGCGACG GCGGTGCGCG
GAGCTGGACG TCGCGACCAC CGCCGCCGAC GTCCGCGACG CGACGGACAT GATGCGCGCG
GCGATGGAGC GGGTCCGGCA GGTGCACGCG GACCTGCGCT CCTTCATCCG CGGCGACGCT
CCGGACATGG TGCTCGACGA TCCGAGCCAG GGGCTCCGCG CGACCGCCAC GCTCCTCTCG
CGCCGGCTGC CCGAGGGCGT GCACGTCACG GTCGAGGTCG GTCCGCTCCC GCGCATCACC
TACCAGCCGG GACAGCTCAA CCAGGTCTGG CACAACCTCA TCCAGAACGC CCTGGACGCC
GTCGGCGCCG ACGGGACGGT CGCGGTGGCG GCGCGAACGG CGGGCGATCG GATCGAGGTG
ACCGTGACGG ACAGCGGGCC GGGCGTGGCC GCCGAGCATC AGGCGCGCCT GTTCGAGCCC
TTCTTCACGA CCAAGGGCGT GGGCAAGGGC ACCGGCCTGG GGCTCGCGAC GAGCTACCAG
ATCGTCGAGC GCCACGGCGG CACGATGTTC CTCGACGGCG TCCATACCGG AGGAGCGCGC
TTCGTGGTCT GGCTCCCGGT GCATGCGGTG AGGCTCGGCC CCCCACTCGG CCTCGACGCG
CATCGCCCCA CGGGCTAG
 
Protein sequence
MSASQPASLD VAAAWEAASL QRERWEGTTA VVLTGVLVPL WIAFDAYLEP ALLNPFTAMR 
LAAAAVAWAV VLVVRRVGTT RALRTWVAAE LAYSGATIAL MLPHVRHFPA YVFGFSLYFW
GVGALFSWPT RWAAGLFSWL VAVMCAGFAL WPGMREAADY VAVGFYIGSA GMIATIMVWV
RGRLVRDAFD ASHALAKRNA DVERTLEQLR DAQSRLVASE KLSALGRLLA GLSHEINNPL
NVLHNNLEPL RASMDGLLGV ARAAEEATPA DVDALRRRCA ELDVATTAAD VRDATDMMRA
AMERVRQVHA DLRSFIRGDA PDMVLDDPSQ GLRATATLLS RRLPEGVHVT VEVGPLPRIT
YQPGQLNQVW HNLIQNALDA VGADGTVAVA ARTAGDRIEV TVTDSGPGVA AEHQARLFEP
FFTTKGVGKG TGLGLATSYQ IVERHGGTMF LDGVHTGGAR FVVWLPVHAV RLGPPLGLDA
HRPTG
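
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the codon assignments of NCBI translation table 11 (the same amino acid assignments as the standard code, with GTG among the permitted start codons, read as methionine at the first position). The function name `translate_cds` and the constants are hypothetical helpers written for this illustration; they are not part of the source database or of any library.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# Start codons permitted by translation table 11 (assumption stated above).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace, until the first stop."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon ends translation
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an alternative start codon is read as methionine
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
print(translate_cds("GTGTCCGCGT CCCAGCCGGC GAGCCTCGAC"))  # MSASQPASLD
```

The output matches the first ten residues of the protein sequence. Passing the final blocks of the gene sequence, `CATCGCCCCA CGGGCTAG`, yields `HRPTG`, which matches the end of the protein sequence.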