Gene Anae109_1771 details

Gene Information

Locus tag: Anae109_1771
Symbol:
ID: 5374128
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. Fw109-5
Kingdom: Bacteria
Replicon accession: NC_009675
Strand:
Start bp: 1994247
End bp: 1997018
Gene Length: 2772 bp
Protein Length: 923 aa
Translation table: 11
GC content: 77%
IMG OID: 640843279
Product: PAS/PAC sensor hybrid histidine kinase
Protein accession: YP_001378958
Protein GI: 153004633
COG category: [T] Signal transduction mechanisms
COG ID: [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID: [TIGR00229] PAS domain S-box
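
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch makes explicit. It assumes the Start bp and End bp values are 1-based and inclusive (this is consistent with the listed gene length) and that the CDS is unspliced and ends in a single stop codon. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_length // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 1994247, 1997018
length = gene_length_bp(start_bp, end_bp)
print(length, protein_length_aa(length))
```

With the record's values this reproduces the listed Gene Length (2772 bp) and Protein Length (923 aa).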


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value: 0.357218
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGCGCC CCGTGCTCCC CGGCGCCGGA GAGGACGCGC CGGCCACCCC GCCCGGCGTC 
CTCCTCGCCC GCGCCGCCGC CCTGTTCCTC GGCTACGTGC TCCTCGGCGC CGCCGGAGAC
GCGCTCTCCG CCTACCAGGG GGCCTTCGCC ACCTACTGGC CGCCGGTCGG GCTGTATGCC
GGCGCGCTGC TGCTCGCGCG CGGGCGCGAG CGGTGGGCAT TCGTCGCCGC GGCGGTCGCC
GCGGAGCTCG TCGCGAGCCA GGCCACCGAC CGCAGCCTGA TGCTCACCGG GGTGAACGCG
GCGGTGGACC TCCTCGACGC GGTCCTGGCG GCGGCGCTCG TGTCGCGCCT CTCCGGCGGG
CGGCGCGCCA CCGGGAGCGT CCGCGGGGTC GCCGCGCTGG TGCTGCTCGG CGCCCTCGTC
GCGCCCCTCG TGCCGTCCGC GCTGGGCGCG GCCGCCGCGA TGGTGCACTA CGGCTGGAGC
GGGGCGGAGC GGTTCCTCCA GGTCTGGAGA GCGTGGTGGA CCGGAGACGC GCTGGGGATC
CTCGTCTTCG CGCCGCTGGT CGTGGCGTGG GTTCCGGGGC GAGGTCCGGG CCGCCCGCTG
CGCCTCGCGC GCGCCGGCGA GGCGGCCCTC GTGGCCGGGG CGCTGGCCGG CTCGGTCGTG
CTCGTGTTCG CAGGCGGGGT CCGCCTCGAG CGGAGCTACC TGCTGCTGCC GCCGCTCGTC
TGGGCGGGCT CGAGGTTCGG GGTTCGCGGC GCCTCGGCGG GAGCGGCGCT CGTCTCGCTG
CTCGCCGCCG GGCTCACCTC CGGGGCCCGC TACTTCGACC CGAGCGCCAC GATCGCGGCG
GCGGACGTGC AGGTGCAGCT GCTCGTCTTC ATCGGCGCCG CGACCGCGCT CGTGCTCGGC
GCGGCGCTCG CCGAGCGGGA GCGCACGCTG TCGGCGCTCG CGGAGAGCGA GGCGCGGTTC
GAGGCCTTCA TGCGCCACGC GCCCGCGGTC ATCTTCATCA AGGACGCCGA GGGGCGGATC
GTCGCCGGCA ACCCGAGCTT CGCGAGCGCG CACGGCGCCC AGGTCGGTGA CCTGATCGGC
AGGACCGCCG GGACGATGTT CGACCCCGGG CTCGCGGGGC GGATCGCGGC GACGGAGCGA
GGCGTCCTCT CGACGGGGGA GACCGCGCGG GAGGAGCTGT ACCTCTCCGG CCGCGCGTTC
GTCACCCTCA AGTTCCGCAT CCCTCGCGAC GAGCAGCCGC CGCTCCTCGG CGGCGTGGCG
CTGGACGTCA CGGACCTCCG CCGGGCCGAG CGCGCGCTGC GGCTCGCGCA GACGGCGCTC
GAGCGCGGCT ACGCGCCGGT CCTCATCCTG GATCCGGGCG GCCGCATCAC CTACGCGAAC
GACGCGGCGC AGCGGCTCTT CGGGAGGAGC GGGCCCGAGC TCGCCGGGCG CGCCGTGTGG
GACGTGGACG GTGGCTTCGC GGAGTCGGGC TGGCCCGCGC AGTGGGAGGA GATCCGCGCG
CGGGGGGCGG CCGTGCTGGA CGGGAGCGTG CGCCGTCCGG ACGGGCGCGC CAGCGCCGAG
GTGGCGGTGT CGCACCTCGC GTTCGACGGC GCCGAGTACG CCATCTACAC GGCCCGAGAT
CTCACGGATC GGCGCCGCGC GGAGGCGGCC GAGCGGCTCG CCGCCGTGGG TACCCTGGCG
GCGGGGATGG CGCACGAGAT CAACAACCCG CTCACCTTCG TCTCCGTGAA CCTCGGCTGC
GCCCGCGAGG CGCTCGCCGC GCGCGCGGGG GGGCCGGAGA TGGCCGAGGC CCTCCAGGCG
CTCGACGAGG CCGCGGAGGG CACCCGCCGC ATCGCCCGCA TCATCCGGGA CCTCGGCATC
GTGTCCCGCT CGCGGCGGGA CGGGCGCCTG GCCGTGAACG TCCACGAGGA GATCGGGGGC
GCGGCGAAGC TCGCCGAGCA CGAGATCCGG CACCGCGCGC GGCTCGTGCT GAGGCTCGAT
CCGGTCCCGC CCGTCCTGGC GTCCGAGTTC CAGCTCGGGC AGGTGGTGCT GAACCTGCTC
GTCAACGCCG CGCACGCGAT CCCCGAGGGC GCCGCCGCCA CGAACGAGAT CCGGGTGGTC
TCGCGCACCG GACCGGACGG GCAAGCGATC GTGGAGGTGT CGGACACCGG CGCGGGCATC
CCACCCGAGC TGGGGCGGCG CGTGTTCGAG CCGTTCTTCA CGACCAAGCC GCCGGGGCAG
GGCACCGGGC TCGGGCTCTC GGTCTGCCAC GGGATCGTGA GCGGGCTGGG AGGGCGGATC
GAGCTGGAGA GCGAGCCCGG GAGAGGGGCG CTGTTCCGGG TGGTCCTCCC CCCGGCGCCG
GACGAGGAGC ACCCGGCGCC GGCCCCGCCC CGCGCCGCGC GCGCCGCGCG CGCCCGCATC
CTGCTCGTGG ACGACGAGCC GCTCATCGGC AGCACCGTGC GCCGCGTCCT CGCCGAGCAC
GAGGTGGAGG TCCTCACCGA CGCGAGGGCG GCGCTGGCCC GGCTCGAGGC GGGCGAGCGC
TTCGACGTGA TCCTGTGCGA TCTCATGATG CCCGACCTCA CGGGGATGGA CCTGCACGAG
GCGCTCACCC GCGCCGCACC GGCGGTGGCC CGGCGGATGG TGTTCATCAC GGGCGGCGCC
TTCACCGAGC GGGCGCGCCG CTTCCTCGAG CGCAGCGACC TGCCGCGGAT CGAGAAGCCG
TTCGCGCCCG CCACGCTGCG CGAGGCCGTG GACGCGCTGC TCGCCCTGGG GTCCGAGCGC
CGCGCGGGTT GA
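
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be normalized before any analysis. The sketch below shows one way to strip the layout whitespace and compute the GC fraction; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example input is only the first two blocks of the sequence rather than the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the gene sequence above.
sample = clean_sequence("ATGATGCGCC CCGTGCTCCC")
print(len(sample), gc_fraction(sample))
```

Applying the same two functions to the full sequence text should give a value comparable to the GC content field in the Gene Information section, assuming that field was computed over the same 2772 bases.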
 
Protein sequence
MMRPVLPGAG EDAPATPPGV LLARAAALFL GYVLLGAAGD ALSAYQGAFA TYWPPVGLYA 
GALLLARGRE RWAFVAAAVA AELVASQATD RSLMLTGVNA AVDLLDAVLA AALVSRLSGG
RRATGSVRGV AALVLLGALV APLVPSALGA AAAMVHYGWS GAERFLQVWR AWWTGDALGI
LVFAPLVVAW VPGRGPGRPL RLARAGEAAL VAGALAGSVV LVFAGGVRLE RSYLLLPPLV
WAGSRFGVRG ASAGAALVSL LAAGLTSGAR YFDPSATIAA ADVQVQLLVF IGAATALVLG
AALAERERTL SALAESEARF EAFMRHAPAV IFIKDAEGRI VAGNPSFASA HGAQVGDLIG
RTAGTMFDPG LAGRIAATER GVLSTGETAR EELYLSGRAF VTLKFRIPRD EQPPLLGGVA
LDVTDLRRAE RALRLAQTAL ERGYAPVLIL DPGGRITYAN DAAQRLFGRS GPELAGRAVW
DVDGGFAESG WPAQWEEIRA RGAAVLDGSV RRPDGRASAE VAVSHLAFDG AEYAIYTARD
LTDRRRAEAA ERLAAVGTLA AGMAHEINNP LTFVSVNLGC AREALAARAG GPEMAEALQA
LDEAAEGTRR IARIIRDLGI VSRSRRDGRL AVNVHEEIGG AAKLAEHEIR HRARLVLRLD
PVPPVLASEF QLGQVVLNLL VNAAHAIPEG AAATNEIRVV SRTGPDGQAI VEVSDTGAGI
PPELGRRVFE PFFTTKPPGQ GTGLGLSVCH GIVSGLGGRI ELESEPGRGA LFRVVLPPAP
DEEHPAPAPP RAARAARARI LLVDDEPLIG STVRRVLAEH EVEVLTDARA ALARLEAGER
FDVILCDLMM PDLTGMDLHE ALTRAAPAVA RRMVFITGGA FTERARRFLE RSDLPRIEKP
FAPATLREAV DALLALGSER RAG