Gene Anae109_4372 details

Gene Information

Locus tag: Anae109_4372
Symbol:
ID: 5375240
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. Fw109-5
Kingdom: Bacteria
Replicon accession: NC_009675
Strand:
Start bp: 5114304
End bp: 5116229
Gene Length: 1926 bp
Protein Length: 641 aa
Translation table: 11
GC content: 74%
IMG OID: 640845900
Product: signal transduction histidine kinase, nitrogen specific, NtrB
Protein accession: YP_001381534
Protein GI: 153007209
COG category: [T] Signal transduction mechanisms
COG ID: [COG3852] Signal transduction histidine kinase, nitrogen specific
TIGRFAM ID: [TIGR00229] PAS domain S-box
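
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the record does not state either convention, although its numbers are consistent with both. The function names gene_length and protein_length are hypothetical helpers introduced here for illustration only.

```python
# Values copied from the record above.
START_BP = 5114304
END_BP = 5116229
GENE_LENGTH_BP = 1926
PROTEIN_LENGTH_AA = 641


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Compare the computed values with the reported fields.
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 5116229 - 5114304 + 1 gives 1926 bp, and 1926 / 3 - 1 gives 641 residues, matching the reported Gene Length and Protein Length.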


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.755089
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.0153537
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGACGA TCCTCATCGT CGATGACGAG CCCGTCATCC TCGACGTCTT CCGCCGCTTC 
CTCGAAGGCG ACGGGCGCAG GCTCCTGCTC GCCGGCTCCG TCCGCGAGGC GCTCGCCATC
GCCGCCGAGG CCCGCGAGAT CGACGTCGCC ATCATCGACA AGAACCTGGG CGACGGCTCG
GGCCTGGACG TCGCGCGCGG GCTCAAGGCC GTGAAGCCAG ACGCGGAGGT GATCCTCGTC
ACCGGCTACG CCTCGATCGA CTCCGCCATC GCCGCGGTCC AGATCGGCGC CTACGACTAC
GTGACGAAGC CGGTCTCGGA CTACGACGCG CTGAACCTCA AGGTGCAGAA CGCGATCGAC
AAGGGCCGCA TGCGGCGCGA GCAGCGCGAT CTCGTCGCGC GCCTCATGGA GAGCGAGTCC
CTGCACCGCG GCGTCTTCGA GACCTCGTCC GACCCGATCC TGCTGGTGGA CGTGGACTCC
GGCCGCATCG GCGACGCGAA CCCCGCCGCG GAGCGGCTCT ACGGCGAGCC GCGCGAGCGG
CTGCGGGAGC GGTGCTACGG CGAGCTGCAG CCGCCCGGGA AGGACGGCGA GGTGGCCACC
CTCCCCGCGC CCGGCGCCCG CTCGCTCCCG GCGCGCCATC GCCGGCCGGA CGGCTCGGAG
CTGCCGGTGG AGCTCACCGC CGGCGAGCTG CGGCTCCAGG ATCGCTCGCT GCGCGTGCTC
TCCATCCGCG ACGTCTCCGA GCGCGAGCGC GCCGAGGAGG CGCGGCGCGC GCTCGAGCAG
AACCTCCGCC AGGCGCAGAA GATGGAGGCG GTCGGGCGGC TCGCGGGCGG CGTCGCGCAC
GACTTCTCCA ACGTCCTCGC GGTGATCCTG GGCTACTCCG AGCTGCTCAT GCGCGATCTG
CCGGCGGGCG ACGCGCGGTC GCGCGAGAGC GCGGAGGGCA TCGTCGAGGC GGCGCACCGG
GCGGCGGGGG TGACGCGCCA GCTCCTCACC CTCTCGCGCA AGAAGCTGCT GCGCCCGGAG
GTCCTCTCGC TCAACAAGGT GGTGCAGGAC CTCGGCAAGC TCCTCGCCCG CGCCATCGGC
GAGCGCATCG AGCTCACGAC GCGGCTGCAG GACGGCCTCT GGCCGGTGCT CGCCGACGCG
GACCAGCTCG CGCAGGTGCT GCTCAACCTG GCGGTGAACG CGCGCGACGC CATGCCCGAC
GGAGGCCCGC TCGCGATCGA GACCGCCAAC GTCGAGCTCT CGGAGCCGCC GCGCGACCTC
CCCATCCCCG CGGGCCGCTA CGTGTCGCTG GCCGTGAGGG ACGGCGGCTG CGGCATGACG
GAGGAGGTCC GCTCCCGCAT CTTCGACCCC TTCTTCACCA CCAAGGAGAC CGGCACCGGC
CTCGGGCTGG CCACCGTCTA CGGCATCGTG CGCCAGGCGG GCGGGGCCAT CCGGGTCGAC
TCGGAGCCGG GCCAGGGCGC CACGTTCACG GCGTTCCTCC CGGTGGGCGC CGAGCAGGGC
GGCCGGGCGC ACGTCCCCGT CGCGGCGGCC GCGCCGCGCG GCCTCGGCGA GGTGGTCGTG
CTCGCGGAGG ACGAGGACGC GCTGCGGGTC CTGCTCGGCC GCGTGCTGGC CGGGAGCGGC
TACGAGGTCG TCGCCGGGCG CAACGGGGCG GAGGCCCTGC AGGCGGCGCG CGAGCGGGGA
GGCCGGGTGG ATCTCCTCCT CGCCGACCTC GTCATGCCGC GCATGACCGG CGCGGAGCTG
GCGCACGCGC TCCAGGGGGA GCAGCCGGCC ATGAAGGTGC TCTTCATGAC CGGCCACACC
GAGGACGCCC TCGTCGAGGA CCGGCTGCGC GACGGCGACG TCGAGCTCAT CCAGAAGCCG
TTCACGAGCG AGATCCTGCT CGGCCACGTC CGGCGGCTGC TCGGGCCGGC GAAGGCGAGC
GCGTGA
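
The gene sequence above is displayed in blocks of 10 bases, 60 per row, so the whitespace has to be removed before the sequence is used elsewhere. The sketch below shows one way to do that in Python and to compute a GC percentage that can be compared with the reported GC content field. The helper names clean_sequence and gc_percent are hypothetical, and how the database rounds its own figure is not stated in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # First row of the gene sequence, copied from the record.
    first_row = (
        "ATGGCGACGA TCCTCATCGT CGATGACGAG "
        "CCCGTCATCC TCGACGTCTT CCGCCGCTTC"
    )
    print(len(clean_sequence(first_row)))
    print(round(gc_percent(first_row), 1))
```

Pasting the full gene sequence into the same functions gives the whole-gene value.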
 
Protein sequence
MATILIVDDE PVILDVFRRF LEGDGRRLLL AGSVREALAI AAEAREIDVA IIDKNLGDGS 
GLDVARGLKA VKPDAEVILV TGYASIDSAI AAVQIGAYDY VTKPVSDYDA LNLKVQNAID
KGRMRREQRD LVARLMESES LHRGVFETSS DPILLVDVDS GRIGDANPAA ERLYGEPRER
LRERCYGELQ PPGKDGEVAT LPAPGARSLP ARHRRPDGSE LPVELTAGEL RLQDRSLRVL
SIRDVSERER AEEARRALEQ NLRQAQKMEA VGRLAGGVAH DFSNVLAVIL GYSELLMRDL
PAGDARSRES AEGIVEAAHR AAGVTRQLLT LSRKKLLRPE VLSLNKVVQD LGKLLARAIG
ERIELTTRLQ DGLWPVLADA DQLAQVLLNL AVNARDAMPD GGPLAIETAN VELSEPPRDL
PIPAGRYVSL AVRDGGCGMT EEVRSRIFDP FFTTKETGTG LGLATVYGIV RQAGGAIRVD
SEPGQGATFT AFLPVGAEQG GRAHVPVAAA APRGLGEVVV LAEDEDALRV LLGRVLAGSG
YEVVAGRNGA EALQAARERG GRVDLLLADL VMPRMTGAEL AHALQGEQPA MKVLFMTGHT
EDALVEDRLR DGDVELIQKP FTSEILLGHV RRLLGPAKAS A