Gene Anae109_1735 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_1735 
Symbol 
ID5374468 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp1954324 
End bp1956234 
Gene Length1911 bp 
Protein Length636 aa 
Translation table11 
GC content78% 
IMG OID640843243 
ProductSARP family transcriptional regulator 
Protein accessionYP_001378922 
Protein GI153004597 
COG category[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value0.271583 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGATCG AGATCCGGCT GCTCGGAGGG CTCGAGATCG CGGCGGACGG CCGTCGTGTA 
TCGCTCCCCA CGCGCAAGAC CGAGGCGCTC CTCGCCCGCC TCGCCCTCCG GCCGGGCGAG
CCGGTCTCGC GCGCGCAGCT CGCCGGACTG CTGTGGCCGG ATCGCCCCGA CGCGCAGGGA
CGCGGGAGCC TCCGCCAGGC GCTGGCCGCC ATCCGGCGGG CGTTCGAGGA GGCCGGCGCG
CCGGGCCCGT CGGCGGGCGG CGACGCCGTG TCGCTCGACG CCCGCGGCGT CTCGGTGGAC
GTCGCGCGCC TCGGCGCGGC GATCGCGGGG ACGGGCGACC TCGCGGAGGC GATCGCGCTT
CACCGCGGCC CGCTCCTCGC CGGCTTCCCC CCGGTCGAGG ACACCTTTGA CGCGTGGGTG
GAGTCGGAGC GCGCGGTGCT CGAGCGGAGG ATGCTCACCG CGATCCGCGC GGCGCTCGCC
CGCGGCGTCG CGGGGGACGA GGTCCTCGCC CTCGCCGAGG TCGCGCTGGC GATCGATCCC
GCCTTCGAGG AAGGCTGGCG CGCGCGCATG CGCGCCCTCG CGGCCCGGGG CGATCGCGCC
GGCGCGCTCC GCGAGTACGA GCGCTGCCGC GAGACGCTGC GCCGCGAGCT GTCGGTCGCG
CCCGCGCCGG AGACCGAGAC GCTGCGCCGC GAGCTGGCGG GGGAGACCCC GCGCGCCGAG
CCGTCGCCGA CCCGCACGCC GGGGCTGGCG GTGCTCCCGT TCGAGCTCCT GTCCGCCGAC
CCCGCCCACG AGGCGTTCGC GCGTGGCCTC CAGGAGGACG TGATCGGCGC CCTCTCCCGG
TTTCGAACCC TGCGGGTGGT GGCGCCTGCC GCCGGCCGCC GGGAAGCCCC GGCGGGCGTG
GACTACCTCC TCGTCGCGAC GGTGCGCGCC GCCTGCGGCA GGCTGCGCGT CGCGGCGCGG
CTCGCCGTGG CGCAGGGCGA CGCGCAGCTC TGGTCGGAGC GCTTCGAGGG AGACCTCGCG
GACGCGTTCG CGCTGCAGGA CCGCGTCGCC GAGGCCGTGG CCGGCGCCCT GGCGCTCCGC
ATCGACGAGG CCGAGCTGCG GGCCGCGGCG CGCCTCCCGC CCGAGTCGCT GGAGGCGCAC
GCCCTCTGGC TGCGCGGCAT GCAGTGCCTG AAGCGCGGGT CCCCGGAGAG CGATCTCGAG
GCGCGCCGGC TGTTCGAGCA GGCGCTCGCG CGCGATCGCG ACTACGCGCG CGCGTGGGCA
GGGCTGTCGC TCTCGCACTT CAACGACTGG AGCTGCGCCG CGTGGGATCG CTGGGACGAG
ACCGAGCGGA GGGCGTTCGA GTACGCGGCG GAGGCCATCC GCCGCGATCC CCACGACCCC
GTCACCCAGT GCATCCTCGG ACGGATCCTC CTGTACCGCA GGGAGTTCGA ACGCGCCGCC
GAGCACCTCT CGCGCGCGCA CGCCCTCAAC CCGAACGAGC CGGACGTGCT CGCGCACCTC
GCCGTCGGGT ACGCCTACCT CGGCGAGCCC GAGCGCGGGC TCGCGCTGGG CGAGGCGGCG
CGGCGGCTCA ACCCGTTCCA CGCGGACTGG TACCTGCCGT GCGTCGCCGC GAACCACCTC
GTCGCCCGCC GCCCGGGCGA GGCGCTCGAC CTCCTCGCGC GCGCGCCGGA CGGCCACGTG
GACACGCGCG CCTTCCTCGC GGTCGCCCGC GCGCACCTCG GGGACGACGC AGGCGCCCGC
GACGACGCGC GCCGGTTCGT CGAGCGGTTC CGGGCGGGGA TCGTGCGGGG GCGGCCGTTC
GCCGCCGACG AGCCGGTCCG CTGGGTGCTG CACGTCAACC CGCTGCGGCG TCCCGAGGAC
CGGGAGTGGG TCGTCGCGGG GCTCGCGCGC GCGGGGCTCG CGTGCCCTTG A
 
Protein sequence
MAIEIRLLGG LEIAADGRRV SLPTRKTEAL LARLALRPGE PVSRAQLAGL LWPDRPDAQG 
RGSLRQALAA IRRAFEEAGA PGPSAGGDAV SLDARGVSVD VARLGAAIAG TGDLAEAIAL
HRGPLLAGFP PVEDTFDAWV ESERAVLERR MLTAIRAALA RGVAGDEVLA LAEVALAIDP
AFEEGWRARM RALAARGDRA GALREYERCR ETLRRELSVA PAPETETLRR ELAGETPRAE
PSPTRTPGLA VLPFELLSAD PAHEAFARGL QEDVIGALSR FRTLRVVAPA AGRREAPAGV
DYLLVATVRA ACGRLRVAAR LAVAQGDAQL WSERFEGDLA DAFALQDRVA EAVAGALALR
IDEAELRAAA RLPPESLEAH ALWLRGMQCL KRGSPESDLE ARRLFEQALA RDRDYARAWA
GLSLSHFNDW SCAAWDRWDE TERRAFEYAA EAIRRDPHDP VTQCILGRIL LYRREFERAA
EHLSRAHALN PNEPDVLAHL AVGYAYLGEP ERGLALGEAA RRLNPFHADW YLPCVAANHL
VARRPGEALD LLARAPDGHV DTRAFLAVAR AHLGDDAGAR DDARRFVERF RAGIVRGRPF
AADEPVRWVL HVNPLRRPED REWVVAGLAR AGLACP