Gene Anae109_4242 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_4242 
Symbol 
ID5378351 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp4971872 
End bp4973854 
Gene Length1983 bp 
Protein Length660 aa 
Translation table11 
GC content74% 
IMG OID640845770 
Productsigma-54 dependent trancsriptional regulator 
Protein accessionYP_001381404 
Protein GI153007079 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3829] Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones68 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCCCT CCGAGGTCAC CGCAGAGCGA CGCATCGACG AGGAGCTCGC CCTCCGCGCC 
GTCGTGGAGG GGACCGCCTC TGAGACCGGG CAGGAGTTCT ACCGGGCGCT CGTCAGGAAC
CTGGCCCAGG CGCTCGACAC CTACGGCGCC TGGCTCACCG AGTACGACGA GCAGCGGGAT
CGGCTCCGCG CCCTCGCGTT CTGGTTCGGG GGGCAGTGGA TGGAGGGCTT CGAGTACGGC
ATCGCCGGGA CCCCCTGCGA GACCGCGGTG CGAGAGCGGC GCGTCGTCCA CGTCCCCGAT
CGCGTCATCG AGCTCTACCC CAGCGACGAG GTGAGGTTCC CTCGCTCCGG CGTGGTCAGC
TACCTCGGCG TCCCCCTGCT CGGCGCGGAT CAGCGCGTGC TCGGGCACCT GGCGGTGGTG
GACATCCGCC CCATCCCTGC CGAGAAGCGG CTCCTCACGC TGTTCGGGAT CTTCGCGAAC
CGCGCGGTCG CCGAGGTCCG CCGCGTCCGC GTGGAGGCGG AGCTGCGCGA GCGCCAGGAG
AAGCTCTCCG GCCTCATCGG CAGCGCCATG GACGCGATCG TCGAGTTCGA CGGCGACTTC
CGCATCACCC TCCTGAACGC CGCCGCCGAG AAGACGTTCG GCTGCATCGC CGTGGAGCGC
CGGGGCCGCC CGGCGGAGGA GCTGCTCGGC GCGGCGGCGA CGGCCAAGCT GAGGGCGCTC
AGCGGCGAGC TCGAGCGGCG CGCGGGAGGT GAGCGCTCGC TCTGGATCCC CGACGGGCTG
GTGGCCGAGC CGCCTGGACG CAAGCCCTTC CCCGCGGAGG CGACGCTCTC CCGCTTCGAG
GTCCGGGGCC GGCCGTTCTT CACGCTCATC CTGAGGAACG TGGACGAGCG GGTCGAGGCC
GAGCGGCGGA TCCGCGCGCT CACCGCCCAC GCCGATTACC TGCGCGAGGA GCTCGCGGCG
GAGCACGGGT TCCACGAGAT CCTGGGGCGC AGCCCGGCCC TGCGCCGGGC GCTCCAGGAC
GTGACGCAGG TGGCGGCGAC GGAGGCGACG GTGCTCCTCG TCGGGGAGAC GGGCACCGGC
AAGGAGCTGT TCGCCCGCGC GATCCACGAC CGGAGCGCGC GCCGCGCGCG GCCGCTCGTG
AAGGTGAACT GCGCGGCCAT CCCGGCGACC CTCATCGAGA GCGAGTTCTT CGGGCACGAG
CGCGGCGCCT TCACCGGCGC GACCCAGCGG CGCGATGGCC GGTTCGCGCT CGCCGACGGC
GGCACCATCT TCCTCGACGA GATCGGCGAG CTCCCGCTGG AGCTGCAGGG CAAGCTGCTG
CGGGTCCTGC AGGAGGGCGA GTTCGAGCCG GTCGGCGCCT CGCGCACGCG AAGGGTGGAC
GTGCGCGTGG TCGCGGCGAC GAACCGCGAT CTCCAGCGGG CGGCGCGCGA GGGCACGTTC
CGCCAGGACC TCTACTACCG GCTCAGCGTC TTCCCGATCC AGCTCCCGCC GCTGCGGGAG
CGCGGCGACG ACGTCGTCCT GCTCGCCGCC GCGATGGCCG AGAAGCTCGC GCCCGGTCTC
GGGCGAAGGG TCGCCCCGCC GGACGCCGCG GACGCCGCGG CGCTCCGGTC GTATCCGTGG
CCGGGGAACG TCCGGGAGCT GCGCAACGTG GTCGAGCGGG CGATCATCAC CTCCACCGAC
GGGCGTTTGA ACCTGCACCG GTGCCTGCCC GCGCCAGCGG TCGCGCCCGC GAGCGAGCCG
GCGCCGGCCA CGCGGGGCGA CCCGGAGGTC CTGACCGACC GGCGCCTGCG GGAGCTCGAG
CGCGACAACC TGCTCGCCGC CCTCGAGCGG ACGCGCTGGA GGGTCGGCGG CAAGGACGGC
GCCGCGGCCC TGCTGAGCGT GAGCCCGTCC ACGTTGAAGT CGCGCATGAA GGCGCTCGGG
ATCGCGCGGC CCGCCGCGGG CGCCGGCGCG GGCGCCGCGC TCGGCGATCT CGCCCATCGC
TGA
 
Protein sequence
MQPSEVTAER RIDEELALRA VVEGTASETG QEFYRALVRN LAQALDTYGA WLTEYDEQRD 
RLRALAFWFG GQWMEGFEYG IAGTPCETAV RERRVVHVPD RVIELYPSDE VRFPRSGVVS
YLGVPLLGAD QRVLGHLAVV DIRPIPAEKR LLTLFGIFAN RAVAEVRRVR VEAELRERQE
KLSGLIGSAM DAIVEFDGDF RITLLNAAAE KTFGCIAVER RGRPAEELLG AAATAKLRAL
SGELERRAGG ERSLWIPDGL VAEPPGRKPF PAEATLSRFE VRGRPFFTLI LRNVDERVEA
ERRIRALTAH ADYLREELAA EHGFHEILGR SPALRRALQD VTQVAATEAT VLLVGETGTG
KELFARAIHD RSARRARPLV KVNCAAIPAT LIESEFFGHE RGAFTGATQR RDGRFALADG
GTIFLDEIGE LPLELQGKLL RVLQEGEFEP VGASRTRRVD VRVVAATNRD LQRAAREGTF
RQDLYYRLSV FPIQLPPLRE RGDDVVLLAA AMAEKLAPGL GRRVAPPDAA DAAALRSYPW
PGNVRELRNV VERAIITSTD GRLNLHRCLP APAVAPASEP APATRGDPEV LTDRRLRELE
RDNLLAALER TRWRVGGKDG AAALLSVSPS TLKSRMKALG IARPAAGAGA GAALGDLAHR