Gene GSU1940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1940 
Symbol 
ID2685487 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp2123258 
End bp2124625 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content58% 
IMG OID637126631 
Productsigma-54 dependent DNA-binding response regulator 
Protein accessionNP_952989 
Protein GI39997038 
COG category[T] Signal transduction mechanisms 
COG ID[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID[TIGR02915] putative PEP-CTERM system response regulator 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.525901 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAAAC TGCTGATCGT CGACGACAAC GAAGATATTC GCAAACAGCT GAAATGGGGC 
ATCGGCAAGG AGTACACGCT GTTCCTGGCG GCCGATGCCC GGGAGGCCAT TGATGTTTTT
CGCAAACAGC GGCCGACGGT GGTTACCCTT GACCTGGGGC TGCCACCCCA TGAGGATAGC
TCCGAGGAGG GGTTCCGCTG TCTGGAGGAG ATGCTGCGCA TCGCTCCCGA TGTGAAGGTA
ATCGTCATTA CCGGCAACGA TGGCAGGGAA AACGCGGTCA AGGCGGTTCA ACTGGGGGCC
TACGATTTTT ATCAGAAACC GATCAATCTC GACGAGTTGA AGGTGATCGT GAAGCGGGCG
TTCCACCTTC AGACGTTGGA GGAGGAGAAT CGGCGCCTCC AGAGTGCGCT GGACGGCGGC
TCCACTGAAT TCAGGGGGTT TGTCGGCCAA TGTCCCGAGA TGCAGCAGGT ATTTTCCACA
ATCCGCAAGG TAGCGGCTTC CGATATCTCG GTACTGATCC ACGGCGAGAG CGGCACGGGC
AAGGAACTGG TGGCCCGGGC GATCCACGCC ATGAGTCTGC GCAAGGACGG GCCGTTTATT
CCCATCAACT GTGGCGCCAT CCCTGAAAAC CTTCTGGAGG CGGAGCTCTT CGGACACGAG
AAGGGGGCGT TCACCGGCGC CCTCAATCGA GTGCTGGGGA AGGTCGAGTA CGCCCACAAA
GGGACCCTGT TCCTCGACGA AATCGGAGAG TTACCGCTTA ATCTTCAGGT AAAGCTTTTA
CGTTTTCTGC AGGAGAAGGT TATCCAGCGG GTCGGCGGCA GGGAGGACAT CGCCGTTGAT
GCCCGGATCG TGGCTGCCAC TAATGTTGAC ATCGCCCGCT CCATGGCCGA GGGGACCTTC
AGGGAAGACC TGTTCTACCG GATCGGTGTG GTTTCCATCA CCCTGCCTCC CCTCAGAAGC
CGGGGAGAAG ATGTAATGCT CCTGGCGAAT CTGTTCCTGA AACGATTTTC GATTGAGTTG
CGCAAGAAGA CCAAGGGGTT CAGCTCAACA TCGCGTGAAT ATCTCGAAGC CTACGCCTGG
CCCGGCAACG TGCGGGAGTT GGAGAACAAG GTCCAGCGGG CCGTGCTCAT GGCAGCGTCG
CCGATTATCG AGCCGGATGA CCTGGGATTC ACCGAACGGC CGATACCGAG GGCATCCACA
TCGCTGGAGG GCGTATCGCT CCGTGAGGCA CGGGATCGAG TTGAGCGTGA GATGGTACGC
GAGGCCATAT CAAGGTGCAA GGGGAACATT GCCCGAGCAG CCGAGGAGTT GGGGATCAGC
CGGCCAACTA TCTACGACCT GATGAAAAAA CACGGGATAA GTGGGTAA
 
Protein sequence
MEKLLIVDDN EDIRKQLKWG IGKEYTLFLA ADAREAIDVF RKQRPTVVTL DLGLPPHEDS 
SEEGFRCLEE MLRIAPDVKV IVITGNDGRE NAVKAVQLGA YDFYQKPINL DELKVIVKRA
FHLQTLEEEN RRLQSALDGG STEFRGFVGQ CPEMQQVFST IRKVAASDIS VLIHGESGTG
KELVARAIHA MSLRKDGPFI PINCGAIPEN LLEAELFGHE KGAFTGALNR VLGKVEYAHK
GTLFLDEIGE LPLNLQVKLL RFLQEKVIQR VGGREDIAVD ARIVAATNVD IARSMAEGTF
REDLFYRIGV VSITLPPLRS RGEDVMLLAN LFLKRFSIEL RKKTKGFSST SREYLEAYAW
PGNVRELENK VQRAVLMAAS PIIEPDDLGF TERPIPRAST SLEGVSLREA RDRVEREMVR
EAISRCKGNI ARAAEELGIS RPTIYDLMKK HGISG