Gene GSU1039 details


Gene Information

Locus tag: GSU1039
Symbol:
ID: 2688726
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 1125639
End bp: 1127339
Gene Length: 1701 bp
Protein Length: 566 aa
Translation table: 11
GC content: 63%
IMG OID: 637125708
Product: sigma-54 dependent DNA-binding response regulator
Protein accession: NP_952092
Protein GI: 39996141
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG3829] Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains
TIGRFAM ID: [TIGR00229] PAS domain S-box
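
The start and end coordinates, gene length, and protein length listed in this record agree with one another. The check can be sketched as below. It assumes 1-based coordinates inclusive at both ends, and a single terminal stop codon that is not counted in the protein length; the record states neither convention explicitly.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1125639
end_bp = 1127339

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with exactly one stop codon,
# which is excluded from the reported protein length.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 1701 bp
print(protein_length, "aa")  # 566 aa
```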


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACATACA GCATTCTCGT GGTAGACGAC GAGGAAAGCC TTCGCTACAC CTTCCGCTGC 
TTCCTGGAGC ATCAGGGCTA TGAGGTAGCG ATTGCGGCGA ACTACGAAGA GGCCTTGGCG
GCTCTGCCCG CCGGGTTCGA TCTCGTTTTC GCCGATATCG TCCTCGGCGG CTGGACCGGC
GTCGATCTGC TGCGCGAGGC CCGTTTGCGC GGTTTCGACG TTCCCTTCGT CATGGTGACC
GGCTACCCCC ACATCGATAC CGCCTCTGAA GCGCTCAGGC TCGGCGCTTA CGACTACATT
CCCAAGCCGG TGAAGCAGGA GACCATCGTC CGGGTTGCCC GCATGGCGAT CCGTCACCAC
CTGGTCTCCA AGGAGAAAGA TCAGTACCGC TCCAACCTGG AGGCAATCTT CCGCAGCGTC
AAGGATGCCA TCGTCACTGT CGACCGGGGG ATGCGGCTCA TCGCCTGCAA CGCTGCCGCC
ACCACGCTCT GCGGCATCAC GAAGGACAGG GTCGAGGCTC AGGAGCTCTT CTGCCCCGAA
GATTGCCAGC ACCAGTGCCT GAAGCTTCTG GAGGAGACCC TCGTCAAGCG TCAGCCCATC
GAAGGGCAGC GGTTCGTCTG CCGCAGACAG GGACAGCCCG ATCGCATAAT CGCCGCAGCC
ACGTCGCCCT TGCTGGATCA ACTGGAAGGT TTCGACGGAG CCGTCCTCGT GGCCCGGGAC
GAAACCCGTC TCGACCACCT GGAGCGCCGG GCTTCGGGGC GCCGTTCCCA TCTGTCGCTG
GTCGGCGGCA GCGCGGCGAT GCAGCGGCTC TATTCGCTGC TGGAGGCGCT GTCGGAGGTG
GATACCACCG TCCTCATCAC CGGCGAGAGC GGAACCGGCA AGGAGCTGGC CGCAGATGCC
CTCCACCGGG GCGGACACCG TGCGGACAAG CCCTTTGTTG CGGTAAACTG CGCCGCTCTT
CCCGAAACCC TTTTGGAGAG CGAGTTGTTC GGCCATGTGA AGGGGGCGTT CACCGGCGCG
CTCAGGGACA AGGAGGGGCG CTTCCAGAAG GCCGATGGCG GAACGATTTT CCTGGACGAG
ATCGGGGATA TATCCCCCGC CATGCAGCTG AGACTGCTGC GGGTGCTTCA GGAGAAAGTG
TTCGAGCGGG TAGGCGATGC GACGCCCATC CGGGTCGACG TGCGGATCGT GGCGGCCACC
AACCGCGACC TGGCGCAACT GGTCCGCGAA GGGCGGTTCC GGGAAGATCT CTATTACCGT
CTTAAAGTCG TGCACCTGGA ACTTCCTGCA CTCCGGGAGC GCAAAGAGGA TATCCCTTTG
CTGGTGGAGC ACTTTCTCGG GCGACTGACC GCCCGCTTCG AGAGAAATCC GCTCACCATC
TCCCAGGAGG CGCTGGACAC GCTGATGTCC CACCCCTGGC CCGGCAACGT GCGCGAACTT
GAACACGTTC TGGAACATGC GGCGGTCATC TGCCGGGGGC ACGTCATTCT GTCCGATCAT
CTGCCTGCCG ACCTTGCTCC TGCCGTGCCG GCTTCCGCGG CGCCGCGGAC GGTCAGGGTA
ACGGACGAAG CAGCCCTGAT CCGCGATGCC CTCGTCCGGG CCGGCGGCAA CAAGACCAAG
GCGGCAAAGC TGCTCGGCAT GAGCCGCCGG ACTATTTATC GCAAGCTTGA TGAACACGGC
ATCTTCGGTG ACTTTAAGTA G
 
Protein sequence
MTYSILVVDD EESLRYTFRC FLEHQGYEVA IAANYEEALA ALPAGFDLVF ADIVLGGWTG 
VDLLREARLR GFDVPFVMVT GYPHIDTASE ALRLGAYDYI PKPVKQETIV RVARMAIRHH
LVSKEKDQYR SNLEAIFRSV KDAIVTVDRG MRLIACNAAA TTLCGITKDR VEAQELFCPE
DCQHQCLKLL EETLVKRQPI EGQRFVCRRQ GQPDRIIAAA TSPLLDQLEG FDGAVLVARD
ETRLDHLERR ASGRRSHLSL VGGSAAMQRL YSLLEALSEV DTTVLITGES GTGKELAADA
LHRGGHRADK PFVAVNCAAL PETLLESELF GHVKGAFTGA LRDKEGRFQK ADGGTIFLDE
IGDISPAMQL RLLRVLQEKV FERVGDATPI RVDVRIVAAT NRDLAQLVRE GRFREDLYYR
LKVVHLELPA LRERKEDIPL LVEHFLGRLT ARFERNPLTI SQEALDTLMS HPWPGNVREL
EHVLEHAAVI CRGHVILSDH LPADLAPAVP ASAAPRTVRV TDEAALIRDA LVRAGGNKTK
AAKLLGMSRR TIYRKLDEHG IFGDFK
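
The relationship between the gene sequence and the protein sequence can be spot-checked by translating the first 30 and the last 21 bases of the gene sequence. The sketch below is a minimal illustration, not the tool used to produce this record. The helper `translate` and the names `first_30` and `last_21` are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs in the start codons it permits). The final 21 bases are in frame because the 1680 bases before them are a multiple of three.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order for bases T, C, A, G.
# Translation table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base spacing
    usable = len(seq) - len(seq) % 3    # ignore any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and last 21 bases of the gene sequence in this record.
first_30 = "ATGACATACA GCATTCTCGT GGTAGACGAC"
last_21 = "ATCTTCGGTG ACTTTAAGTA G"

print(translate(first_30))  # MTYSILVVDD
print(translate(last_21))   # IFGDFK*
```

The two outputs match the first ten and last six residues of the protein sequence, with the trailing `*` marking the stop codon that is not part of the 566 aa protein.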