Gene GSU1037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1037 
Symbol 
ID2685671 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1121523 
End bp1123922 
Gene Length2400 bp 
Protein Length799 aa 
Translation table11 
GC content61% 
IMG OID637125706 
Productsensory box/response regulator 
Protein accessionNP_952090 
Protein GI39996139 
COG category[T] Signal transduction mechanisms 
COG ID[COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCAGGG AGGGCAGAGG TACACGACAA AGGAATATAA TGCGGTTGGT CAGCCAGCCG 
GGCTTGGATC CCAAAGCGTC GGTCCCTTCA TTCGACCATG ACGCCCGCCC GGTGCCTGCC
ACCTCGGCCG GCGTCTCCCG GCCGGCGATA GATCCCCCGG CCGAATGGAC CGTTCGCCAG
TACATACGTG AGAAAAAGGA TCAGCGGTTT CCCTCTCCCA GGGGCCAGCA CCGCTGTTTT
TCAGGGCTTG ACGCCCTGTG CGGCATGAAG AGCGAGGTGC CGGAACGGCC GACGGTCGTT
CTGATCATCG AGGATGACCG GCAGGTTCGC CTCCTTGCCA GGGATGTGCT TGAGGAAGCA
GGGTTTGTGG TGGAAGAGGC ATGTGACGGC CGGGAAGGCG TGGTCAAGTT CGGTCGGGTA
CGGCCCGACG TGGTGCTGCT CGACGTGATC CTCCCCTACA TGAACGGCAT GACCGTGTGC
AGGGTCTTGC GGGAGATGCC CGGCGGCGAG CGGGTCCCCG TCGTCATGAT GACGGCTCTG
GGCGATTCGG ATGCGGTTCG GGAGGCGTTC GACGCGGGAG CCACCTTTTT TGTCCCGAAA
CCGATCAACC TATCGACCCT CCCCGAACAC CTCACCTACC TTGTCAGATC AAGCCGTGTC
CTCCAGGAAC TCCATGAAAG CGAAGCGCAA AACCGGGCAT TGCTGAACGC CATTCCGGAC
ATGATGCTCC ATGTCTGCCG CGAGGGAACC ATTCTCGACT TCCGTGCCGG CAACGGTTGT
GCCTATGCGG CGTTCGGGGG AAACCTCCGC GGCAGGACCA TTGCCGATCT TCTTCCCCCC
GGCGCGGCGG CCCAGTTCAT GGCTCAAGTC GGACACGCGC TTGCCACGGG AACCATGATG
GTCTGCGAGT ATGAGCTGGA GGGCCCCGGC GGTCTCAACA CCCACGAGGC CAGGATCGTT
GCCAACGGTA CCGGCACGGT GCTCGCCATC GTGCGCGACG TGACGGAGCG GCGGCAGTCG
GAGGAGATGA TCCGCTACCT GGCGCATCAC GACAGCCTCA CGGGACTCCC CAACCGGATA
TTTTTCGGCG AAATGCTGGA GCTCGTTCTT TCCCGCGCCC GGCGCGACGG CAAGCAGGTT
GCCATTCTTT TCGTGGACCT GGACGACTTC AAGCTCATCA ACGACACCCT CGGCCACGCC
GCCGGTGACC ACCTGCTGCG GGAAGCTGCC CAGCGCCTCA AGGGATGCGT CCGGAGCAGC
GACTACGTGG CCAGCGGGGC GGAGAGCGGC CTGTGCGGCA ATATCGCCCG CTTCGGCGGC
GACGAGTTCA TCCTTTCATT GGGAAACCTG GACTCCGTTG ACGATGTGGT GGCGGTGACG
CAGCGGATTA TCAGCGAGTT TTCCCGGGAG TTCCGGATTG ATGGTCATGA GATATTCGTA
TCAGTGAGCA TCGGCATCTC CATGTATCCC GACGATGCCG AAGATACCGG CTCCTTGGTC
AGAAACGCCG ACATGGCCAT GTTTGCCGCA AAGGAAGAGG GAGGACCGTC GTTCCGCTTC
TTTACCCGGG GCATGAACGA AGCCGCCCAG CAGCGTCTCT CCATAGAACT CAGCCTGCGC
AAGGCGTTGG AACGGGGAGA GATCTTCGTC CACTATCAGC CGAAAGTTGA CCTGTCCTGT
GGGGAGATAA CCGGTTTCGA GGCCCTTGCC CGTTGGCACC ATCCGGAACT CGGACTGATC
CCTCCCGATA CGTTCATTCC CCTGGCGGAA AAGAACGGCC TGATCGTCCG GATCGGCGAG
TGGGTGCTGA GAGCCGCCTG TGCCCAGGCC AGGACATGGA TCGACGAAGG CTTCTCCGAT
TTGTGCATGG CCATTAACCT GTCGAGTCAC CAGTTCACCA GCGGCAACCT CACCTCTATG
GTAAAAGAGG TGCTTGACGA GACGGAACTG CCTCCCTCCT GTCTAGAGTT CGAAATCACC
GAGGGAATCC TCATGGAGCG GACCGAGAAA ACCATGCGGA TCCTTTCCGA GCTGCGGGAC
ATGGGAGTCA GGATATCCAT CGACGATTTC GGCACCGGCT ACTCGTCGCT GGGGTATCTC
AAGCGCTTCC CCGTGAACAC GGTCAAGATC GACCGCTCAT TCGTGCGGGA AATCGATTCA
CCCCACGAGG ATGCGGCGAT CATCAAGGCC ATCATTGCCG TCGCGCGTAA CCTGAATCTC
CAGGTGGTGG CCGAAGGGGT CGAAAGCCAT CGACAGACGG AATTTCTCCT TGCCCATGGC
TGTAACGAGG CTCAGGGGTT CCTCTTCGGC AAGCCGGTGC ATCCCGATGA GGCGAGCCTG
ATGCTCCGGG CCGGGAGATG GCGCGACGCG CCCGCACGTG CCCTGGCGCG GGGAAGGTGA
 
Protein sequence
MGREGRGTRQ RNIMRLVSQP GLDPKASVPS FDHDARPVPA TSAGVSRPAI DPPAEWTVRQ 
YIREKKDQRF PSPRGQHRCF SGLDALCGMK SEVPERPTVV LIIEDDRQVR LLARDVLEEA
GFVVEEACDG REGVVKFGRV RPDVVLLDVI LPYMNGMTVC RVLREMPGGE RVPVVMMTAL
GDSDAVREAF DAGATFFVPK PINLSTLPEH LTYLVRSSRV LQELHESEAQ NRALLNAIPD
MMLHVCREGT ILDFRAGNGC AYAAFGGNLR GRTIADLLPP GAAAQFMAQV GHALATGTMM
VCEYELEGPG GLNTHEARIV ANGTGTVLAI VRDVTERRQS EEMIRYLAHH DSLTGLPNRI
FFGEMLELVL SRARRDGKQV AILFVDLDDF KLINDTLGHA AGDHLLREAA QRLKGCVRSS
DYVASGAESG LCGNIARFGG DEFILSLGNL DSVDDVVAVT QRIISEFSRE FRIDGHEIFV
SVSIGISMYP DDAEDTGSLV RNADMAMFAA KEEGGPSFRF FTRGMNEAAQ QRLSIELSLR
KALERGEIFV HYQPKVDLSC GEITGFEALA RWHHPELGLI PPDTFIPLAE KNGLIVRIGE
WVLRAACAQA RTWIDEGFSD LCMAINLSSH QFTSGNLTSM VKEVLDETEL PPSCLEFEIT
EGILMERTEK TMRILSELRD MGVRISIDDF GTGYSSLGYL KRFPVNTVKI DRSFVREIDS
PHEDAAIIKA IIAVARNLNL QVVAEGVESH RQTEFLLAHG CNEAQGFLFG KPVHPDEASL
MLRAGRWRDA PARALARGR