Gene GSU0474 details

Gene Information

Locus tag: GSU0474
Symbol:
ID: 2686193
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 503803
End bp: 506181
Gene Length: 2379 bp
Protein Length: 792 aa
Translation table: 11
GC content: 61%
IMG OID: 637125141
Product: sensory box/GGDEF family protein
Protein accession: NP_951533
Protein GI: 39995582
COG category: [T] Signal transduction mechanisms
COG ID: [COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain
TIGRFAM ID: [TIGR00229] PAS domain S-box; [TIGR00254] diguanylate cyclase (GGDEF) domain
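
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # subtract the stop codon


# Values copied from the record above.
start_bp, end_bp = 503803, 506181
length = gene_length_bp(start_bp, end_bp)
protein = expected_protein_length(length)
print(length, protein)
```

Under these assumptions the computed values agree with the listed Gene Length (2379 bp) and Protein Length (792 aa).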


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATTTCA CAAAAATCTC CGCACTCCAC TACCGCCTTC CTTCACCGGT GCGGCAGTTC 
ATCGTGCTGC TTTCGATCAT TTTTCTGATC GAAATGCTGC TCATGGCGGT CCTGCCCCAA
GTTACCGGCC GTGAAGTCGA GTTTAAGGAT GCACTTGTCG ACAGCATGGG GCTCGTCACG
GTTGCCGCGC CGTTTCTGTG GGCTTTCATT GTGCGTCCCC TGCGCCGCAC GGCCGTTGCG
GCCGTTTTCC GGGAGGAGGT ACTCCTCCGC CAGATGGTCG ACGGGGTGAT TACTTTCGGG
GAAGACGGCA CGATCCGTTC CCTCAATCCG GCCGCCGAGC GGATGTTCGG CTATCGGGAC
GATGAGGCCG CCGGCATGGC CATCGACTGC CTGCTGTCGG CCGACGGGGG CTATTTCCGA
TTTGCGGTAC AGGCATCCGG CGGCCATGGC ACACGCCAGC TCGCCTATGA ACTGGAAGGG
ATCCGCCGGG ATGGCAGCCG CTTCGTTGCC GATCTTTCCG TCAGCCGGAT CGTTTTTGAA
GATCACCGGG CAGTGATCGG CATTGTCCGG GACATCACCG CCCGCAAACG GGACGAGCAG
AATCTCCTCG TCTTCAAACG GGCCATCGAG TCGAGCGTAA ACGGCATCAC TATCACCGAC
GCGACTAACG GCGAAAACCT AATCATTTAC GTGAATCCTG CATTCGAGCG GATGACCGGC
TACGCGGGAC ACGAGGTTCT TGGGAAGAAC CCGCGTTTTC TCAGGGGGGA TGACCGGGAC
CAGGTGGAAC TCAGGAAGCT GGCCATGGCC CTGGAGGAAC GCAGGGAGGG CTACTTTGTC
CTGCGCAACT ACCGCAAGGA CGGCAGCCAG TTCATGAACG AACTTTACGT GGCGCCGGTG
CGCGACCGTG ATGGCGCAGT CACCAATTAC ATCGGCATCA TGAATGATAT CAGCGACCAG
CGGCGCTACG AGGAACAGCT GGTCTACCAG GCCACTCACG ACCCCCTGAC CGGCCTTCCC
AACCGTAATC TCCTGCAGGA CCGCCTTGGG CAGGCGCTCG CTCTGGAATC CTTCCGCCGT
CGCAATCCCA TCGGCGTCAT GTTCCTGGAT CTCGACAACT TCAAAAAGAT CAACGATACC
CTGGGGCACA CGGTGGGAGA CATGCTGCTC AAGGCCGTCG CCAACCGCCT GCGCAACTGC
GTGCGCGGCG GCGACACGGT TTCGCGGCTG GGAGGGGACG AATACATCCT TATCCTTCCC
AATGTAAAGG AAATGCATGA CGTGACCACT GTGGCCAAGA AGCTGCTCGG CGTATTTTCC
ACGCCGTTCC TGCTCATGGG ACACGAGCTC TACATCACGG CAAGCATCGG TATCACCCTC
TTCCCCTCCG ATGGCGACAC GGTGGATGCG CTCCTCAAGA ATGCCGATGC CGCCATGTAT
CATGCCAAGG AGCAGGGGAA GAACAACTAC CAGTTCTATT CAGAGGAGAT GAACACGCGG
GTCTTCGAGC GGATGGCTCT GGAAACGAGC CTCCATCGGG CGATTCGGCA GCATGAGTTT
CTGCTCTGCT ACCAGCCTCG GGTCGATCTG CGGACCGGAA GGATCAGCGG GGTGGAAGCC
CTTGTGCGTT GGAACCACCC GGAGATGGGG CTCGTGCCGC CGGCCAGGTT CATCCCTCTG
GCCGAGGAAA CGGGGCTCAT CGTGCCCATC GGCGAGTGGG TCCTGCGCAC CGCCTGCGCC
CAGAACAAGG CATGGCAGGA TGCGGGGCTC CCGCCACTGA GGATGGCGGT GAACCTTTCG
GCCCGGCAGT TCCGTCAGGA AAATCTCATC CAGATGGTCG CCGACGCCTT GGCCGAAACG
GGTCTCGACC CCCGCTGGCT GGAGCTGGAG CTGACCGAAA GCCTCCTCAT GGAGCGGGCC
GAGCAGTCCG TGTCGATCCT CCGCTCCCTG GCGGATATGG GGATCGACAT CGCCGTGGAC
GATTTCGGCA CCGGCTATTC GTCTCTGGGG TATCTCAAGC GGTTCCCGAT TACGAACCTG
AAGATCGATC AATCGTTCAT CCGCGACATA GCGAGCGATC CGGACGACGC CATTCTGGTG
CGGACCATCA TCACCATGGC CCACGGCCTC GGCATGAAGA CCGTCGGCGA AGGGGTCGAA
TCTCTTGAGC AGATTGATTT CCTCTATCGG CACGGCTGCG AAGAAGTGCA GGGATACTAT
TTCAGCAGGC CGCTTACCGC TGAGGGATGC GAGGAGCTGC TTCGCGAGGA GAGGTTTCTG
GATCTTCGGG CGCTCCGGGA TGGGTTGCCG GGCCAGAGGA GCGAGGAGCG CATTCGCGCC
CTGGCGTCAT CCGCTGATGA CTGCCGCGTT ACAGCGTAA
 
Protein sequence
MDFTKISALH YRLPSPVRQF IVLLSIIFLI EMLLMAVLPQ VTGREVEFKD ALVDSMGLVT 
VAAPFLWAFI VRPLRRTAVA AVFREEVLLR QMVDGVITFG EDGTIRSLNP AAERMFGYRD
DEAAGMAIDC LLSADGGYFR FAVQASGGHG TRQLAYELEG IRRDGSRFVA DLSVSRIVFE
DHRAVIGIVR DITARKRDEQ NLLVFKRAIE SSVNGITITD ATNGENLIIY VNPAFERMTG
YAGHEVLGKN PRFLRGDDRD QVELRKLAMA LEERREGYFV LRNYRKDGSQ FMNELYVAPV
RDRDGAVTNY IGIMNDISDQ RRYEEQLVYQ ATHDPLTGLP NRNLLQDRLG QALALESFRR
RNPIGVMFLD LDNFKKINDT LGHTVGDMLL KAVANRLRNC VRGGDTVSRL GGDEYILILP
NVKEMHDVTT VAKKLLGVFS TPFLLMGHEL YITASIGITL FPSDGDTVDA LLKNADAAMY
HAKEQGKNNY QFYSEEMNTR VFERMALETS LHRAIRQHEF LLCYQPRVDL RTGRISGVEA
LVRWNHPEMG LVPPARFIPL AEETGLIVPI GEWVLRTACA QNKAWQDAGL PPLRMAVNLS
ARQFRQENLI QMVADALAET GLDPRWLELE LTESLLMERA EQSVSILRSL ADMGIDIAVD
DFGTGYSSLG YLKRFPITNL KIDQSFIRDI ASDPDDAILV RTIITMAHGL GMKTVGEGVE
SLEQIDFLYR HGCEEVQGYY FSRPLTAEGC EELLREERFL DLRALRDGLP GQRSEERIRA
LASSADDCRV TA
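
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a listing could be joined back into a single string and checked against the GC content field follows. The helper names are hypothetical, and GC content is assumed here to mean the fraction of G and C bases over the full gene sequence; the database may compute it differently.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


def ends_with_stop(seq: str, stops=("TAA", "TAG", "TGA")) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in stops


# First row of the gene sequence, copied from the listing above.
first_row = "ATGGATTTCA CAAAAATCTC CGCACTCCAC TACCGCCTTC CTTCACCGGT GCGGCAGTTC"
row = clean_sequence(first_row)
print(len(row), round(gc_fraction(row), 2))
```

To check the whole gene, pass the complete Gene sequence block to clean_sequence, then compare len() with the Gene Length field and gc_fraction() with the GC content field.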