Gene GSU2314 details


Gene Information

Locus tag: GSU2314
Symbol:
ID: 2687347
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 2531957
End bp: 2534905
Gene Length: 2949 bp
Protein Length: 982 aa
Translation table: 11
GC content: 61%
IMG OID: 637127007
Product: sensory box histidine kinase/response regulator
Protein accession: NP_953363
Protein GI: 39997412
COG category:
    [K] Transcription
    [T] Signal transduction mechanisms
COG ID:
    [COG0642] Signal transduction histidine kinase
    [COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain
TIGRFAM ID: [TIGR00229] PAS domain S-box
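
The start, end, gene length, and protein length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 2531957
END_BP = 2534905
GENE_LENGTH_BP = 2949
PROTEIN_LENGTH_AA = 982


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2534905 - 2531957 + 1 gives 2949 bp, and 2949 / 3 - 1 gives 982 aa, which agrees with the listed values.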


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.581555
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCACGAGG CGGCGAAGGT GGACGCCAGG GCCCAGTTCG AGAAGGATAT CGTGTACCGG 
CGCTGGAATG CCGGTCACGG CGGAGTATAT GTGCCCGTCA CGGAGCGTGC CCAGCCCAAC
CCGTGGCTTG CCGGCCTGAA GGACCGGGAT GTGGAAACCA CCACGGGGAA ACGGCTCACC
CTCATCAATC CCGCCTACAT GACACGACAG GTCCACGAGC TCAATTTCGA AAGCTCGGGC
ATCCGGGGCC ACATCACGAG CCTCAGACCC ATTCGGCAGG CCAATGCGCC TGACAACTGG
GAAACCCGGG CATTGGAATC CTTCGAGCGG GGACAGCAGG AATACTCCTC GGTTGAACTC
ATGGACGGTG TCCCCCATCT TCGCCTGATG AGGCCGCTGA TGGTGGAGCA GTCCTGCCTC
AGGTGCCATC CCGGCCCGAG CTATTCAGTG GGCGACGTTC GCGGCGGCAT CAGTGTTTCA
GTTCCCCTTG CGCCCTATTC GGCCCTTGCC CGGAACAAGA TGGCCGCCAC AGGCGCAGGT
CATCTGACGT TCTGGGCGGC AGGGCTCGGG GGCATCATCC TGGCGGCGCG CCGGATCACC
CGCGACGACC GCTCTCTTCT CTTGCAGCAG GCACGGCTTG CCGAAAGCGA AGAGCGTAAC
CGGCTCCTGT CGGAGGTAGC TCTGGAAGGG ATCGTTATCC ACGACAGGGG CATTGTTCAG
GATATGAACG CCAGGTTTGC AAAACTGTTC GGCTACCCGC GGGAGGAGTT GGAGGGAATC
AACGTGATAT CCCTGCTGTT CCATCCTGAT GATATCGGTT CCATGATCGA TAAGATGAGC
AGGGCACATT CTGAGCCCTA CCAGGTGCGT GGGGTGAAAA AGGACGGAAC GGTCTTTGAT
GTTGAGATCG AGGGCTACAA TCTGCAGCAC GGGGACAAGT CCGTCAGGGT CGTGTCGGTG
CGGGACATCA CCGAGCGTAG GCGGGCTGAG GAGGCCTTGC GCCAAAGTGA GGCACTCTTC
AGGAATCTGT TCGAGCACCA TGCGGCCGTC AAGCTGATCA TTGATCCCGT CAGTGGCGCC
ATCGTCGACG CCAATAACGC GGCGGTGAGC TTTTACGGCT GGTCCCGTGA GCAGCTCAGG
GCCATGAATA TCCGGGACAT CAACATGCTT TCGCCCGAGG AGCTGGTGAG CGAGCTGGAA
AATACCAAAA ATATGGAGCG GATCCACTTC TTTTTCCGTC ACCGCCGGGC GGACGACTCC
GTCCGAGATG TTGAGGTGTT CAGCAGCAGG ATCGAGGTGA AGGGAAGGGA GTATCTGCAC
TCCATCGTTC ACGACGTTAC CGAGCGCAAA CGTGCGGAGG AAGAGCTCCT TCTCGCCAAG
GAACAGGCCG AATCCGCCAA CAGGGCAAAG TCGGAATTCC TCGCCAACAT GAGCCATGAG
ATCCGCACTC CCATGAACGG CGTGATCGGC ATGACCGGCC TGCTGCTGGA AACGGGGCTC
GCCGACGAAC AGCGGAGATA TGCCGAAATA GTCAGGACCA GCGCCGCATC GTTGTTGCAG
GTCATTGACG ATATTCTCGA CTTCTCCAAG ATCGAGGCCG GCAGGCTGGA GATGGAAACC
ATCGGTTTCG ACCTGCGCAC CTTGCTTGAT GACCTGGCGG AATCTCTGGC GTTCAAGGCG
AATGAAAAGG GGTTGACGTT CACCTGCCTG CTCCGGCCCG AGGTGCCCCG GTTCCTGATG
GGCGATCCGG TCCGATTGAA ACAGGTCCTG GTCAACCTCG CGGGGAATGC GCTCAAATTC
ACTCACCAGG GCGAAATCTC CGTGGAGGTC GGCTCCCTGA CTAAAACGGG CGACTCGGTC
AAGCTCCGTT TTGCCGTGCG TGATACCGGC ATCGGCATTC CGCCCGAAAA AACGGAGCTT
CTGTTTGAGA AGTTCACCCA GGCGGATGCT TCGATTACGC GTAAATACGG CGGCACTGGT
CTCGGCTTGG CCATCTGCAA ACGGCTGGTG CAGATGATGG GCGGAGAGAT CGGGGTCACC
AGCCGGCCCG GCGTGGGCTC CGAATTCTGG TTTACTGCTT CCTTCGGCAC ACAGAACCTC
CGGGACTCCT CTCCGGAGCC GGACGGCGGT GACGGTCCGC TGCAGCTGCT GAGCGATCTG
GGGCACGACG ATGTCCGTAT CCTGCTGGTC GAAGACAACG CCACCAACCG ACAGGTGGCT
CTTGGGATCA TAAAACGGCT CGGCCTGCGG GGCGATGCCG CGGCCAACGG CGCAGAGGCA
TTGGACCTGC TGGCGACGAC TCCCTACGAT CTGGTTCTGA TGGATGTTCA GATGCCTGTC
ATGGACGGGT TCGAGGCAAC CAGGCACATC CGCGATGTCC GGTCGCCGGT CCTCAACCAC
GAGATTCCCG TCATTGCCCT CACTGCCCAT GCGATGAAGG GAGACCGGCG CAAGTGCCTG
GACGCGGGCA TGAATGACTA TCTCGCCAAA CCGATCTTCC CTGACGCTCT GGCCGAGATG
TTGGTCAAGT GGCTCCCTCG CCGCTGCCGG ATTTCAAACG ACGGTGACGG GGAGCGCGCG
AACGCGGTGC CGGCTGTGCC GGATGGAACG CCCGTCTTTG ACCGTCCGGG CCTGGAGGCA
CGGCTCATGG GAGACGAATC TCTGGTGACG GAGGTCGTTG CGGCATTCCT GGCTGATATG
CCTGACCTGA TCGAACGGCT CAGGGCGGCA GTGGCTGACG GTGAGCCGGC AACCGTTGCC
CACTGGGCCC ATACCATCAA GGGGGGGGCC GCCAGTGTCG GGGGAGAGCG GCTGCGGGCG
GCCGCGGCAG CCTTGGAGTA TGCGGCGGTT GCGGGAGGCA TGGGCGGTGT CGCCTGCTGC
ATGAACATGC TGGAAATCGA ATTCGGCCGA TTATGTGAAA TCATGAGACA AGACTACGTC
AAGGAGTGA
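
The GC content field can be recomputed from the gene sequence as printed here, in space-separated blocks of ten bases. The following is a minimal sketch, assuming GC content means the percentage of G and C bases over all bases; `gc_content` is a hypothetical helper name, and the database may round or compute the value differently.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # First line of the gene sequence above; paste the full sequence
    # to compare against the GC content reported in the record.
    first_line = (
        "ATGCACGAGG CGGCGAAGGT GGACGCCAGG "
        "GCCCAGTTCG AGAAGGATAT CGTGTACCGG"
    )
    print(f"{gc_content(first_line):.1f}%")
```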
 
Protein sequence
MHEAAKVDAR AQFEKDIVYR RWNAGHGGVY VPVTERAQPN PWLAGLKDRD VETTTGKRLT 
LINPAYMTRQ VHELNFESSG IRGHITSLRP IRQANAPDNW ETRALESFER GQQEYSSVEL
MDGVPHLRLM RPLMVEQSCL RCHPGPSYSV GDVRGGISVS VPLAPYSALA RNKMAATGAG
HLTFWAAGLG GIILAARRIT RDDRSLLLQQ ARLAESEERN RLLSEVALEG IVIHDRGIVQ
DMNARFAKLF GYPREELEGI NVISLLFHPD DIGSMIDKMS RAHSEPYQVR GVKKDGTVFD
VEIEGYNLQH GDKSVRVVSV RDITERRRAE EALRQSEALF RNLFEHHAAV KLIIDPVSGA
IVDANNAAVS FYGWSREQLR AMNIRDINML SPEELVSELE NTKNMERIHF FFRHRRADDS
VRDVEVFSSR IEVKGREYLH SIVHDVTERK RAEEELLLAK EQAESANRAK SEFLANMSHE
IRTPMNGVIG MTGLLLETGL ADEQRRYAEI VRTSAASLLQ VIDDILDFSK IEAGRLEMET
IGFDLRTLLD DLAESLAFKA NEKGLTFTCL LRPEVPRFLM GDPVRLKQVL VNLAGNALKF
THQGEISVEV GSLTKTGDSV KLRFAVRDTG IGIPPEKTEL LFEKFTQADA SITRKYGGTG
LGLAICKRLV QMMGGEIGVT SRPGVGSEFW FTASFGTQNL RDSSPEPDGG DGPLQLLSDL
GHDDVRILLV EDNATNRQVA LGIIKRLGLR GDAAANGAEA LDLLATTPYD LVLMDVQMPV
MDGFEATRHI RDVRSPVLNH EIPVIALTAH AMKGDRRKCL DAGMNDYLAK PIFPDALAEM
LVKWLPRRCR ISNDGDGERA NAVPAVPDGT PVFDRPGLEA RLMGDESLVT EVVAAFLADM
PDLIERLRAA VADGEPATVA HWAHTIKGGA ASVGGERLRA AAAALEYAAV AGGMGGVACC
MNMLEIEFGR LCEIMRQDYV KE
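
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is an illustration and not the database's own method. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons), and it assumes the sequence starts in frame and contains only A, C, G, and T. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon order follows the NCBI convention: bases in the order T, C, A, G,
# with the first codon position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate DNA to protein, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # KeyError on non-ACGT symbols
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate("ATGCACGAGG CGGCGAAGGT GGACGCCAGG"))
```

This gene begins with ATG, so no special handling of the alternative start codons permitted by table 11 is needed. A gene starting with GTG or TTG would need its first residue set to M explicitly.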