Gene GSU1941 details

Gene Information

Locus tag: GSU1941
Symbol:
ID: 2685496
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 2124642
End bp: 2126675
Gene Length: 2034 bp
Protein Length: 677 aa
Translation table: 11
GC content: 61%
IMG OID: 637126632
Product: sensor histidine kinase
Protein accession: NP_952990
Protein GI: 39997039
COG category: [T] Signal transduction mechanisms
COG ID: [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID: [TIGR02916] putative PEP-CTERM system histidine kinase
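
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon; neither convention is stated in the record, but both are consistent with the listed values. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 2124642
end_bp = 2126675

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, "bp")
print(protein_length_aa, "aa")
```

Under these assumptions the computed values agree with the listed Gene Length (2034 bp) and Protein Length (677 aa).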


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGTTGG TCCTTTCAGC GGCGGCGGTA CTGCTCTCCC TGGCCGTAAT CGGAGTCACC 
ATCCGCCGTC AGGGGCTTAC CCTTTCAGTC TGCGCGACAG TTGCCGCCCT GACGGCGGCT
TCGGCTCTCG AAGTCTTGGA CCTTGTCGTA ATGCTCGACC CGGATCGACT GTTTGAATGG
AAACGGTGGG TTCTGGGGGT CGAATCCCTT TTGCCTGCGG CTTGGCTTAC CTATAGCCTC
ACCCACTCCC GGCGGACTGC GACATCTGCT GTGCCCCGGT TGCAGCGGTT TTTCCTGGCC
GCTACGGCTG CGTTCCCACT GGCCGCTCTG CTGCTGCCGC CGGAGGTTTT CTTCTATTCA
CCGGATTTCG CCACGGAGCG GGTCCTGTTT CTGACGGTTC ACGGCTACTA CTTTTATCTC
GGCCTGCTTC TCTTTGCTGT GATTTCCCTC GTCAATCTGG AGGGGACGTA TTCTCATGCC
TCTCTCCTGG AGCGCTGGCG AATCAAATTC GATTTCATCG GCGCGACCAG TTACCTTGCG
CTTCTGGTGC TCTACTACAG CCAGGGGCTG CTCCATCGCT CTCTCAACAT GGGGCTGTTG
CCGGTACGAT CCCTGATTCT TGCCCTTGCC GCTGCCATGA TGCTCTATTC GCTCCTCGCA
CGGGGCAGCG GCGTGCGAAT CGCGGTCTCG CGGAACATGG CCTACAAGTC GGTGGTGCTG
GTTGCCGTAG GGCTGTACCT GGTTGCCGTG GGGGTGATGG GGGAGGGGTT GCGCTACTTC
GGGGAAGGCT TCCCCAAGGC AATGGCCATT GCCGCCCTGT TCTGCATGGG CATTGCCCTG
GTGGTCATTC TCTTGTCAGA GACCCTGCGC CGCAGGGTCC GGGTCTTCAT TCACAAGAAT
TTTTACCGGA GCAAGTACGA TTACCAGACC CAGTGGCTCC ACTTCACCGA CCTGTTGGCC
TCTGCCCGCA GCTCTGACGG TCTCATGGAG GCAATCCTCG CCGGCTACAG CGGGGTTTTC
GGCATGAACA GCGGAGTTCT TTTCCTTAAA AGCGGAGACG ACGGCTTATT CCGCTGGGCC
GTCGCTCGGG AGCAGGCGCT GGCCGGTGCA TTTTTCTCAT CTAACGATCC TCCGGTGCGC
CGGATGGCGG ATGAGGGCTG GATCGTCAAT CTCCGCGAGG CAAACCCTGA GGAGTTCTCC
GGTGCCGGTG AGTTTGTGCG GGAGAACAAC GTCGTATTCC TCATCCCTCT GTCGTCCGGC
GAGGGGCTTG AGGGGATCGT TGCCCTGGGA CGACCCGTGC ATGAGGGAGA GTTCTATCAC
TACGAAGATT ACGACCTGAT GAAGACCATG GCTCGCCAGG CCGCATCGGC CCTCATGAAC
CTGCGCCTCT CCGAGGAACT GGCAAGCGCC CGGGAACTGG AAGTGATGGG GCGGGTATCG
ACGTTTATCA TCCATGACCT CAAAAACCTG GTCTATACCC TGTCGCTTAC CGTGGACAAC
GCCCGTGACC ATATCGCCGA TGCCGAATTC CAGGAAGACA TGCTCGGCAC CCTCGGCAAT
ACGGTGAACC GGATGAAGCT GCTCATAGCC CGGTTGCGCG GCCTGCCTGA GAAGCAGTCC
CTCTGTTTCG AAGAGGTCGA TCTCCTTCGA CTTGCCGAGG AGAGCGCGGG GCTTGCCGGC
GGCCGGGGCA GCATCAGTGT CGGCGGATCT GCTGTTTCCG CCCGGGTCGA CCGCGAGGAG
ATCCAGAAAG TGGTCGTGAA CCTGGTGGTG AACGCCTTGG AGGCCACGGA AGGGAAGGGG
CCGGTGGCGG TGGAGGTGGG CTGCGGCACC GCCCCTTACA TCCGCGTGAC CGATGCCGGC
TGCGGTATCC CCGATGCGTT TCGTGCCCAC CTGTTCTCGC CCTTCAGGAC CACCAAGAAG
AAAGGGCTCG GCATCGGGCT CTACCAGTGT CGCCGGATCG TGGAGGCACA CGGCGGGAGA
ATAGACGTGG AGAGTACGCC CGGACAGGGT GCGATGTTTA CGGTGTGGTT GTAA
 
Protein sequence
MQLVLSAAAV LLSLAVIGVT IRRQGLTLSV CATVAALTAA SALEVLDLVV MLDPDRLFEW 
KRWVLGVESL LPAAWLTYSL THSRRTATSA VPRLQRFFLA ATAAFPLAAL LLPPEVFFYS
PDFATERVLF LTVHGYYFYL GLLLFAVISL VNLEGTYSHA SLLERWRIKF DFIGATSYLA
LLVLYYSQGL LHRSLNMGLL PVRSLILALA AAMMLYSLLA RGSGVRIAVS RNMAYKSVVL
VAVGLYLVAV GVMGEGLRYF GEGFPKAMAI AALFCMGIAL VVILLSETLR RRVRVFIHKN
FYRSKYDYQT QWLHFTDLLA SARSSDGLME AILAGYSGVF GMNSGVLFLK SGDDGLFRWA
VAREQALAGA FFSSNDPPVR RMADEGWIVN LREANPEEFS GAGEFVRENN VVFLIPLSSG
EGLEGIVALG RPVHEGEFYH YEDYDLMKTM ARQAASALMN LRLSEELASA RELEVMGRVS
TFIIHDLKNL VYTLSLTVDN ARDHIADAEF QEDMLGTLGN TVNRMKLLIA RLRGLPEKQS
LCFEEVDLLR LAEESAGLAG GRGSISVGGS AVSARVDREE IQKVVVNLVV NALEATEGKG
PVAVEVGCGT APYIRVTDAG CGIPDAFRAH LFSPFRTTKK KGLGIGLYQC RRIVEAHGGR
IDVESTPGQG AMFTVWL
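
The sequences above are printed in space-separated blocks of ten, so they need cleaning before any analysis. The following is a minimal sketch, not part of the record, of how the blocks might be joined, the GC fraction computed, and the nucleotides translated. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table holds the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons). Only the first line of each sequence is used here to keep the example short.

```python
BASES = "TCAG"
# Standard amino acid assignments, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons; '*' marks a stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        first, second, third = (BASES.index(base) for base in dna[i:i + 3])
        protein.append(AMINO_ACIDS[16 * first + 4 * second + third])
    return "".join(protein)


# First line of the gene sequence and the matching start of the protein.
gene_start = clean(
    "ATGCAGTTGG TCCTTTCAGC GGCGGCGGTA CTGCTCTCCC TGGCCGTAAT CGGAGTCACC"
)
protein_start = clean("MQLVLSAAAV LLSLAVIGVT")

translated = translate(gene_start)
print(translated)
print(translated == protein_start)
print(round(gc_fraction(gene_start), 3))
```

Passing the full gene sequence through the same helpers would allow the listed GC content and protein sequence to be checked against the record as a whole.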