Gene GSU2789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU2789 
Symbol 
ID2686952 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp3068098 
End bp3070410 
Gene Length2313 bp 
Protein Length770 aa 
Translation table11 
GC content64% 
IMG OID637127479 
Productsensory box histidine kinase 
Protein accessionNP_953833 
Protein GI39997882 
COG category[T] Signal transduction mechanisms 
COG ID[COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAATCT GCTATGCTCA GCCGACATTG CACAGAGATA TCACCTGGTC GGCAAGGATG 
GTCGATTTCA TGGCTTACGA CGCGGCAAAC GAAGAGTACA ACGGACCCCC TGTGGACGCG
GCAGCCTTCT GCGTTGCCGA CGGCCAGGGG CGGCTGCTCG ACGTCGGCGA GGGATTGTGT
CTTTTGAGCG GCTACCCGCG GGCCGAGCTT CTCTCCCTGT CCCTGGCCGA GCTTGAAGCC
GGCGGTTCCG GCACATGGCA CACCGCGTTG ATAAACACGG CAACCTCGGG CAGCCCGGCC
CGCGCCGAGA TCGGGCTCAG GCAGCGGGAA GGCGGTGTCA GGCAGGTGGA GGTGGAGGCG
GCCTTCCTGC CGGCGCTGGG GCAGGTGATC CTCACCTTCA GGGACAGTGC GATATTCTCG
CGAGTCCTGG CAGAGTTGAG GCAGAAGTCG GAAGAATGCG ACACCTACTT CCGCAACAGC
CTCGACCTGC TCTGCGTGGC CGACAACAGT GGGTATTTCC GGCGGCTCAA CCCTGCATGG
GAAGAGGTGC TCGGTTTCCC TCTCCATGAG CTGCTCGGCA GGAAGTTCAT CGACTTGGTG
CACCCCGACG ACCGCGAGGC GACGATGGGG GCTATGTCCG ACCTTGCGCA CCAACGGCAG
GTGAAGGATT TTACCAATCG CTACCGCGTC CGGGACGGTT CGTACCGGTG GCTCGAATGG
CGCGCCGCAC CGGCGGGGGA CCTGGTGTTC GCCGTCGCGC GCGACGTGAC GGACAGAATT
CTCGCCCAGG AAAGGCTCTG CCGGAGCGAA GAGCGATATC GCCTCCTGTT CGAGGAGATG
ATGAGCGGCT GCGCTCTCCA CGAGTTGATC TGCGACCGGG ACGGCAATCC CGCGGACTAC
CGGTTCCTGG CGGTGAACGG GGCCTTTGAG CGGATGACGG GGCTTCGGGC CGCCGACATC
GTCGGCAAGA CGGTCCTTGA GGTGATGCCG GGCACCGAGC GCCACTGGAT CGAGCGGTAC
GGGGACGTTG CCCTGTCGGG GCGGTCGGCG GAGTTCGAAG AATACGCGGG CGCGATCGGG
AGATACTACA GCGTCCGCGC CTACTGCCCT GAGAGGGGGA AGTTCGCGGC GGTCTTCAAC
GACATCACCG AGCGCCGCCT CATGGAAGAC GAGCTGCGGA GGCAGGAGCA GAGCCACCGG
CTCGTCCTCG ACAACATCCC GGACATCATC GCGCGGTTCG ACCGCGAGCT GCGCCATCTC
TATGTGAGCG CCGCCATAAC AAGGGTGACG GGGTTCCCGC CGGAGCACTT CATTGGCAGG
ACCAACCGGG AGATCGGCAT GCCGGACGAA CTGGTCGATC GCTGGGATGC GGCGATCAAC
GAAGTCTTCT CGTCCGGGAG GGAACAGCGG CTCGAATTTA CCTACCAGTC CCCTCGCGGC
CTGCGCCACT TTGAAACCAC GCTGATCCCC GAACGGGGTG CGGACGATTC GTCCGCCGCG
GTCCTGGTCG TAAACCACGA TATTACCGAA CTCAAGAACG CCGAACTGGG GATGCGGCAG
CTGAACGAGG AGCTTGAACG GAGGGTGGGG GAGCGCACGA GGGAGCTCGA GCTATCCAAT
CGCGAGCTGG ATTCTTTCTG CTCGGCGGTT TCCCATGACC TGCGGGCGCC GCTGCGTCAC
ATTGCCGGGT TCAGCCGGGT CGTCGCGGAG GACTACGGCG ACCGGCTCGA CCACGGGGGC
CGTGATCTGC TCACGCGGAT CGAAGGCGGT GCCCAGAGGA TGGATACCCT GATCAACGAG
CTGCTTAGTC TCTCACGGAC GAACCGTTCA CCCCTGGAAT GTCACCCCGT CAACCTGAGC
ACCATTGCGA TGGACGTCGT GACCGACCTG CGGGAGGCCG CGCCGGACCG GCTGGTGGAC
ACGGAGATCG CGCCCGATGT CATGGCCCAG TGTGACGGCA ACCTGATCCG CGTGGTATTG
TCCAACCTGC TCGGCAATGC CTGGAAATTC ACCTCCCGCA CTGAGCGGGC GCGCATCGAG
TTCGGCGTTA TTCCGTCGGG CGATCAGCAG ACGTTTTTTG TCAGGGATAA CGGCGCGGGT
TTCGACATGA ATTTCGCCGA CAAGCTTTTT GTCCCGTTTC AGCGGCTCCA TGCCCGTGAT
GACTATGAAG GCACCGGCAT CGGCCTCGCC ACGGTCCAGA GGATCATCCA TCGCCACGGC
GGGAAAATAT GGGCGGACGC CGCTGCCGAC CGCGGCGCCA CCTTCTGGTT CACCCTCGGG
GACTGCCTCG GCGGCAGACC TGTTTCGCCC TGA
 
Protein sequence
MIICYAQPTL HRDITWSARM VDFMAYDAAN EEYNGPPVDA AAFCVADGQG RLLDVGEGLC 
LLSGYPRAEL LSLSLAELEA GGSGTWHTAL INTATSGSPA RAEIGLRQRE GGVRQVEVEA
AFLPALGQVI LTFRDSAIFS RVLAELRQKS EECDTYFRNS LDLLCVADNS GYFRRLNPAW
EEVLGFPLHE LLGRKFIDLV HPDDREATMG AMSDLAHQRQ VKDFTNRYRV RDGSYRWLEW
RAAPAGDLVF AVARDVTDRI LAQERLCRSE ERYRLLFEEM MSGCALHELI CDRDGNPADY
RFLAVNGAFE RMTGLRAADI VGKTVLEVMP GTERHWIERY GDVALSGRSA EFEEYAGAIG
RYYSVRAYCP ERGKFAAVFN DITERRLMED ELRRQEQSHR LVLDNIPDII ARFDRELRHL
YVSAAITRVT GFPPEHFIGR TNREIGMPDE LVDRWDAAIN EVFSSGREQR LEFTYQSPRG
LRHFETTLIP ERGADDSSAA VLVVNHDITE LKNAELGMRQ LNEELERRVG ERTRELELSN
RELDSFCSAV SHDLRAPLRH IAGFSRVVAE DYGDRLDHGG RDLLTRIEGG AQRMDTLINE
LLSLSRTNRS PLECHPVNLS TIAMDVVTDL REAAPDRLVD TEIAPDVMAQ CDGNLIRVVL
SNLLGNAWKF TSRTERARIE FGVIPSGDQQ TFFVRDNGAG FDMNFADKLF VPFQRLHARD
DYEGTGIGLA TVQRIIHRHG GKIWADAAAD RGATFWFTLG DCLGGRPVSP