Gene GSU2531 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU2531 
Symbol 
ID2687804 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp2791476 
End bp2794637 
Gene Length3162 bp 
Protein Length1053 aa 
Translation table11 
GC content63% 
IMG OID637127221 
Productsensory box histidine kinase 
Protein accessionNP_953577 
Protein GI39997626 
COG category[T] Signal transduction mechanisms 
COG ID[COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.606569 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCTCC CCCGCAGTAT CCGCACCAAG GTGTGGCTCT GTGTTCTCGT GGCCTTCGTG 
GGTTACCTGA TCGCAACCCT TTCCAGTTAC TACTCCAATC TACTTTTTTC CCACAACCTC
GACCACCTCA AGCGGGTCGA ATTTCCCCTG GCCCTCAATG GGAACGATAT CCTGAGCCTC
TACCGCAAGC AGTTGAAGCT CTACGAGGAT GCCTTTGTCA CCGGCGAGCG GGATGAAGCG
CTCAGGGCCA ATGAACTGGG GCGCGAGATC GTGACGAGCC TGGCCCGGGT AGCTGAGCTC
ACCCGGGAGG ACACCGAGCA CCGCTATGGC GACGTCGAGG CGCTCAAGTC CCGCTACGAG
GCCTATTTCC GGAAGGCGAC GGAGCTGTAT CCCCGGGTGC TCGGCAGCAC CGACACGTTC
CTGAGCGGAG AGATCGCCCG TCTCGGGGCG GACGGCCGCT TGATCCTGGT GGACTTCGAG
CGGATGTCGC GGGATTACGT CACCTCGGTC GAGCACCAGA TCGAGCGCAA CCGTGCGCTT
GCCCGCGATA CCTCGATCTA TCTTTTCGTC CTGTTCGGGA TGGTGGTACT GCTGGCGGCG
CCTGCCATCA CCTTTGTGGC CAACCGGCTC CTGATCCGCC CCCTGGAAGA GCTGCGCGGC
ATGGTTACCT CCTTTGCCGG GGGCAGTCTG GACCTTTCCG GACTCCCTGA CTACGATGCC
GGAGACGAGA TCGGCTCCCT CTGCGCCTCG TTCAGGTCCA TGGTGGAGGG GCTCCAGGAG
ACCACGGTCT CCCGCGACTA CGTGGACAAT ATCATCGAGA GCATGAGCGA CTGCCTGATA
GTGGTCGACA CCCGCGCCGC CATCCGGCGG GTGAATCGCG CCGCCCTGAC CATGCTCGGC
TACGATGAGG AGCAGTTGCT CGGCATGCCG GTCGGTGTCA TCTTCGCCTG GGAACGTGAC
AAGGAGCGCC TGCGGACCGA GGGGCTCAGG TGCTTTGTGG AGTCCATCGG CTCCAGTCCG
GCCGAATTCA CCTTCCTGAG CAGCGACGAA CGCCGGACCC CGGTACTGCT CTCCGCGTCT
CCCATGTTAG GCCGTGACGG CTCGTTCCAG GGATCGGTCT TCGTTGCGGT GGACATCGGG
GAGAGGAAGC GGACCGAGCA GGCCCTGCGG GAAAACGAGC AGCACCTGAA GAACATCCTC
GATTCCATCC ACGCGGGAAT CCTTGTCATC GACCCCCAGG ATCACCGGAT CGTGGATGCC
AACACCTTTG CCCTGGACAT GATCGGGGCG CCCAAGGGGC TCGTGGTGGA CCGGATCTGC
CACAATTTCG TCTGTAGCGC CGTTCAGGGC GCCTGTCCCA TCACCGACCT GCTCGAAAAC
ATGGACAACT CGGAGCGGCG CCTGCTGCGG TTCGACGGCA CGTCCATCCC GATCCTCAAA
TCGGTGGTGC CGGTCACCTA CCAGGGAAGG AAGCTGCTGA TCGAGAGCTT CATCGATATC
TCCGAGCGCA AGTGGGCCGA AGAAATGCTC CGCTACCTCA CCGAGGGGAC GGCTTCGGTG
ACGGGCGAGG AGTTTTTCCG TTCGCTCGTC CGGCGTCTCT CCTCGGCCCT CGGGACCCGG
TTCGCCTTTG TCACCCGCCT GCTGGACGGT TCTCCCGCCC GGATGCGCAC CCTGGCCTTC
TGGACCGGCA ACGGCTACTG TCCCACCATG GAGATGCCCC TGAACGGCAC CCCCTGCGAA
CGGGTCATCG CTGACAGCGA CATCCTGTTC CATCCCAGCG GCTTGGGACA GCTCTATCCC
TCCGCCGAAA CCATGAAGCG AATGGGTGTC GAGAGCTTCC TGGGCGTGCC GCTGTTCGAT
TCCCGCGGCA CCGCCATCGG CCACCTGGCG GTTCTGGACG ATAAGCCCAT GCGCGAGGAC
GAGGGGCACC GCTCCCTGTT GCGGATATTC GCGGCCCGGG CCGGGGCGGA GCTCGAGCGG
ATGAGGTGGG ACGAGGCCCT GCGCGAAAGC GAGACCCGCT ACAAGGATCT CTTCGAAAAC
GCCAACGACC TGATCCAGAG CGTGTCACCC TCGGGCGAGA TCCTCTACGT GAACCGGGCC
TGGCGCGAAA CGCTCGGCTA CAGCGAGGAG GAGGTGAAGG GGCTCACCTT TGATCAGATC
ATCGATCCGG AGTGCATTGG CCACTGCATG GGCGAGTTCC GCAGGGTCAT GGAGGGCCAG
AGCCTGGAGG CGGTGGAGGC CCGGTTCATG GCGAAAGATG GCCGCTCCAT CCTGGTGGAG
GGGAGTGTCA ACTGTAACGT GGTGGACGGC AAGCCGCTGG CCTCCCGGGG GATTTTCAGG
GATATCACCG AGCGGAAGCA CCATGAGGAC ACGCTCCGGA AATACGCGAC AGAGCTTGAG
CAGACCAACG AGGAACTGAA GGACTTCGCC TATATCGTCT CCCACGACCT GCGCGCGCCG
CTGGTCAGCA TCAAGGGGTT CTCCATGGAG CTGGTTACGG CTATGGATGA GCTCAGGGGG
GTCATAGCGG AGCTTTCCCC GAACATCGAG ATGATGACCC GCGAGCGGCT CCGCCGGCTC
TTCGAGCAGG ATATCGACGA GGCGGTCGGC TTCATCAACT CCTCGTCCCA GCGCATGGAT
ACCCTCATCA CCGCCATTCT CAACCTGTCG CGCCTCGGTC GCAGGGAACT CAAGACGGAA
CCGGTCGACA TGGGCGCTGT CGTGCGTTCG ATCCTCGACT CCCTGGCCCA TCAGATCGAG
ACGAACCGTA CCGAAGTGGC GATCCGCGAC CTTCCGGTCA TCACCGCGGA CCGGATTGCC
ATGGAACAGA TCATGGGCAA CCTGCTGGAC AACTCCCTCA AGTACCTGGA GCCCGGGCGG
CCGGGCCGTC TGGAGATCTG GGCCGATGTG GGCGGCGAGG AGTGCGTCTT CCATGTGAAG
GACAACGGGC GCGGCATCAG GGAGGAAGAA ATCCCCCGGG TCTTCGAACT GTTCCGCCGC
GCCGGCCGGC AGGATGTCCC CGGCGAGGGG ATGGGGCTGG CCTTCGTGAA GACCCTGGTG
CGGCGGCTGG GTGGCCGCAT CTGGTGCGAG TCCGAACCGG GCGTGGGGAG CACGTTCAGC
TTCACCCTGC CGTCTTCCTT TTCTCACAAA CTCCCGCTAT AG
 
Protein sequence
MKLPRSIRTK VWLCVLVAFV GYLIATLSSY YSNLLFSHNL DHLKRVEFPL ALNGNDILSL 
YRKQLKLYED AFVTGERDEA LRANELGREI VTSLARVAEL TREDTEHRYG DVEALKSRYE
AYFRKATELY PRVLGSTDTF LSGEIARLGA DGRLILVDFE RMSRDYVTSV EHQIERNRAL
ARDTSIYLFV LFGMVVLLAA PAITFVANRL LIRPLEELRG MVTSFAGGSL DLSGLPDYDA
GDEIGSLCAS FRSMVEGLQE TTVSRDYVDN IIESMSDCLI VVDTRAAIRR VNRAALTMLG
YDEEQLLGMP VGVIFAWERD KERLRTEGLR CFVESIGSSP AEFTFLSSDE RRTPVLLSAS
PMLGRDGSFQ GSVFVAVDIG ERKRTEQALR ENEQHLKNIL DSIHAGILVI DPQDHRIVDA
NTFALDMIGA PKGLVVDRIC HNFVCSAVQG ACPITDLLEN MDNSERRLLR FDGTSIPILK
SVVPVTYQGR KLLIESFIDI SERKWAEEML RYLTEGTASV TGEEFFRSLV RRLSSALGTR
FAFVTRLLDG SPARMRTLAF WTGNGYCPTM EMPLNGTPCE RVIADSDILF HPSGLGQLYP
SAETMKRMGV ESFLGVPLFD SRGTAIGHLA VLDDKPMRED EGHRSLLRIF AARAGAELER
MRWDEALRES ETRYKDLFEN ANDLIQSVSP SGEILYVNRA WRETLGYSEE EVKGLTFDQI
IDPECIGHCM GEFRRVMEGQ SLEAVEARFM AKDGRSILVE GSVNCNVVDG KPLASRGIFR
DITERKHHED TLRKYATELE QTNEELKDFA YIVSHDLRAP LVSIKGFSME LVTAMDELRG
VIAELSPNIE MMTRERLRRL FEQDIDEAVG FINSSSQRMD TLITAILNLS RLGRRELKTE
PVDMGAVVRS ILDSLAHQIE TNRTEVAIRD LPVITADRIA MEQIMGNLLD NSLKYLEPGR
PGRLEIWADV GGEECVFHVK DNGRGIREEE IPRVFELFRR AGRQDVPGEG MGLAFVKTLV
RRLGGRIWCE SEPGVGSTFS FTLPSSFSHK LPL