Gene GSU1050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1050 
Symbol 
ID2688693 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1137432 
End bp1138790 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content63% 
IMG OID637125719 
Productsensory box histidine kinase 
Protein accessionNP_952103 
Protein GI39996152 
COG category[T] Signal transduction mechanisms 
COG ID[COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAACAA TGAGTGACCC GCCTAAAAAC GGTATTGAAT CGATCATCGT CACGACCCTC 
GTGGCGCTTA CGGCCTTCTG GTTCGCCGAC ACGGTGATCG ATGCCGTCAT TCCCCAGATG
GGATGTGAAT GCGGCCTCCT TCACGCTCCG ACCGTGCGTG CCTCTCTGTT CCACCTGATC
CCCCTCATTG CGCAACTGCT GCTCATTTTC TTCGTCCGCA GACTGTTCAG GGAGCGCCGG
CTGCTGGTCC GGAAGCTGGA AGCGGCGGTG GCCACTACCC TGGATGAAAA GGCCAGGACC
GATGCCGTCA TTGCCGCGGT CGGCGACGGA GTCTGCATGC TCGACCGGGG TTTCCGCATC
GTCTACCAGA ACCGGGCCCA TGAGCGCCTG CTCGGCGAGC ACGGTGGAGA ACGCTGCTCC
GACGCATACG GCGAAGACCC GGATGCCTGC CGTGACTGTC CCATGGCCCG CGCCATGGAT
ACGGGCGAGG TATGCACGGG AAGCCGCCGG TTCATCAGCA AGGGGGAGAC GCGTCTTCTG
GAGATCACCT CCTCGCCGGT GCGCAATGCC GCCGGTGAAA TCGTCGCCGG CGTCGAGGTG
GTTCGCGATG TGACCGAGCG CAGGCGCAGC GAGGAGGAAA TCAAGTCCCT CAATGCCGCC
TTGGAGCGCC GCGCCAGGGA TCTGGCAGCC AATAACCGGG AGTTGGAGGC GTTCAGCCAT
TCTCTCTCCC ACGATCTGAG CGCCCCGCTC GCCAAGATAT CCTGCGCTGT TGAGACGCTG
CGGGAAACCT ACGGCGAGCA AATGGGCGAC GACGGTCGGT TCCTGTTGTC GTGTATCTGC
GAAGGGAGTT CCCAGATGGA TGACCTCATG GAAGCCCTCC TTGCCTTGAA CCAGGTTTCG
CGAAAAGACC TCCGCCGTGA AAAAGTCGAC ATGGGAGCAC TAGTCAGCCA GCTCGCTCTG
GACCTGCGCC GGTCGGAGCC GGCCCGCCGG ATCGATTTTG TCATTGCCCC CGATCTGACG
GCCGAGGGGG ACCCCTCAAT GCTCCGTGTG GCCTTGCAGA ACCTGCTGTC CAATGCCTGG
AAATTCACCC GGAACGTGGA TGGCGGCCGG ATAGAATTCG GCTCCGTTGA CCGCAACGGC
GAACGAGTCT TTTATCTTCG CGACAATGGC GCCGGTTTTG ACATGAGGCA AGTGGGACGG
ATATTCGAGG TGTTCCAGCG GCTCCACGAC GAGAGCCTGT TCCCGGGCAC CGGAGTGGGG
CTCGCCACCG TACAACGGGT CATCGAGCGG CACGGCGGCA CCGTTACCGC CCACGGCGTG
CCGGGCCATG GGGCAACCTT CACCTTCACC CTGCCGTAG
 
Protein sequence
MKTMSDPPKN GIESIIVTTL VALTAFWFAD TVIDAVIPQM GCECGLLHAP TVRASLFHLI 
PLIAQLLLIF FVRRLFRERR LLVRKLEAAV ATTLDEKART DAVIAAVGDG VCMLDRGFRI
VYQNRAHERL LGEHGGERCS DAYGEDPDAC RDCPMARAMD TGEVCTGSRR FISKGETRLL
EITSSPVRNA AGEIVAGVEV VRDVTERRRS EEEIKSLNAA LERRARDLAA NNRELEAFSH
SLSHDLSAPL AKISCAVETL RETYGEQMGD DGRFLLSCIC EGSSQMDDLM EALLALNQVS
RKDLRREKVD MGALVSQLAL DLRRSEPARR IDFVIAPDLT AEGDPSMLRV ALQNLLSNAW
KFTRNVDGGR IEFGSVDRNG ERVFYLRDNG AGFDMRQVGR IFEVFQRLHD ESLFPGTGVG
LATVQRVIER HGGTVTAHGV PGHGATFTFT LP