Gene GSU0103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU0103 
Symbol 
ID2688164 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp118723 
End bp120834 
Gene Length2112 bp 
Protein Length703 aa 
Translation table11 
GC content58% 
IMG OID637124770 
Productsensory box histidine kinase 
Protein accessionNP_951165 
Protein GI39995214 
COG category[T] Signal transduction mechanisms 
COG ID[COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACAATG CACTGACGCT TCAACCCCTC GAACGAATTG TCACCGAACA CGAAGGATGG 
CTTGCGGCTC GCGCGCTCGG CTACGCCAAA AGCTGCGGCT ACACGCGCTA TACGTCCACG
CTTGAAGAAG CCTGGCGGAT TTCCATCGCT GGCCTTTCCG CGTCGCTCAT CCATGCTCTC
CGGGCAAAAG AGCCGATTCC CGAGTTGTCG CCGGATTACG ATTCCGTTGC AGACCCCGTT
ACGCTGTTCG GCATTGAGGA AGCCCGACGT CACCGCATGC GCGGTGTGAC GCTCGGTATG
TTCCTCGGCC TCATGAAATA CTACCGCCGC AGCTACCAGG ATCTGGTGCG CGGCAGTGGG
GTGGATGAGA AAACCCTTGA ATCCGGCCGG CTCTTCATCG ACCGTTTTTT TGACCGTGTT
GAAATTGGGT TCACCACTGA ATGGTCATCG GTTTCCGATC CCCAGCGGCT TGCCGAACTG
GAGGAGGCCA ACCGGTCCAT CACCAACGAG AAGAACAGGC TCCTCACACT GTTTGAAAGC
ATCCCTTCTC CTGTTATGCT TCTGGGTGCC GACGGCCGGC CAGTACTCAT GAACCATGCT
GCAGTCGAGT TATTCGCCGG TCCGACGCCG CCCGGCACGC ATTACTATCG TGAGAGTGGA
AGCCCAGGGC TGGAGGTGGC CGTCCCCCGC TGGAGCCGGC TGAAGCGTTT TCTGAAGGGG
AAGGCGCGGG AGGCGCGTTT CGAGGAGAGC ATCGACACAG CTGCCGGGTG CCGGTATTTT
CTGGTTACCG TGCAGCGGAT GCTCGACGTC AGTGAAAAAT TTTGCGGAAC GACTATCATC
CTTTCCGACC TCACCGGCCG CAAACTCGCC GAAGATGAAC TCCAGCACCT TGTGGCAGAG
ATGGAAGATC GGATCAGGTC TCGAACCGCT GAATTGCTGG ATGCGAACAC GCGTCTGCTG
GAAGAGATTG AGGAGCGCCG CAGGTCTGAG CGCTTTTTCC ACGCCACGAT TGATTCTCTG
ACAGCTCCAG TTGCCATTTT GAACGGCGAG GGGGACATCG TCACCGTCAA CCGGGCTTGG
CGAAAATTCG CCGATGAAAA CGGGGGGCTT CACCCCAATC ACTTCATTGG GTACAATTAT
CTCGGATTGT GCGATAACGT GAAGGGTGAC GATGCCGTCA CCGCTTCCGC TGTGGCGCGG
GGTATTCGAG AGTTGATAGC CGGCCGCATC AATGAGTTCC ATGTGGATTA TCCATGCCAC
TCCCCCGATG AGCAACGGTG GTTCCAGGTC CGTGTCACGA GGCTTGAGAA CGGCAGCCGT
GGCCGGGTCG CCGTGGTCCA TGAAAACATC TCCGAGGTGA AGAAGGCCCA GGAGGAAATT
CTGTCCCTGA ACCGTAGCCT CGAGGAGCGA ATCAGGCACC GGACCCAGCA GCTCGAATGC
TCTAACAGGG AGCTCGAAGG ATTCTGCTAC GCCGTATCCC ATGATCTTCG CTCCCATCTG
GCGAGGCTTG AGGGCCTTGG CCGGGGACTG CTCGAAGACT GTTTCGCAGG TCTCGACAAC
CAGGCCCGTC ACTATGCGGA GCGAATTTGC CGCATCAGTA TGGACCTGCG CAGGGCCATT
GACTCCTTGC TGGACCTCAG TCGGCTCGCT CGGTGCGAAC TTTCTAACGT AACCGTGGAT
TTGAGTGGGG TGGCGGAGCG GGTCGCTGAA GAACTGCACC AGTTGCAGCC GTTAAGACGG
GTGAGTTTTT CCCTTGAGCC CGGCGTAATC GTGCGAGGTG ACCCGCAGCT CCTGGAAACG
GTCATGCGGC ACTTGATGGG TAATGCATGG AAATTCACCA GCCGCCGCGA GGACGCAGAG
ATAGCCTTCG GCTCGACCAT GCTCCACGGT ATGAAGACCT GCTTTGTGCG TGACAACGGC
GTCGGTTTCG ACATGAGGTA TGCCGGCACT CTGTTCCAGC CATTCCAGCG GCTTCATGGC
CCTGCCGAGT TCGACGGCGC CGGGATTGGA CTGGCAACTG TGCAGAGGAT CATTCAGCGT
CACGGCGGGC GCATCTGGGC CGAAGGAGAG AGTGGAAAAG GTGCCGTATT TTTCTTCGTT
TTTCCCGATT AA
 
Protein sequence
MDNALTLQPL ERIVTEHEGW LAARALGYAK SCGYTRYTST LEEAWRISIA GLSASLIHAL 
RAKEPIPELS PDYDSVADPV TLFGIEEARR HRMRGVTLGM FLGLMKYYRR SYQDLVRGSG
VDEKTLESGR LFIDRFFDRV EIGFTTEWSS VSDPQRLAEL EEANRSITNE KNRLLTLFES
IPSPVMLLGA DGRPVLMNHA AVELFAGPTP PGTHYYRESG SPGLEVAVPR WSRLKRFLKG
KAREARFEES IDTAAGCRYF LVTVQRMLDV SEKFCGTTII LSDLTGRKLA EDELQHLVAE
MEDRIRSRTA ELLDANTRLL EEIEERRRSE RFFHATIDSL TAPVAILNGE GDIVTVNRAW
RKFADENGGL HPNHFIGYNY LGLCDNVKGD DAVTASAVAR GIRELIAGRI NEFHVDYPCH
SPDEQRWFQV RVTRLENGSR GRVAVVHENI SEVKKAQEEI LSLNRSLEER IRHRTQQLEC
SNRELEGFCY AVSHDLRSHL ARLEGLGRGL LEDCFAGLDN QARHYAERIC RISMDLRRAI
DSLLDLSRLA RCELSNVTVD LSGVAERVAE ELHQLQPLRR VSFSLEPGVI VRGDPQLLET
VMRHLMGNAW KFTSRREDAE IAFGSTMLHG MKTCFVRDNG VGFDMRYAGT LFQPFQRLHG
PAEFDGAGIG LATVQRIIQR HGGRIWAEGE SGKGAVFFFV FPD