Gene GSU3357 details


Gene Information

Locus tag: GSU3357
Symbol:
ID: 2686425
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 3687303
End bp: 3689687
Gene Length: 2385 bp
Protein Length: 794 aa
Translation table: 11
GC content: 63%
IMG OID: 637128051
Product: sensory box histidine kinase
Protein accession: NP_954397
Protein GI: 39998446
COG category: [T] Signal transduction mechanisms
COG ID: [COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase)
TIGRFAM ID: [TIGR00229] PAS domain S-box
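
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(3687303, 3689687)
print(length)                     # 2385
print(protein_length_aa(length))  # 794
```

Both values agree with the Gene Length and Protein Length fields listed in the record.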


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCCCT TGGCCGCCAG CCACTTCCCC TGGTTGCTTT TGCTTCGGAC ACGCCTCGAA 
CGGTCGTGGG CGCATCTGTG GCGCTCTTTC GCATCGATCC TGGCGGTCCT GACACTTCTC
CTTGCCCTCT CGCCCCAGAC GGCCCATGCC TCCGCTCCCG GCTCCCTCCT CGGCAAGATC
CTTGTGCTGC ACTCGTACCA TCCTGGCTTC ACCTGGACCC GCGAAGTGAC GGCCGGCATC
ATGGCTGCCT TCAGGGAGGC CGACCCTGAT GCGGACCCGT TTGTGGAATA CCTTGACGCC
AAGCGTTACG GTGGGCCAAT TCACGAGCGG CTCCTGGCGG ACCTCTTCCG CCACAAGTTT
GCCGGCGCCG GCCTCCAGGT GGTCGTCACC ACCGACAACG CCGCCTTCGA CTTTGCCGTG
AAACACCGTG CCGGGATCTT TCCCCGAGCG GCCATCGTGT TCTGCGGCCT GAACGGATAT
TCCGACAGCA CCCTGGCCGG CATGTCCAAC ATCGCCGGCG TCGTGGAGGA AGCCGATCCC
CTCAGGACCA TCTCTCTCGC CCTCTCCCTC CACCCCGACC GACGCCGGGT GGTGGTCATC
AGCGACACGA CCGAAACCGG CCGGGCCATT GCCGACAGTG TCCGGCAGGC CAAGGGCAGG
TACCGCGACC GGGCAGACAT CGCAATCATC CAGGATGTGA CCATGACGGA ACTTGCCACG
GCCGTTGAAC GCCTGGGCAA TAACTGCCTG ATCATCCTTG GGGCCTTCAA CCGGGATCGC
CAGGGCCGAT CCTTCAGCTA TGAGGAGGTC CTGAGGTTTG TGCATGACCG GACCAAGGTC
CCCATCTACG GTCTCTGGAC CTTCCAGCTG GGCAAGGGCA TCGTCGGCGG CAGCCTGCTG
TCCGGCCAAC AGCAGGGAGA GGCCGCCGGA CGGATCGCGT TGCGCATCGT CAAGGGTGAA
CTGCCCGAAT CCATCGGGAT AAGTCACGCG GTCCCGCCGG TTCTCCGCTT CGACTACAGC
GAGTTGAAAC GCTTCGGCAT CAGCCACGTC AAGTTGCCCG CCGGGAGCGA AATCGTCAAC
GAGCCTAACC GCCCCCTGCA TACCTATCGC CGCGAACTGG CCATCGCCGC CTCAGCCTTC
CTGGTGCTGT GCGGGATCAT TGCTCTGCTC GTTATCATGC TGCGCCAGCG TACCCTCCTG
GAGCGCAGAC TCCGCACCTC CGAGCAGGAG TACCGCCAGC TGGTGCAGAA CGCCAACAGC
ATCATCCTGC GTTTCGATAC GAGCGGGAAC ATAACCTACC TGAACGAGTT CGCCGAGAGC
TTTTTCGGGT ACCCGGCCAT AGAGCTGGTG GGTAAAAGCG TGGTGGGGAC CATCACCCCG
GAACGGGAAT CTACGGGCAG GGATCTGGCA GCCATGATCG GGGAAATCTG CGAGAACCCC
GACAGCTACA CCCACAACGT CAACGAGAAT GTCATGCGCA ACGGTGAACG CGTATGGATC
AGCTGGGCCA ACAAGCCCCT GGCGGACGAA CGGGGCAACC TTCGGGAGAT TCTGAGCATC
GGCAACGACA TCACCCATCT GAAGGAAGCA CAGGAGGAAA TCCTCAGGCT CAACGTCGCC
CTGGAGGAGC GGGTCCGGGA GCGGACCGCC CAGCTGGAGA GTTCCAACCG GGAGTTAGAG
TCGTTCTGCT ACTCGGTTTC CCATGACCTG AGGGCTCCCC TGCGGCACAT CAACAGCTAC
AGCAGGATTC TTCTGGATGA GTACCTGCCG CTGCTGGACG AGCAGGCGCA GCATTTCCTG
CAACGCCTTC AGGCGGCAAG CCACCGTATG GGACAGCTCA TCGACGATCT GCTGGAGCTG
TCGCGAGTGT CCCGTGCGGA GATGGAGTGT GAACGGGTCG ATCTGACGGA GCTGTCACGA
GGTGTGATCG AGGACCTGCG CGAGGGTGAT GACGACCGGA ATGTCGAGGT GCGGATCACG
GAGGGAATGA CCGCTTGGGG CGACCGACGG CTGCTCCTGC TCGTGCTTCA GAATCTCATC
GGCAATGCCT GGAAGTACAG CTCGAAGCGG GAATATGCGG TTATTGAGGT TGATGTGGTG
CGTCGGAACG GCCGTGAGGT GTTCCTGGTG CGCGACAATG GCGTGGGGTT CGACATGGCC
TTTGCCGATA AGCTCTTCGG AGCCTTCCAG CGTCTCCATT CGCCGGCGGA ATTCGAGGGG
ACCGGCATCG GGCTTGCTAC CGTGCAGCGG ATCGTCCATC GACACGGGGG GGACATCTGG
GCCGAGGCGG AACCGGATCG GGGGGCGACG TTCTTTTTTA CCCTCCCGCC CGAGCAGTGC
TCTCCGGCAG AGATGCGGAC AGCCTCTCCG CCCGAGGGGA CGTGA
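
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined into a single string before any computation such as GC content. A minimal sketch of that step follows, assuming plain A/C/G/T input; the helper names are hypothetical, and the sample is only the first line of the sequence, so its GC fraction will not necessarily match the whole-gene value in the record.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (whitespace is ignored)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used only as a small sample.
sample = "ATGAATCCCT TGGCCGCCAG CCACTTCCCC TGGTTGCTTT TGCTTCGGAC ACGCCTCGAA"
print(len(clean_sequence(sample)))    # 60
print(round(gc_fraction(sample), 2))
```

Passing the full 2385 bp sequence to `gc_fraction` would give the value to compare against the GC content field.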
 
Protein sequence
MNPLAASHFP WLLLLRTRLE RSWAHLWRSF ASILAVLTLL LALSPQTAHA SAPGSLLGKI 
LVLHSYHPGF TWTREVTAGI MAAFREADPD ADPFVEYLDA KRYGGPIHER LLADLFRHKF
AGAGLQVVVT TDNAAFDFAV KHRAGIFPRA AIVFCGLNGY SDSTLAGMSN IAGVVEEADP
LRTISLALSL HPDRRRVVVI SDTTETGRAI ADSVRQAKGR YRDRADIAII QDVTMTELAT
AVERLGNNCL IILGAFNRDR QGRSFSYEEV LRFVHDRTKV PIYGLWTFQL GKGIVGGSLL
SGQQQGEAAG RIALRIVKGE LPESIGISHA VPPVLRFDYS ELKRFGISHV KLPAGSEIVN
EPNRPLHTYR RELAIAASAF LVLCGIIALL VIMLRQRTLL ERRLRTSEQE YRQLVQNANS
IILRFDTSGN ITYLNEFAES FFGYPAIELV GKSVVGTITP ERESTGRDLA AMIGEICENP
DSYTHNVNEN VMRNGERVWI SWANKPLADE RGNLREILSI GNDITHLKEA QEEILRLNVA
LEERVRERTA QLESSNRELE SFCYSVSHDL RAPLRHINSY SRILLDEYLP LLDEQAQHFL
QRLQAASHRM GQLIDDLLEL SRVSRAEMEC ERVDLTELSR GVIEDLREGD DDRNVEVRIT
EGMTAWGDRR LLLLVLQNLI GNAWKYSSKR EYAVIEVDVV RRNGREVFLV RDNGVGFDMA
FADKLFGAFQ RLHSPAEFEG TGIGLATVQR IVHRHGGDIW AEAEPDRGAT FFFTLPPEQC
SPAEMRTASP PEGT