Gene GSU0822 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU0822 
Symbol 
ID2687230 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp882167 
End bp884356 
Gene Length2190 bp 
Protein Length729 aa 
Translation table11 
GC content63% 
IMG OID637125494 
Productsensory box histidine kinase 
Protein accessionNP_951879 
Protein GI39995928 
COG category[T] Signal transduction mechanisms 
COG ID[COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACTCA GGACAAAACT CATTATTCTG GTGTCGATTC TCGTAATCAT CCTGATGGTG 
GTGACCTCGA GGATCACGCT GGGCTACCTG GAGCGGCACC TCCACGACTC CATCGCCGCC
CAGCAGACCG CCACCGTCGT CCACGTGGCC ACGGACATCG ACAGCACCCT CCGCTCCATG
CTGGAGTTGC TGACGGCAAG CGCCCGGGTC GTCCCGCCCG CAGCACTGGC CGACCCCTCG
GGCGCCAAGG CGTTTCTGGA AAGCCGGACC GGCCTGCGCA CGCTGTTCAA CAATCACCTC
TTCGTCGTTG ATGCCGCCGG CAACCTCCTG GCAGAAGTGA CCAACCAGGA AGTCCGACGG
GTGGAAAACT TCTCCGGTCA CGCCTTCTTC AACAAAGCCA GGGCAACCCG CAAGCCGCTC
ATCTCCGAGC TGACCACCTG CTGCACCGGC ACCGCCAACT TGAGGGAAAT CGTCTTTATC
AGCCCCATCC TCGGGAAAAA CGGATCATTC AAGGGAGCCT TGCTGGGGGG AATTGACCTG
AGCGAGGAAA ACGCCCTGAG CCGGTTCGGC CGGATCACCG TGGGCAACAA GGGGTTCATC
CGCATCATTG ACCGCAACCA TATGGTACTG ATCCATGCGG AACGGGAGCA CACCCTGATC
AAGGCGGTTC CTGAAGTTGC GCGCCTGGCC GACGCCGCCC GGGAGGGCTA CGTGGGCACC
CGCGAAACAA AGGGGCGATA CGGCGACATC CTTCTGACAT CGGTAGCGAA GCTCGGCAGC
AAGGATTGGG TCGTGGCGGC GAGCTATCCG CTCACAGTGG CCTACGAGCC CGTGAAGGTG
GTACGCCGGC TGTTCATCAT CAGTACCGTG GTCGCCATCC TCGGTGTTCT GGTAGTGGTC
TCGCTCTCCA TGCAGTACCT GACCCGCCCC ATTCTGGCGC TGGAGCGGCA CATCAACGAA
CTGAGCGGCA AAAAAGGCAA GGAACGGCTG GTGCCGGTAT CGAACGAGGA CGAGTTGAGC
CGGCTCACCG AAACCTTCAA CACTATGCTT GCGGAGATCG ACCGGCAAAC CGAGAGCCTC
AGGGAGAGCG AGGACCGTTT CCGGGGGGCT TTCGAGCAGG CGGCGGTGGG CATGGCCATC
ATCGACCGCG AAGGGCTGCT GCTCAGGACA AACCGGCGCT TCTGCGACAT TACCGGTCGC
CATGACGAGG ATCTGGCGGG GCTCGACTGC CTTACCCTGG TGCACCCGGA GGACCGCAAC
GCCACCCGGG AGATCATGCC GACCATGGCC GCCACTGAAG GGGAGCCGTT GACGCGTGAG
CTCCGGTTCA CCCATGGCAC GGGCCGGACC GTCTGGGCAA ATACCGCGTT TTCGCCGGTC
CGGGGGAGGA GTGGCACCGA CGATTCCTTC ATAATGGTGG TGGAGGATGT CACCGAGCGC
AAGCGTGCCG AAGAGGAAAT CCTGCGGTTG AACTCCGACC TGGAACAGCG GGTGGCGGAT
CGTACCGCCG CACTGGAGAG CGCCAACCGG GAGTTGGAGG CATTCAGCTA CACTGTCTCC
CACGACCTGA AGGCCCCGGC CCGCCATATC TCCGGGATCG TCGACATAGT GCTGGAAGAT
TGGGGTGGCT GCATGGAGCC TGCGCACCGC GAGTTGATGG AACGTGTGGC CGCGGCAGCC
GGCAGGATGC AGTCCATGAT CGACGGATTG CTGGAACTGA GCCGCGTTGG CAGCGATGAA
CTGCGGCGCC AGGAGGTTCG CCCGGCCCAC CTGGCCCGGG AGATCTGCAT CGAGCTGGCC
GCGGCAGAGC CGGCGCGGCA GGTGGACTGG AGCGTAAAGG ATGTGCCGCC GGCCAATGCC
GATCCCGAAC TCCTCATGAC CGTGCTGGAA AATCTCCTGG GCAACGCCTG GAAGTATACC
TCGCGAAACG AACGGGCTGA GGTGGAATTC GGCTGCGAAT ACTGTTCGGG AAAAAACATG
TACTACGTAA AGGACAACGG CGCCGGGTTC GACATACGCG AGGCCCAGCG GCTGTTCGCC
CCCTTCCAGC GCTTCCATCC GGCATCCGAG TTCGAAGGAA ACGGCATCGG GCTCGCCACG
GTCGCCAGGA TCATCCATCG CCACGGCGGC ACCATCCGCG CCGAGTCGGC CCCCGGCCGG
GGAGCCACCT TCTATTTCTC CCTCGACTGA
 
Protein sequence
MKLRTKLIIL VSILVIILMV VTSRITLGYL ERHLHDSIAA QQTATVVHVA TDIDSTLRSM 
LELLTASARV VPPAALADPS GAKAFLESRT GLRTLFNNHL FVVDAAGNLL AEVTNQEVRR
VENFSGHAFF NKARATRKPL ISELTTCCTG TANLREIVFI SPILGKNGSF KGALLGGIDL
SEENALSRFG RITVGNKGFI RIIDRNHMVL IHAEREHTLI KAVPEVARLA DAAREGYVGT
RETKGRYGDI LLTSVAKLGS KDWVVAASYP LTVAYEPVKV VRRLFIISTV VAILGVLVVV
SLSMQYLTRP ILALERHINE LSGKKGKERL VPVSNEDELS RLTETFNTML AEIDRQTESL
RESEDRFRGA FEQAAVGMAI IDREGLLLRT NRRFCDITGR HDEDLAGLDC LTLVHPEDRN
ATREIMPTMA ATEGEPLTRE LRFTHGTGRT VWANTAFSPV RGRSGTDDSF IMVVEDVTER
KRAEEEILRL NSDLEQRVAD RTAALESANR ELEAFSYTVS HDLKAPARHI SGIVDIVLED
WGGCMEPAHR ELMERVAAAA GRMQSMIDGL LELSRVGSDE LRRQEVRPAH LAREICIELA
AAEPARQVDW SVKDVPPANA DPELLMTVLE NLLGNAWKYT SRNERAEVEF GCEYCSGKNM
YYVKDNGAGF DIREAQRLFA PFQRFHPASE FEGNGIGLAT VARIIHRHGG TIRAESAPGR
GATFYFSLD