Gene GSU2297 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU2297 
Symbol 
ID2687190 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp2514738 
End bp2517026 
Gene Length2289 bp 
Protein Length762 aa 
Translation table11 
GC content59% 
IMG OID637126990 
Productsensory box histidine kinase 
Protein accessionNP_953346 
Protein GI39997395 
COG category[T] Signal transduction mechanisms 
COG ID[COG2202] FOG: PAS/PAC domain
[COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00806153 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCTCA AACAAAGAAT GACACTCGCC ATTTCGGCGT TGATGGTTTT GTTCGCCGTG 
TTCATGGCGG TCGTTGCGGT CGGCTATTTC GAGCGCAAAC TCAACGAGAC CATTATCGAT
CAGCAAAGTT CGCTGGTCAG CTACATTGCC GGTGAAATCG ACAATACCCT CGGCATTGCC
CAGGGTCTTC TCGTTGCCAG CGCCACCTTC GTTCCCGCCG ACGCCCTTGA TTCCCCGGAG
CGAACCCGGC GATTCCTCCA GAGCAGGACA AGCCTCAACA AACTGTTCGG CAACCGGTTG
CTGGTCATCT CCGCCGACGG ACGGCTGGTC GCCACATCCC ATAACGTTGC CGGAAGCAAC
CCTCCGTCCA ATTTTACCGA CACCCCCTTT TTTCGCGAAA CTCTCAGGAA ACGAATTCCA
CTCATCTCCG AGATGACGAC CTGCTGTATC TCGCCCCATT CTTCGGAAAT CGTCATGACC
GCGCCGATCT TTGGCCCGGG CAACCAGGTA CGGGGAGTTC TCGCCGGGAC CATCGCCCTG
CGTGAGACGA ACCTGCTGTC GCGTTTCGGC ACCCTTCGCA TCGGCAAAAC AGGCTATCTT
CAGCTGGCAA CGGCTAACCA GACATTGATA GTGCACCCTG AGCGCGAGTG TATCCTGCAT
CAGGTCGGAC CTCTACACGG CAATCTCGTG ACCGACGCCG TCAAGGGGTT TTCCGGCACC
CGGCGTACGG TGAACAACAA GGGAGTGCCC ATGTTCACCA CGGCGCGCAA GCTGGCGTTG
AAAGATTGGG TAGTGATGGC CAACTACCCG GAAAAAGAGG TGTTGGCGCC CATACGTGCC
GCCAGGGGTG TTATGGGGGT CTTTACGGTA GCGGGGGTTG GCGCGCTGAT CATTCTCACC
TCCCTATCGA TCGGCTATCT CACCCGTCAA CTGACAGCAT TCACCGAGCA TATCCGCGGG
CTTTCCGAAA AGGAGGGCGC CGACCGCCGG TTCCCCATGC CCGATACCGG TGACGAGATC
TCCGCACTGT CGACCGCATT CAATGAGATG GTGACGTCGC TTGACCAGCG GACCGAGAGC
CTGTGCGAGA GCGAAGAGCG GTTTCGCAGT ACGTTCGAGC AGGCTGCGGT CGGTATTGCC
CATGTGGATA TGGATGGACG GTTTATCAGG CTCAATAGTC GCTTCAGCGA CATCCTCGGC
TATGCCGAGC ACGAATTGCT CGGTATGACC GTTGCCGATA TTACCTGTCC GGAATATCTG
GAGAGCTGCC AATCCTGCCG GCAGCAACTT CTTGAAGGAA AGCGTTCGTG TGCCGCAGAA
AACCGCTATC TCCGCAAAGG GGGAGCACAC GTCTGGGTCA ACCAGACGGC ATCGGTACTG
CTGGATTCAT CGGACAAGCC AAAGTATTTC ATCATGGTCA TCGAGGATAT TTCGGCCCGG
AAAGGGGCCG AGGAAGAGGT CCGCCGGCTC AATGCGGGCC TTGAGCAGCG GGTCGCCGAG
CGGACCGCTG AACTGGAGTC GGTCAACATC CGTCTCATGG CGGAAATTGA GCAGCGGGCC
CAGGCACAGG AGGAGATCGG TTGGCTGAAC GAGGATCTCA TGCGGCAGAA GGCGGCCCTG
GAAGCCGCGA ATCGAGAACT GGAGGCGTTC AGCTACTCGG TTTCCCATGA TCTGCGAGCA
CCCCTGCGGC ATATTGCCGG CTATGGCCTC GCCCTTCGGG AAGACTACGG CGGGCAGCTC
GATCCGCAGG CTCTCGAGTT CATCGAGCGG ATGAACGCAG CCGCTCGACG GATGGAGCAG
CTGATCGAAG CTATGCTCAA CCTGTCGCGG CTGGGGCAGG GTGACGTCAA GCGTATGCCT
ATCGACATGA GCGCCATGGC ACGCGAGGTC GTGGCCGGCT TGCGCGAGGC CGAACCGAAC
AGGAATGTAG TGATTACCAT TGCCGATGGG CTCACTGCCA CAGGGGATCC CCAGCTTATG
CGCATTGTTC TGGAAAATTT ACTGGGAAAT GCGTGGAAAT ATACGGGCAG ATCGCCGCTG
GCCGAAATTG AGTTCAGTGC TCTGACCGAC GGGCAGCCGG CGTTTTGCGT GAGGGATAAC
GGAGCGGGAT TCGACATGGC CTTTGCCGAT AGGCTGTTCG GACCGTTCCA GCGGTTACAC
CGTGATGACG AATTCGCCGG AACCGGCATC GGCCTTGCCA CGGTCCAGCG CATTATCAAC
CGGCACCGGG GAAGCATCTG GGCCGAGAGT GCACCCGGCG CCGGTGCTGT GTTCTACTTC
TCAATCTGA
 
Protein sequence
MNLKQRMTLA ISALMVLFAV FMAVVAVGYF ERKLNETIID QQSSLVSYIA GEIDNTLGIA 
QGLLVASATF VPADALDSPE RTRRFLQSRT SLNKLFGNRL LVISADGRLV ATSHNVAGSN
PPSNFTDTPF FRETLRKRIP LISEMTTCCI SPHSSEIVMT APIFGPGNQV RGVLAGTIAL
RETNLLSRFG TLRIGKTGYL QLATANQTLI VHPERECILH QVGPLHGNLV TDAVKGFSGT
RRTVNNKGVP MFTTARKLAL KDWVVMANYP EKEVLAPIRA ARGVMGVFTV AGVGALIILT
SLSIGYLTRQ LTAFTEHIRG LSEKEGADRR FPMPDTGDEI SALSTAFNEM VTSLDQRTES
LCESEERFRS TFEQAAVGIA HVDMDGRFIR LNSRFSDILG YAEHELLGMT VADITCPEYL
ESCQSCRQQL LEGKRSCAAE NRYLRKGGAH VWVNQTASVL LDSSDKPKYF IMVIEDISAR
KGAEEEVRRL NAGLEQRVAE RTAELESVNI RLMAEIEQRA QAQEEIGWLN EDLMRQKAAL
EAANRELEAF SYSVSHDLRA PLRHIAGYGL ALREDYGGQL DPQALEFIER MNAAARRMEQ
LIEAMLNLSR LGQGDVKRMP IDMSAMAREV VAGLREAEPN RNVVITIADG LTATGDPQLM
RIVLENLLGN AWKYTGRSPL AEIEFSALTD GQPAFCVRDN GAGFDMAFAD RLFGPFQRLH
RDDEFAGTGI GLATVQRIIN RHRGSIWAES APGAGAVFYF SI