Gene GSU0452 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU0452 
Symbol 
ID2686308 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp481730 
End bp483139 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content62% 
IMG OID637125118 
Productsensor histidine kinase 
Protein accessionNP_951511 
Protein GI39995560 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR01386] heavy metal sensor kinase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGTTTTTCA GGTCAATCCG CTTTTCGCTC ACCCTTTGGT ATGCGGTGAC GCTCGCGGTC 
ATTTTGGTTC TGTTCAGCTC GTTCATCTAC CTGGTTCTGA GCAATCAGCT CAACAAGGGG
ATTGATCGTG AACTGCTGAC CGTGGCCGAG GCCGTTGCAA GCCCGACTCT GGAGCCTTTC
CGCCATGCCG CTCCGTCGGT CTTCGACCAG GTGCTCGAAG ACTTCATCGG CACGCGCCTG
ACGGGCAAGC ACGTCCAGGT GCTCGACGGC TCCGGCGCGG TTGCCGCCTC GTCCAAGAGC
ATGGAGGAAC TGCGCATTCC TCTGGGCAAG ACTGCGTTGC GTCGCGTTCA GGCCGGCAAG
GTTTCCTATG AAACCAGGGT CAATCTGGAC GTGTATCCGG TCCGCACCAT CGTGTACCCC
ATCATGACCG ACGGCCGGCT CGACCAGATC GTTCTGGTCG GAACGTCCAT GCGGGGGCCG
GCCGAGACCC TTGAAAAGGT CCAGCTGGTC TTTGCCGTCT CCATCCCCCT GGCTCTCATC
CTGTGGAGCC TGGGCGGATG GTTCCTGGCG GGCAGGGCGC TGAAACCGGT TGACCTGATA
ACCCGCAGCG CCCGCAAGAT TACTGCCGAA AACCTGGGAC TGCGCCTGGA GGTCATCAAT
CCCCAGGACG AGATCGGCCG GCTCGCCACT ACGTTCAACG ATACCCTGGA ACGCCTGGAA
AACGCCTTCA ACCGGATCAG GCAATTCACC GGTGACGTCT CCCACGAACT GCGGACGCCC
CTCACCATCC TGCGCGGCGA AGCCGAGGTG GGGCTCAAGT GGGCCAAGGA GCCGGAGGAG
TTTCGTGAGC TTCTGCGTAG CAATCTGGAA GAGATCAACC GGATGTCCAA GATCATCGAG
ACGCTCCTGG AACTCTCCCG GGTTGAGGGG GGAGTCAAGC TGGAGCTCGC CGATCTGGAT
CTGAGCGACC TTCTTGCCGA ACTGGTTCAG CAGTCGCGCC TCATCGCACC GGACAAGAGT
CTCCGCATCG CTTTCGTGGG GCAGGAGCCG GTTACCATCC TCGGCGACTG GCTTCGGCTG
CGCCAGGTTT TCATGAACCT GCTGGACAAT GCCGTCAAGT ATACCCCTGC CGATGGGGAG
ATCTCCGTGG TGGTTGATAC GACCGGTGAC AGCGCCCGCG TGGCGATCAT CGACAGCGGC
CCGGGTATTC CTCCCGAAGA TCTGCCGCAC ATCTTCGAGC GGTTCTACCG GGTCGACAAG
GCCCGCAACC GGGCCGACGG CGGTTGCGGC TTGGGACTCT CCCTGGTCAA GACGTTCGTG
GAGGCTCATG GCGGACGCAT TGAGGTGGTG AGCGAAGCGG GCAAGGGGAG CATCTTCACG
GTCCTGCTGC CGCGGGTCGT CGCAGGATGA
 
Protein sequence
MFFRSIRFSL TLWYAVTLAV ILVLFSSFIY LVLSNQLNKG IDRELLTVAE AVASPTLEPF 
RHAAPSVFDQ VLEDFIGTRL TGKHVQVLDG SGAVAASSKS MEELRIPLGK TALRRVQAGK
VSYETRVNLD VYPVRTIVYP IMTDGRLDQI VLVGTSMRGP AETLEKVQLV FAVSIPLALI
LWSLGGWFLA GRALKPVDLI TRSARKITAE NLGLRLEVIN PQDEIGRLAT TFNDTLERLE
NAFNRIRQFT GDVSHELRTP LTILRGEAEV GLKWAKEPEE FRELLRSNLE EINRMSKIIE
TLLELSRVEG GVKLELADLD LSDLLAELVQ QSRLIAPDKS LRIAFVGQEP VTILGDWLRL
RQVFMNLLDN AVKYTPADGE ISVVVDTTGD SARVAIIDSG PGIPPEDLPH IFERFYRVDK
ARNRADGGCG LGLSLVKTFV EAHGGRIEVV SEAGKGSIFT VLLPRVVAG