Gene GSU1101 details

Gene Information

Locus tag: GSU1101
Symbol:
ID: 2688576
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 1185871
End bp: 1187484
Gene Length: 1614 bp
Protein Length: 537 aa
Translation table: 11
GC content: 64%
IMG OID: 637125770
Product: sensory box histidine kinase
Protein accession: NP_952154
Protein GI: 39996203
COG category: [T] Signal transduction mechanisms
COG ID: [COG5002] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
            [TIGR02966] phosphate regulon sensor kinase PhoR
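
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon (the gene sequence in this record ends in TGA). The function names are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1185871
end_bp = 1187484
gene_length_bp = 1614
protein_length_aa = 537


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # One codon per residue, minus the stop codon (assumed to be included).
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions both checks agree with the record: 1187484 - 1185871 + 1 = 1614 bp, and 1614 / 3 - 1 = 537 aa.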


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTGCCG GCCGTGAGAT CCGTGATATG CGGCGGGATG CGCCCGCAGT GGCAGCGGCC 
ATTTCGCGCG AGACCCGGGC ACGCGTGACG GTGATCCTTG AGGGGGGCGA GGTGGTGGCC
GACTCTGACA TCGCACCCCA ATACCTGGGA ACGCTGGAAA ACCACGCGAA CCGCCCCGAA
GTCCGCCAGG CGCTGGAACA GGGAGGGGGC GCCGCGGTCC GCTATTCAGC TACCATCAAG
ACCCCCATGC TCTACGTGGC GACCGTGCTC AACACCACAA CGGGAGAGCG CGGCGTGCTG
CGCCTGTCGC TCCCTCTCAC CGCGGTAGAA AAGGCAAAAC AGAGCATCCG GACCCTGCTG
GGGGCATCTC TTGTGGTATC GCTGGTGGTT GCGCTGCTCC TGAGCTATCT ACTCTCGCGG
CTCACATCCC GCTCGCTGCG CACCATCACA ACCCTGGCCA CCCAACTGGG CCGGGGAGAC
TACGGCCGGC GGATTCCGGT CATCACCCGC GATGAAGTCG GCGAACTCGC CCGCGTCATG
AATGAAATGG CTTCGCAGAT CGAACGGAAC CTGGCGCGGA TCTCGGCGGA GAAAAACCGG
CTCGACACCA TCCTCCGCGG AATGGGCGAG GGGCTGATGG TGAGCGATGC CGGGGGAACC
GTGACTCTCG TGAACCCGGC TTTTCTTGAG CTCTTCGGCC TCACCCAGAG CGTGGAAGGG
AGACAGATCA TCGAAATCGC CCGGTTCCCC GAGCTGCATG AAACGTTCAG GGCCGTGGTG
TCGTCCCGCA GCGAACACGT GGAGGAGATG ACGCTTCCCC TGGGGGACGA AAAGCAGGTG
CTGACCCACT GGGTGCCGCT CATGGAAGAG GACGAACTGC GCGGCGTGGT GGCGGTGTTC
CACGACATTT CGGACCTGAA ACGGCTTGAG GTTGTCCGCC GCGACTTCGT GGCCAATGTA
TCCCACGAAC TCAGGACGCC GGTGACGGTC ATCAAGGGAT ATGCCGAGGC CCTGGCCGGC
GGACTCGTGG AGGAAGACCC GGAACGGGCC GGGCGTTTCC TTGAGATTAT TTGCAGTCAC
TCGGAACGAC TCGCCGATCT GATCCGCGAT CTCCTCACCC TTTCACAGCT GGAATCGGGA
GGACTCCAGC TGGAGCTGAC CCAGATCCAC CTGGACCGTG CGGTCTCCCA CGCGGCGGGG
CTGCTGGAGC AGAAGGCCGC ACGGAAGGAG ATCGTCATCG ACATCTCGGC CCTGGCAGGC
GCTCCACCGG TACTGGCCGA CCCGGGACGG CTGGAGCAGG TCCTCATCAA CCTCATGGAC
AATGCCCTCA AGTACACGCC GCCCGGAGGA ACCATCACCC TCAGTGCCGA TGAGGCGGAC
GGGATGGTGA GGGTGTCGGT TCACGACACC GGCATCGGCA TCCCCCCCAA GGACCTGCCG
CGCATCTTTG AGAGGTTCTA CCGGGTGGAT GCCGCCCGCA GCCGCGACGA GGGGGGGACC
GGCCTCGGCC TTTCCATCGT GAAGCACATC ATCCAGCTCC ACGGCGGAAG CATCACTGTC
GAGAGCGAGC ACGGCAAAGG CACAACCTTC CGGTTCACCC TGCGCAAGAC CTGA
 
Protein sequence
MVAGREIRDM RRDAPAVAAA ISRETRARVT VILEGGEVVA DSDIAPQYLG TLENHANRPE 
VRQALEQGGG AAVRYSATIK TPMLYVATVL NTTTGERGVL RLSLPLTAVE KAKQSIRTLL
GASLVVSLVV ALLLSYLLSR LTSRSLRTIT TLATQLGRGD YGRRIPVITR DEVGELARVM
NEMASQIERN LARISAEKNR LDTILRGMGE GLMVSDAGGT VTLVNPAFLE LFGLTQSVEG
RQIIEIARFP ELHETFRAVV SSRSEHVEEM TLPLGDEKQV LTHWVPLMEE DELRGVVAVF
HDISDLKRLE VVRRDFVANV SHELRTPVTV IKGYAEALAG GLVEEDPERA GRFLEIICSH
SERLADLIRD LLTLSQLESG GLQLELTQIH LDRAVSHAAG LLEQKAARKE IVIDISALAG
APPVLADPGR LEQVLINLMD NALKYTPPGG TITLSADEAD GMVRVSVHDT GIGIPPKDLP
RIFERFYRVD AARSRDEGGT GLGLSIVKHI IQLHGGSITV ESEHGKGTTF RFTLRKT
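
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the blocks could be cleaned, translated and checked for GC content. It assumes only the Python standard library. The codon assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs in the set of permitted start codons, which does not matter here because this gene starts with ATG). The helper names clean_sequence, gc_fraction and translate are hypothetical.

```python
from itertools import product

# Standard codon assignments with bases ordered T, C, A, G
# (the same amino acid assignments are used by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in a DNA sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First display line of the gene sequence in this record.
first_line = "ATGGTTGCCG GCCGTGAGAT CCGTGATATG CGGCGGGATG CGCCCGCAGT GGCAGCGGCC"
print(translate(first_line))
print(gc_fraction(first_line))
```

Applied to the first 60 bases, the translation reproduces the first 20 residues of the protein sequence (MVAGREIRDM RRDAPAVAAA). Passing the full gene sequence to gc_fraction would allow a comparison with the GC content field reported above.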