Gene GSU2042 details

Gene Information

Locus tag: GSU2042
Symbol:
ID: 2687997
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 2237164
End bp: 2238738
Gene Length: 1575 bp
Protein Length: 524 aa
Translation table: 11
GC content: 63%
IMG OID: 637126733
Product: sensor histidine kinase
Protein accession: NP_953091
Protein GI: 39997140
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
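
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 2237164
end_bp = 2238738

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codons - 1

print(gene_length, codons, remainder, protein_length)
```

Under these assumptions the result agrees with the listed Gene Length (1575 bp) and Protein Length (524 aa).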


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGGCGTC TGCGCTTCAG TCTCAGCTTC CTGATCCTCT CTTCCCTTAC GTTCCTCCTG 
GTTCTTACCT GGTTTCTCCT GAGCCTGATC TCGTTCAAGA CCGCCGAAAG CGACCTCCTT
CGCCAGAAAA GCGACAAGGC GCGTATCCTG CTCGCGTCTT TCACCGCGCT GGTTCCCCCG
TCGCTGGACG GGATAGGTTC TTCGGCGGCC GGCGTACTTG CCCGCCAGCT GGCGGGAGAA
CCGGAATTCA CGGCCCTGGC CGTTGTGCGG GGTGACGGAG CACCGGTGTT CGCTCTGGGT
GCCGGTTCAC CCGCTGACGA GCGTCTGGCG GCCTGTCTTC GCGACGGCGC CGAATCGGTC
TGGCTCCCTG CGGGGGAGGG CGTCCTCATT CGCTATGCCC CAATCATCCG TGAGGGGGCG
ACCGTTGGCG CGGCCAGGCT CGTCCTTTCC CTGGCGGGGG AGCGGCAGAT GCTCGCCCAT
TCGCGTCACC TGTTCCTCGC CTACTTCGCC CTCGATTTCC TTCTCCTTCT GGCGGTGGGG
TCACTGCTGC TTTCCCGCTT CATCATCATC CCCATGCGCA AGCTTCTGGC CGCAACGGAG
CGGATCGGCG CCGGCGACTA TCACCATAAG GCCGCGGTTC CGGGGAGCCG TGAGATAGCC
GAACTGGCCG ATTCATTCAA CCAGATGGTG GACGCCCTGC GGGTCAAGGA TGAAGAAGCG
GTGCGCCACG TGACCTCGCT GGAGCGGGCC AACCGGGAGC TCAAGGAGGC CCGCGAAGAA
ACCCTCAGAT CCGAAAAAAT GGCGTCGGTG GGAATGCTGG CCGCTGGGAC GGCCCACGAG
ATCGGCACCC CGCTGGCAGC CATCATGGGG TACGTATCAC TCTTGCGGGA TGATGAATCC
CTTGATCCGG AAAGGGCCGA CTATCTCCGT CGAATTCAGC ATGAGACCGA ACGGATCGAC
CGGATCGTTC ACGACCTGCT CGATTATGCC CGCCCCGTGC CCCTTGCCGC CGAAGAGGTT
GACGTGGTCG CCCTGGTGGC TGAAACTGTG GAGATGGTAA GCCGGCAGGG AGCCCTCAAA
CGGGTTCAGG TGCACATCGG GGCTGGCGAG GGGAGTGCTG TGGCCGTGAC CGATCCGCAT
CAGCTCCAAC AGGTTTTGAT CAACCTGCTG ATCAATGCCC GTGACGCCAT GCCCGACGGT
GGCCGCCTGG ATGTCGCCGT ATCGACCGGA GCGTTCATGC CGCCGGCAGG GCTTCCAGAC
AGCGGTCGCT GGTCTGTTAT GGCCGGCCGG CGCAAGGATG ACTTCGGCGG TGTGTTCCGC
ACACCGTTTG CCGGGGACGG GAGCGAGGTG CCGTGTATCC GGATTGCCGT GACCGACACC
GGGTCGGGAA TCGCGCCCGA GCACCTGGAG AGGATCTTCG ACCCCTTTTT TACCACCAAG
GAGCCGGGCA GGGGGACCGG ACTGGGGCTA TCCATCTCGG CCAGGATAGT TGATTCTCTT
GGCGGGAGAA TGACGGTCGA AAGTGCGGTG GGCGAAGGAT CGCGGTTCGA GATATGGCTT
CCAGTCTCGA AATGA
 
Protein sequence
MRRLRFSLSF LILSSLTFLL VLTWFLLSLI SFKTAESDLL RQKSDKARIL LASFTALVPP 
SLDGIGSSAA GVLARQLAGE PEFTALAVVR GDGAPVFALG AGSPADERLA ACLRDGAESV
WLPAGEGVLI RYAPIIREGA TVGAARLVLS LAGERQMLAH SRHLFLAYFA LDFLLLLAVG
SLLLSRFIII PMRKLLAATE RIGAGDYHHK AAVPGSREIA ELADSFNQMV DALRVKDEEA
VRHVTSLERA NRELKEAREE TLRSEKMASV GMLAAGTAHE IGTPLAAIMG YVSLLRDDES
LDPERADYLR RIQHETERID RIVHDLLDYA RPVPLAAEEV DVVALVAETV EMVSRQGALK
RVQVHIGAGE GSAVAVTDPH QLQQVLINLL INARDAMPDG GRLDVAVSTG AFMPPAGLPD
SGRWSVMAGR RKDDFGGVFR TPFAGDGSEV PCIRIAVTDT GSGIAPEHLE RIFDPFFTTK
EPGRGTGLGL SISARIVDSL GGRMTVESAV GEGSRFEIWL PVSK
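
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It builds the standard codon assignments (which translation table 11 shares for elongation codons) and assumes that an alternative start codon such as GTG is read as methionine when it is the first codon. The names `translate`, `CODON_TABLE`, and `START_CODONS` are hypothetical helpers, not part of any database tooling, and `START_CODONS` lists only the most common bacterial initiators rather than the full table 11 set.

```python
from itertools import product

# Standard codon assignments in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: common bacterial initiator codons, read as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna, first_is_start=True):
    """Translate a DNA string codon by codon, stopping at the first stop."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        # An initiator codon at position 0 is decoded as methionine.
        if i == 0 and first_is_start and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence (start of the CDS).
head = translate("GTGAGGCGTC TGCGCTTCAG TCTCAGCTTC")
# Last 15 bases of the gene sequence (already in frame; not a start).
tail = translate("CCAGTCTCGA AATGA", first_is_start=False)
print(head, tail)
```

The two fragments correspond to the first ten residues (MRRLRFSLSF) and the last four residues (PVSK) of the protein sequence, with the terminal TGA acting as the stop codon.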