Gene GSU1004 details

Gene Information

Locus tag: GSU1004
Symbol:
ID: 2686090
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 1084304
End bp: 1085395
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 63%
IMG OID: 637125674
Product: sensory box histidine kinase
Protein accession: NP_952058
Protein GI: 39996107
COG category: [T] Signal transduction mechanisms
COG ID: [COG3852] Signal transduction histidine kinase, nitrogen specific
TIGRFAM ID: [TIGR00229] PAS domain S-box
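
The coordinate and length fields above are mutually consistent, which a short check makes explicit. This is a minimal sketch: it assumes the Start bp and End bp values are 1-based and inclusive, and that the protein length excludes the stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1084304
end_bp = 1085395

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1092
print(protein_length)  # 363
```

Both results match the Gene Length (1092 bp) and Protein Length (363 aa) fields.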


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.27324
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCACCG AAGAAAAGAA AGAGGAATTC CTTGCCACCG TGATCGACAG CGTCGGAGAC 
GGCGTCATCG TCATCGACCT GGACGGACGG ATCGCCCTGA TGAACCCCGC TGCGGAGGAG
ATCTCCGGCA TTTCGCGGCG GCAGGCCGTG GGACATCGCT TCGCCCTCGT CTTCCACCGG
GAGGCGGTGC TGCGGGAGAT GGTCGGCAAG ACCGCCACCA GCGGCATGAC CATCTCCGAC
CACGAGAACA TCGTCATCCG GAAGCTGAAG CAGCTGACGC CGGTTTCGGC GACCACCTTC
CCACTTATGC TTCCTCACGG CGAAACCACC GGGACGATCC TGGTGCTGCG CGACATCACC
AGCATCCGGG AGCTGGAAGA TGCCGTCCGC AATGCCGACC GGCTTTCCAC CCTGGGAACC
CTGGCCGCGG GATTGGCCCA CGAGATCAAG AACCCCCTGG GGGGCATCAA GGGAGCGGCC
CAGCTCCTGG AGCTGGAACT GCCCACCGAA AGCGAGTTGC GCGACAACGT CCGGATCATG
CTCAAGGAGG TGGAGCGGGT CAACCGGATT GTTGAGGAGC TTCTGGCCTT GGCCTCGCCC
CGGGGACTGC AACTGAGCAA GGTGAACCTC CATAAGGTCA TCGGCGACAT CCTCACGCTC
CAGAAGCGCT CGACCGAAGG GAAGAACGTC GCCTTCCAGC AGCAGTTCGA TCCCAGCATC
CCGCCCATCC TTGCCGACGA GGGGCTGTTG ACCCAGCTTT TTCTGAACCT CGTGAAAAAC
GCGATGGAGG CGGTGGATGA CGGCGGCTGC ATCCGGGTCG CCAGCCGGGT GATATCCGAC
TACTCAATGA CCCAGAAGGG CGAGCGACGC TCGCGGATGG TGGCCATCGA CGTGGCCGAC
GATGGACCTG GCATCCCGCC GGAGCGGCTC GAGCAGCTCT TCACGCCCTT TTTCACCACC
AAGACCAAGG GGACGGGCCT GGGGCTGGCC ATCTGCCAGA AAATCGTGAC GGAGCATCGG
GGAATGATCA AGGTGGAATC GTACCCCGGC AAAGGGACCA CCTTCACGGT GATGCTCCCC
CTGATTCAGT AG
 
Protein sequence
MTTEEKKEEF LATVIDSVGD GVIVIDLDGR IALMNPAAEE ISGISRRQAV GHRFALVFHR 
EAVLREMVGK TATSGMTISD HENIVIRKLK QLTPVSATTF PLMLPHGETT GTILVLRDIT
SIRELEDAVR NADRLSTLGT LAAGLAHEIK NPLGGIKGAA QLLELELPTE SELRDNVRIM
LKEVERVNRI VEELLALASP RGLQLSKVNL HKVIGDILTL QKRSTEGKNV AFQQQFDPSI
PPILADEGLL TQLFLNLVKN AMEAVDDGGC IRVASRVISD YSMTQKGERR SRMVAIDVAD
DGPGIPPERL EQLFTPFFTT KTKGTGLGLA ICQKIVTEHR GMIKVESYPG KGTTFTVMLP
LIQ
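
The gene and protein sequences are related through translation table 11. The sketch below shows one way to strip the 10-character grouping from a sequence block and translate it. It assumes the codon-to-amino-acid assignments of table 11 match the standard genetic code (table 11 differs only in which codons may act as starts, which does not matter here because the gene begins with ATG). The helper names `clean_sequence` and `translate` are hypothetical.

```python
# Build the codon table from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate DNA codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence shown above.
first_line = "ATGACCACCG AAGAAAAGAA AGAGGAATTC CTTGCCACCG TGATCGACAG CGTCGGAGAC"
last_line = "CTGATTCAGT AG"

print(translate(clean_sequence(first_line)))  # MTTEEKKEEFLATVIDSVGD
print(translate(clean_sequence(last_line)))   # LIQ
```

The two outputs correspond to the first 20 and the last 3 residues of the protein sequence listed above.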