Gene GSU1038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1038 
Symbol 
ID2688721 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1124101 
End bp1125642 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content63% 
IMG OID637125707 
Productsensory box histidine kinase/response regulator 
Protein accessionNP_952091 
Protein GI39996140 
COG category[T] Signal transduction mechanisms 
COG ID[COG3852] Signal transduction histidine kinase, nitrogen specific 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCACTC ACGTGCGGAT GAAACATGTT ATGGATGAAT GTGCTCTGGA AAATGCACCT 
CCTCTGATCC TCCTGGTGGA CGATGATGCG GTGACCCGCA AGATGCTGCG CAACCTCTTC
ACGCTTTCGG GATACCGCGT GGCCGACGCG GAAGACGGCG CCCGGGCGGT TGAGATGTTT
CGCGAACTTT CACCGGATCT GGTCCTTCTC GATATCATGA TGCCGGTCAT GGACGGGTAT
GGCGCCTGTG CGGCGATCCG CGGCCTGCCG GGCGGCGAGC ATGTCCCGAT CATCATCATG
ACTGCCCTGG ACGATGCTAA CTCAATCGGC CGTGCCTTTG ACGCCGGGGC CACCGATTTC
ATCGAGAAGC CCGTGAACTG GATGCTTCTC AACCACCGCC TTCCCTATCT GCTGCGGGCG
CGGGACGCCT TTGTCTCCCT CCAGCGCAGC GAGGCTACCA GCCGCCTGCT TTCGGGCGAG
CTGATGGCGC TCCTGAACTC AATCAACGAC AGCCTTGTCC TGTTCTCCAC CGATCTGAAA
CTGCTCTGGG CCAACCGGAG CGCCGAGCAT CTCTACGGCG ATTACGTCGA GCAGCTCGTT
GGTCAGGAGT TCACTTCCCT CAAGGGGGCG CGGGGCATCC CGTCCGATGC CATTGCCGTA
CGCGCGTCGC TCGGCTCGGG CGAGCCCTGC TACGAGCGCG TATCAGCCGC CGACGGCAGG
ATCTGGGACA TGAAGTATTT CCCCGTGCGC GGCGAAGACG GCACCGTGCG CGGCATCATC
GAGCTCGCCT CGGACATGAC GGAGGTGGTC TCACTCCAGG CGGAGGCCCT CCGGTCGGGA
CAACTGGCAG CCCTGGGCGA ACTGGCAGCC GGCGTGGCCC ACGAGATCAA CAACCCCATC
AACGGCATCA TCAACTATGC GCAACTGCTG GTGAACTGGC TGCCGTCCGC CTGCAAGGAG
CGCGACATCG CCGAACGGAT CATCCGGGAG GGAGATCGGG TGGCGGGGAT CGTCCGGGGG
CTTCTCTTCT TTGCCCGGGA GGGCATGGGG GCGCGCCTGC CCTGCAATGT CGCCGACGTC
CTGACCGACA CGCTCACCCT CACCGAGGCC CAGATCCGCA AGGACGGCAT TACTCTCAAG
GTCGGGGTGC CGGCTGACCT GGGTCGCGTG AGAGCCAGCC ACCAGCAGCT CCAGCAGGTC
TTCCTCAATA TTATCAGCAA CGCCCGCTAT GCGCTCAACG AGAAGTTCCG CGGTTTCCAT
TCCGCCAAGA TCCTGGAGAT CCGGGGCGAG CGCGTGCTCA TCGATGACCG GCCGTACATC
CGGATCGGCT TCAACGATAC CGGTACCGGT ATCCCGGAAG CCATCAAGGA CAAGGTGATG
ACGCCGTTTT TTTCGACCAA GCCTACCTGC AAGGGGACAG GCCTGGGGCT CAGCATCAGC
CAGAACATCA TCCGCGACCA TGACGGCAAC CTCTCCATCG AGAGCCGCGA GGGGGAATTC
ACCCTCGTGA GCATCGACCT TCCCGCCGAG GAGACGCCAT GA
 
Protein sequence
MTTHVRMKHV MDECALENAP PLILLVDDDA VTRKMLRNLF TLSGYRVADA EDGARAVEMF 
RELSPDLVLL DIMMPVMDGY GACAAIRGLP GGEHVPIIIM TALDDANSIG RAFDAGATDF
IEKPVNWMLL NHRLPYLLRA RDAFVSLQRS EATSRLLSGE LMALLNSIND SLVLFSTDLK
LLWANRSAEH LYGDYVEQLV GQEFTSLKGA RGIPSDAIAV RASLGSGEPC YERVSAADGR
IWDMKYFPVR GEDGTVRGII ELASDMTEVV SLQAEALRSG QLAALGELAA GVAHEINNPI
NGIINYAQLL VNWLPSACKE RDIAERIIRE GDRVAGIVRG LLFFAREGMG ARLPCNVADV
LTDTLTLTEA QIRKDGITLK VGVPADLGRV RASHQQLQQV FLNIISNARY ALNEKFRGFH
SAKILEIRGE RVLIDDRPYI RIGFNDTGTG IPEAIKDKVM TPFFSTKPTC KGTGLGLSIS
QNIIRDHDGN LSIESREGEF TLVSIDLPAE ETP