Gene GSU1249 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1249 
Symbol 
ID2688158 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1350455 
End bp1352029 
Gene Length1575 bp 
Protein Length524 aa 
Translation table11 
GC content59% 
IMG OID637125923 
Productsensory box histidine kinase 
Protein accessionNP_952302 
Protein GI39996351 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0948878 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCAATC CCGAGCGGGC CGCACGGCGC GCGGCGCGAA AAATACGTTT CTACGAGTCA 
CCGGTCAGGA TCACCATCGC CCTGGGCGGC ATTATCTTCG TGGCCGAAGC CGTTGTTATG
GTCGTGATGG TGCATGTCCT GCGTTTCCCG GGCAGGGCAC CCGTTGTCGA CCTGTTTACC
GATTCGCTGA TGCTGCTTCT GCTCATCGCC CCGTTTCTCC ACCGATTCAT CTTCAAACCG
ATGGCCGCCC AGATCGCCCA GCGAGAAGAG GCCGAGACCC GCCTAAGGAG GTATCAGTCC
AACCTGGAAT CGCTGGTGGA GGACCGGACT GCCCGACTGA CCGAGGCAAT CGAGGAGCTG
GAGCGTGAAA TCGTCGAGCG GAAACGGACC GAAGAGGCGC TTCTCAGGAG CGAGGAGCGT
TTCCGTCAAC TGTTCGAACA GACCGAGGAT GCCATTATCC TCTTTCGCCC CGGCGGGTGT
CGCATCATCG ACGTGAATCC GGTTGCGGAG AGACTCTACG GCTTTTCCAA AAAGGAGTTC
TTCGAGCTTG GGCTCGAGTC GTTGATGGGC CCCAGTGATC TCGATACGTT CTGCCGGGCC
ATCGGCCAGG TCAGGATGGG AGACTCGATC CGGATCGACA CCATGACCCA CACCCGCAAG
GACGGCTCGG AAGTCATCGT TTCGGTGCGG GGCAAAATGG TTACGATCCA GCAGGTTGAC
ATGGTTTACT GCACTATCCG CGACATTTCG AAGCGGATTC GCCTCGAGAA GGAAAGCCGG
CTGATCCAGG CAAGGCTCAT TCATACCAAC AAGATGACCT CCCTGGGGGT GCTCGTGGCC
AGCATCGCCC ACGAGATCAA CAACCCGAAC AACTACATCA TGGTCAACTC GGAGATACTG
CGCCGTTCCT GGAACGACAT CTATCCGATC CTGCGCGACT ATTACGACGA ACATGGTGAC
TTTACCATTG GCGGCATCCC TTTTTCGGAG ATGCGGGAGG CCTTCCCCGA ACTGATTGCA
GGGGTCCACG ACGGTGCGCG CAGGATTCGC GATATCGTCA ACAATCTCAA GGATTTTGCG
CGCAACGAGG CTTGCTCAAT CGCGGGAAAC GTGGACGTGA ACCGGGCCAT AACCATGGCG
GCGACCATGC TGAACCACCA GATCCGCAAG CATACGCGGC ATTTCCGGCT GGAACTCGCC
GAAGATCTCC CGCTTGTCCG GGGGAGCCTC CAGCAGCTCG AACAGGTCAT CATCAACCTG
ATCATGAATG CAATCCAGGC GTTGCCCACC GAGGAGCGGG GCGTAACGGT TTCGACCACC
CGCGACGGGA ACGACGGGGG GGTAGTGATC AGGGTTGCGG ACCAGGGCAG CGGCATTCCG
ATCGAGATCT CCGACAGCAT TCTGGAGCCG TTTTTTACGA CGCGCCTCGA CAGCGGCGGC
ACGGGGCTGG GGCTCGCCAT CTGCCATTCG ATCGTCCGGG ATCATGGGGG GGACCTGGAG
TTCACGTCAG TGCCGGGCGA AGGAACTATC TTTACCGTAC ACCTGCCTGC CGCGAGCAAT
CAGGGGGTGG CATGA
 
Protein sequence
MTNPERAARR AARKIRFYES PVRITIALGG IIFVAEAVVM VVMVHVLRFP GRAPVVDLFT 
DSLMLLLLIA PFLHRFIFKP MAAQIAQREE AETRLRRYQS NLESLVEDRT ARLTEAIEEL
EREIVERKRT EEALLRSEER FRQLFEQTED AIILFRPGGC RIIDVNPVAE RLYGFSKKEF
FELGLESLMG PSDLDTFCRA IGQVRMGDSI RIDTMTHTRK DGSEVIVSVR GKMVTIQQVD
MVYCTIRDIS KRIRLEKESR LIQARLIHTN KMTSLGVLVA SIAHEINNPN NYIMVNSEIL
RRSWNDIYPI LRDYYDEHGD FTIGGIPFSE MREAFPELIA GVHDGARRIR DIVNNLKDFA
RNEACSIAGN VDVNRAITMA ATMLNHQIRK HTRHFRLELA EDLPLVRGSL QQLEQVIINL
IMNAIQALPT EERGVTVSTT RDGNDGGVVI RVADQGSGIP IEISDSILEP FFTTRLDSGG
TGLGLAICHS IVRDHGGDLE FTSVPGEGTI FTVHLPAASN QGVA