Gene GSU0471 details

Gene Information

Locus tag: GSU0471
Symbol:
ID: 2686187
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 500664
End bp: 501923
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 64%
IMG OID: 637125138
Product: sensor histidine kinase
Protein accession: NP_951530
Protein GI: 39995579
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
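
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function name `check_cds_lengths` is hypothetical, and it assumes that the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length.

```python
def check_cds_lengths(start_bp: int, end_bp: int,
                      gene_length_bp: int, protein_length_aa: int) -> bool:
    """Return True if the coordinate span, gene length, and protein length agree.

    Assumptions (not stated by the record itself):
    - start_bp and end_bp are 1-based, inclusive coordinates;
    - the final codon is a stop codon and is excluded from the protein length.
    """
    span = end_bp - start_bp + 1          # inclusive span in base pairs
    if span != gene_length_bp:
        return False
    if span % 3 != 0:                     # a complete CDS is a whole number of codons
        return False
    codons = span // 3
    return codons - 1 == protein_length_aa  # subtract the stop codon


# Values copied from the Gene Information fields above.
print(check_cds_lengths(500664, 501923, 1260, 419))
```

With the values from this record, the span is 501923 - 500664 + 1 = 1260 bp, which is 420 codons, or 419 residues plus one stop codon.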


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.328588
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATTCGA CAACAATCCT ACGTATGCGC GGCAATATCG TTTCCCTCAT CGCCCTTGCG 
GCCGGAGTCA TCCTGTCACT GCTTCTCGGC TGGTTCGCCG TGGGAAACTA CCGGAGCGCA
CGCCCCATTG CCGAAGGGAA CCTCCGGGGG CTTGCCCTTT CACTCACGTC GGCGCTGGAA
TCGATTGCGG CACGCGACTC GTCCCTGGCT TCACTGGCCG CCTTTCGCGC CCGGGACATC
GCCTATATCT CGGTCATCGA CCGCAACGGC ACCATCGTCT TTCACTCCAA TGCCGACCTG
ATCGGATCGC GGGTGACGGA TCAACGGTAC GTGACGGTTC TCGGCGGACG GGGCTTGGCG
GAAAACCGCA TCAGACTCGG CACGGGCGAA GAGGTGTACG AATATCATGC CCCCCTCCAC
CTCCCCGGCC GAACCCTGGC CCTGCGCCTG GCACTTCACC CCTGGCGAGC CGATGCGGTG
ATCCACCGGG CCAGGGTAGG CATGGTCGTG CTGTTTTCGC TGCTGGCGGC GGCCTGGACG
ATGGGGGTGC TCCTCTATCG CTACGCCCGC CGGGCCCAGG AGCACCGGCT CGAAATGGTC
CGACGGGAGC GGCTTGCGCA ACTGGGAGAA ATGGGGGCGG TGCTTGCCCA CGAGGTGCGC
AACCCCCTGT CCGGGATCAA GGGCTACGCC CAACTGCTCA TGGAACGGAG CAACGACGAT
GAAAACCGGG AGTTCTCCGC ACTGATCGTC ACCGAGGCAA TCCGGCTCGA ATCGCTCGTC
AGCGACCTTC TTGCCTACGC CCGGCCGGAG CCCGGGCCAG AAGGGCCGCT CCAGGTAAAC
GCGGTGATTG ACCATGTGCT GGCACTGGTG GACCCCGAAG CGCGGGCCGC CGGCGTCACC
ATTGCGGCAT CCCTTGCCGA AGGATTGGCC ACAAGAGGAA ATGAAGCGCG GTTGGAGCAA
CTCATTCTCA ATCTGGCAAA GAACGGCATT CAGGCCATGC CGGACGGGGG AACGCTCACC
GTTGTCACCC GGCGCGAAGG TAAAACGGTC GAGATCAGTG TGGCAGACCA CGGCCACGGC
ATCGCCCCCC ACGACCGGGA GCGGATATTC ACCCCGTTCT TCACCACAAA GGCCCGGGGC
AGCGGCCTGG GGCTCGCCGT CTGCCGCAAG ATAGCCGAAG CCCATGGGGG GAGCATCAGC
GTGGCGGATA ATCCCGGCGG CGGCACCGTT TTTCGGGTAA CACTCCCCCT TCACCGATGA
 
Protein sequence
MDSTTILRMR GNIVSLIALA AGVILSLLLG WFAVGNYRSA RPIAEGNLRG LALSLTSALE 
SIAARDSSLA SLAAFRARDI AYISVIDRNG TIVFHSNADL IGSRVTDQRY VTVLGGRGLA
ENRIRLGTGE EVYEYHAPLH LPGRTLALRL ALHPWRADAV IHRARVGMVV LFSLLAAAWT
MGVLLYRYAR RAQEHRLEMV RRERLAQLGE MGAVLAHEVR NPLSGIKGYA QLLMERSNDD
ENREFSALIV TEAIRLESLV SDLLAYARPE PGPEGPLQVN AVIDHVLALV DPEARAAGVT
IAASLAEGLA TRGNEARLEQ LILNLAKNGI QAMPDGGTLT VVTRREGKTV EISVADHGHG
IAPHDRERIF TPFFTTKARG SGLGLAVCRK IAEAHGGSIS VADNPGGGTV FRVTLPLHR
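
Both sequences are printed in blocks of ten characters, so the spaces and line breaks have to be removed before the sequences can be measured or compared with the length and GC content fields. The following is a minimal sketch of that step; the helper names `ungroup` and `gc_fraction` are hypothetical and are not part of any database tool.

```python
def ungroup(block: str) -> str:
    """Join a sequence printed in space-separated groups into one string."""
    # str.split() with no arguments splits on any run of whitespace,
    # so both the spaces between groups and the line breaks are removed.
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C (seq must be non-empty)."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First printed row of the gene sequence above, used as a small example.
first_row = "ATGGATTCGA CAACAATCCT ACGTATGCGC GGCAATATCG TTTCCCTCAT CGCCCTTGCG"
dna = ungroup(first_row)
print(len(dna), round(gc_fraction(dna), 2))
```

Applied to the full gene sequence, the same two helpers give a length and GC fraction that can be compared with the Gene Length and GC content fields of the record.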