Gene GSU0683 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU0683 
Symbol 
ID2685391 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp718970 
End bp720676 
Gene Length1707 bp 
Protein Length568 aa 
Translation table11 
GC content60% 
IMG OID637125355 
Productmethyl-accepting chemotaxis protein, putative 
Protein accessionNP_951740 
Protein GI39995789 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.268569 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGTGGT TCGAGGAACT CAAGGTTTCA AGCAAGCTGG CCGTGTCATT CATGGTGGTC 
ATAGTTCTGA CGACTTTCCT TGGAATCTTT TCCATATTCG AGCTGTCGCG AGTCAATGAG
ACGGGAACCG ACATGGCGGA GAACTGGATC CCCAGCCTCA ACGCGATATC GGCCATGCAA
CTGGATTTCG CCAGCTACCG CCGTCTTGAG CTGCAGCATA TTCTGGAGGT GGAGAGCGCC
GGGCAGAAAA CGTACGAAGA GAGAATGGCG GGGCTCGTCA AGAGTATTGC CGAGCACCAG
AAAGAATACG AACCATTGCT GTCAACGCCG GAAGAGAAGC AAATGCTTCA GGAGTTCAGC
ACAAAATGGC AGGAATACCT GAACGAAGGC AAGCCGGTCC TTGAGCTGTC CCGGCAGAAT
AAGGCCCAGG AAGCAGCCGC CCTCCTGAAT GCAAACTCGC GCAAACTGTA CAACGAAGCC
GGAGCCCTGA TCGACAAGCT CAAGACGCTG AACACGCAGG AGGCCAAAGA TGCCAGCGCA
CGTGGCGATA AACTCTATTC CTCGGCACGT ATCTGGATCA TCGGTTCGCT GATCGCCTGT
ATTGTCCTTG CAGTAGTCAT GGGGCTGGTA ATTACGCGGG TGCTTCTCAG GCAGCTTGGC
GGGGAGCCGA CGGCAATTGC CGATATTGCC AACAAGCTTG CGGATGGCGA TTTGCGCATC
GCCTTTGACA CCACCGGCAA GGCGGAAACG GGCGTGTATG CGGCGATGCA CAACATGGTT
GAGAAGCTGA AGGGAGTGGT TGCGGACGTG AAGAGCGCGG CGGACAACGT TGCCGCCGGC
AGCCAGGAGC TCTCCTCCAG CTCCGAGGAG ATGAGCCAGG GTGCCACCGA GCAGGCGGCC
GCGGCCGAAG AGGCCTCCAG TTCCATGGAG CAGATGAGCT CCAACATCCG GCAGAACGCG
GACAACGCCA CCCAGACCGA GAAGATCGCC CTGAAGAGCG CCTCCGACGC CAAGCAGGGG
GGCACGGCCG TGGCTGAAAC CGTCGTGGCC ATGAAGGAAA TCGCCTCCAA GATCTCGATC
ATCGAGGAGA TCGCCCGTCA GACGAACCTG TTAGCGCTGA ACGCGGCCAT TGAAGCGGCG
CGAGCCGGCG AGCACGGCAA AGGGTTCGCG GTGGTGGCGG CCGAGGTCAG AAAGCTTGCC
GAGCGGAGCC AGAAGGCGGC GGGCGAGATC AGCGAGCTGT CTGCCTCCAG CGTGCAGGTG
GCGGAAGACG CCGGCGAGAT GCTGACGCGG ATCGTGCCGG ATATCCAGCG TACCGCGGAG
CTGGTGCAGG AGATCAGTGC CGCGTGCAAG GAGCAGGACA CGGGCGCGGA GCAGATCAAC
AAGGCGATCC AGCAGCTTGA CCAGGTGATC CAGCAGAATG CCAGCGCCAG CGAAGAGATG
GCGTCCACCA GCGAAGAACT GGCCAGCCAG GCCGAACAAC TGCAGGCAAC CATTTCATTC
TTCCGGACCG ATGATCGTGG CGCGTCGAGC CGGAGTGCGG CCCGTCGGCC CGTTGCCAAG
AAAAAGGCAG CGATCTCTCA TTTGGGTCAC GGTATGTCCA ACGGCTACCA CACCGAGCCC
GCGACGTCGC GAAAAGTAGC GGTAGGCGGC GGGGTGGATC TGAACCTGGA CACCGATCAC
CTGGATGACC AGTTCGAGAA ATTCTAG
 
Protein sequence
MKWFEELKVS SKLAVSFMVV IVLTTFLGIF SIFELSRVNE TGTDMAENWI PSLNAISAMQ 
LDFASYRRLE LQHILEVESA GQKTYEERMA GLVKSIAEHQ KEYEPLLSTP EEKQMLQEFS
TKWQEYLNEG KPVLELSRQN KAQEAAALLN ANSRKLYNEA GALIDKLKTL NTQEAKDASA
RGDKLYSSAR IWIIGSLIAC IVLAVVMGLV ITRVLLRQLG GEPTAIADIA NKLADGDLRI
AFDTTGKAET GVYAAMHNMV EKLKGVVADV KSAADNVAAG SQELSSSSEE MSQGATEQAA
AAEEASSSME QMSSNIRQNA DNATQTEKIA LKSASDAKQG GTAVAETVVA MKEIASKISI
IEEIARQTNL LALNAAIEAA RAGEHGKGFA VVAAEVRKLA ERSQKAAGEI SELSASSVQV
AEDAGEMLTR IVPDIQRTAE LVQEISAACK EQDTGAEQIN KAIQQLDQVI QQNASASEEM
ASTSEELASQ AEQLQATISF FRTDDRGASS RSAARRPVAK KKAAISHLGH GMSNGYHTEP
ATSRKVAVGG GVDLNLDTDH LDDQFEKF