Gene GSU0756 details


Gene Information

Locus tag: GSU0756
Symbol:
ID: 2687414
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 805794
End bp: 807443
Gene Length: 1650 bp
Protein Length: 549 aa
Translation table: 11
GC content: 64%
IMG OID: 637125428
Product: methyl-accepting chemotaxis protein
Protein accession: NP_951813
Protein GI: 39995862
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:
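
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 805794
END_BP = 807443


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, ASSUMING 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, ASSUMING its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1650
print(protein_length(length_bp))  # 549
```

Under these assumptions the computed values agree with the listed Gene Length (1650 bp) and Protein Length (549 aa).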


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCGCAT GGCAAAGCCT CAAGGTGAAG TACAAGATTT TCACGTTGAT TCTGGTATGT 
TGCGTCGGCC TGATTGCGGT CGGGCTGCTG GGGTTTGGCG GGATGAGGAG CATGGGCAAA
AGCCTCGGCG AGCTCAACGA GGAGCAGAAG TCCGTGGCCA CCCTGTCGGC AATGAAGAAC
GACTTCCTGG AGATGCGCTT GGCGATCGTC TACATGCTGG CGCTGACCGA TTCCGCCAAG
CTGGCCGAAA AAGAGCGGGA CTTCGGCGCG GCCGCCGCCA GGATCAAGGA ACGCCTGGCT
TCCCTGGAGC ACCACAATTT TGCCGCGGAA GAAAAGAAAA AAATTACGGA ATTCCGTGAC
GGCTATGAGG CATACCTTGC CGAGGGCACC AAGTCGGCGG CCATGGCCAA GGCAGCCGCC
GAAACGGGAA ACGCCGCAGG GCGCGAGGAA GCCGTGCGAT ATGCCGTAAC CACCGCGGCC
CCGCTCTACA ACAAGCCGGC GCAGGCACTT GCGGAACTGG TCGAGATCAG CATCAAGGAA
GGCGGTGAAG TGTACGACGC GGACATGGCG TCCTACCGCA GATCGGTCTT GGTCATGGGA
GTCATCCTCA TGGTGGTGGT CGCCGTCTCC GCCGTGGCCG GTCTGGCCAT CGCCTCATCC
ATCAGCGGGC CGCTGAACCG GATTCTCGAA GTGCTGCAGC GGGTGGCCGC GGGGGATCTG
ACCGCCCGGG CCGAGATCGA CAGCCGGGAC GAGATGGGGT TGCTGGGCCG CGAGTTGAAC
GTAACAGCTG AAAAAATCGG TAAAATTATT GGCCAGTTGG CACAGGCTGC CGGGTCGGTT
GCCTCCGCCT CTGCCCAACT CCACGCCACG GCCGAACAAA TGGCCACGGC CTCCGAAGAG
GTGGCGGCCC AGGCTGAAAC CATCGCCACC GCGGGCGAGG AGATGGCGGC CACATCCAAC
GACATTGCCC ACAATTGCGT CACCGCGGCA GAGGGGTCCA CCCAGGCCAA TGATGCTGCC
GAGGGGGGCG CCCAGGTGGT GCAGGCGGCC ATTGCAGCCA TGGACCGCAT TGCCGAGCGG
GTCCACGCCT CGGCGAAAAC CGTCGAGGGG CTGGGGGTGC GCAGTGAAGA GATCGGCGAG
ATCATCGGCA CCATTGAGGA TATCGCCGAT CAGACCAACC TGTTGGCCCT GAACGCAGCA
ATCGAGGCGG CCCGGGCCGG CGAGCAGGGG CGGGGCTTTG CCGTGGTGGC CGACGAAGTG
CGGGCCCTGG CCGAGCGGAC CAGCAAGGCG ACCCGCCAGA TCAGCGAGAT GATCCGGGCC
ATCCAGCACG ATACCCAGAG CGCGGTCCAC TCCATGGAAG AAGGGGTGTC GGACGTCCAG
GCCGGCACGG CCGAGGCGGC CCGCTCCGGG CAGGCCCTGC AGATGATCCT GGCCAAGATC
GGCGACGTCA CCAACCAAAT CAGCCAGATC GCCACGGCTG CCGAAGAGCA GACCGCCACC
ACCGGCGAGA TCAGCAACAA CATGCACCAG ATCAGCCAGG TGGTGCAGGA TACGGCCCGC
GGCGCCCAGG ACACGGTGGC CGCGGCCAAC TCCCTGTCTC GGTTGTCGGA AGATATGCAG
GGCATGGTGC AGCAGTTCCG GCTGGCCTGA
 
Protein sequence
MAAWQSLKVK YKIFTLILVC CVGLIAVGLL GFGGMRSMGK SLGELNEEQK SVATLSAMKN 
DFLEMRLAIV YMLALTDSAK LAEKERDFGA AAARIKERLA SLEHHNFAAE EKKKITEFRD
GYEAYLAEGT KSAAMAKAAA ETGNAAGREE AVRYAVTTAA PLYNKPAQAL AELVEISIKE
GGEVYDADMA SYRRSVLVMG VILMVVVAVS AVAGLAIASS ISGPLNRILE VLQRVAAGDL
TARAEIDSRD EMGLLGRELN VTAEKIGKII GQLAQAAGSV ASASAQLHAT AEQMATASEE
VAAQAETIAT AGEEMAATSN DIAHNCVTAA EGSTQANDAA EGGAQVVQAA IAAMDRIAER
VHASAKTVEG LGVRSEEIGE IIGTIEDIAD QTNLLALNAA IEAARAGEQG RGFAVVADEV
RALAERTSKA TRQISEMIRA IQHDTQSAVH SMEEGVSDVQ AGTAEAARSG QALQMILAKI
GDVTNQISQI ATAAEEQTAT TGEISNNMHQ ISQVVQDTAR GAQDTVAAAN SLSRLSEDMQ
GMVQQFRLA
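
The sequences above are printed in space-separated blocks of ten characters, so they need light cleaning before use. The following is a minimal sketch, not a definitive tool: it strips the block formatting, computes a GC fraction, and translates DNA with the codon assignments of translation table 11 (which match the standard table; table 11 differs only in its permitted start codons, and this gene starts with ATG). The names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical helpers introduced for illustration.

```python
# Codon table built from the conventional TCAG ordering.
# Amino acid assignments are the same for the standard table and table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(blocked_text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence listed above.
first_line = (
    "ATGGCCGCAT GGCAAAGCCT CAAGGTGAAG TACAAGATTT TCACGTTGAT TCTGGTATGT"
)
dna = clean_sequence(first_line)
print(len(dna))        # 60
print(translate(dna))  # MAAWQSLKVKYKIFTLILVC
```

The translated fragment matches the first 20 residues of the protein sequence above. Passing the full cleaned gene sequence to `gc_fraction` gives a value that can be compared with the listed GC content.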