Gene GSU1304 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1304 
Symbol 
ID2686494 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1428274 
End bp1429974 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content62% 
IMG OID637125978 
Productmethyl-accepting chemotaxis protein 
Protein accessionNP_952357 
Protein GI39996406 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCTGGA AGAACCTGAA GCTGAGTCTC AAGGTGCTTG CGGGCATCGG TAGTGTCCTT 
GTGTTGCTGG CGGTCATCGG CGTCTGGGCG GTGAACGGCC TGTCAAAGGT TGTCAGGGAC
GGCCACGAGG TAAGCGAAGG GAACAAACTG CGGGCCGAGC TCCTCCAACG GGAGGTGGAT
CACCTCAACT GGGCCAAGAA CGTCAGCACG TTCCTGCTGG ATGGCAAGGT CAGGGAGCTC
ACCGTTCAGG TGGATCACAC CAAGTGCAAG TTCGGCGAGT GGTACTACGG CGAGGGACGC
AAGCAGGCCG AGGCCATGCT TCCCGCGCTG AAGGACGAAC TGGCTGCCAT CGAGGAACCC
CACCGCAAGC TCCACGAGTC GGCAGGGCTC ATCAAGAAGG CCTACAACAA AGAGCAGGGT
GAGCAGGGCC GCAAGGACGC GGAGATCATC TTCGCGAGTC AGACCCAGCC GAACCTGCAG
CTGGTGCAGA AGCACCTGGC CGGGCTAAAT GAGACATCCC GCAAGAATAT CCTCTCCGAT
GAACAGATGA TCGCCAACGC CAACAGCACC AAGACCGCCG TGATCGCCCT GAGCATTGCC
GCCCTGGTAA TCGGCGTTGT TCTGGCACTG CTCATCTCCC GTTCCATCAG CATTCCGGTC
CTGAAGGGGG TTGAATTCGC CCTGAAGATA GCTGACGGCG ACCTGCGGAG CACCCTCGAC
ATTGATCGGA AGGACGAGGT GGGGCAGTTG GTGGCGGCCC TCAACGACAT GGTTGCCAAG
CTCCGTGACA TTGTAACCGA TGTGAAGAAC TCGGCCGATA ACGTTGCTGC CGGCAGCCAG
GAGCTCTCCT CCAGCTCCGA GGTCATGAGT CAGGGAGCCA CCGAGCAGGC AGCCGCGGCC
GAAGAGGCCT CCAGCTCCAT GGAGCAGATG GCGGCCAATA TCCGGCAGAA CGCGGACAAC
GCCTCTCAGA CTGAGAAGAT CGCCCTGAAG AGCGCCACTG ATGCCCGCGA GGGTGGCAAG
GCTGTTGCCG GCACGGTGAG TGCCATGAAG GAAATCGCCT CCAAGATCTC GATCATCGAG
GAGATAGCCC GTCAGACGAA CCTGCTGGCG TTGAACGCAG CCATCGAAGC GGCGCGTGCC
GGTGAGCACG GCAAAGGGTT CGCAGTGGTA GCGGCCGAGG TCAGAAAGCT TGCCGAACGG
AGCCAGAAGG CAGCGGGCGA GATCAGCGAA CTGTCTGCCT CCAGCGTTCA GGTAGCGGAA
GAGGCCGGCG AGATGCTGGC GCGGATGGTG CCGGATATCC AGCGTACCGC GGAACTGGTG
CAGGAGATCA GTGCCGCGTG CAAGGAGCAG GACTCGGGAG CGGAGCAGAT CAACAAGGCG
ATCCAGCAGC TTGACCAAGT GATCCAGCAG AACGCCAGCG CCAGCGAAGA GATGGCGTCC
ACCAGCGAAG AGCTGGCCGG GCAGGCGGAG CATCTCCAGT CGACCATCAC ATTTTTCAAA
ACCGACGAGC AGGGGCGGGC TGCCGGGAGA TCTCCGGCCG TGCGTCCGGC CGCGGTGGCG
AAGAAGCCCG CTGCGTTGCG CCTGGGCCAC GGCAACGAAC GCCGGACGGA GCCTGTCGCG
CCGCGAAAGG CTGTAGCCGG CAAGGGCGTG GACCTGAAGA TGGACGGCGA CTACCTGGAT
GACCAGTTCG AAAAGTTCTA G
 
Protein sequence
MSWKNLKLSL KVLAGIGSVL VLLAVIGVWA VNGLSKVVRD GHEVSEGNKL RAELLQREVD 
HLNWAKNVST FLLDGKVREL TVQVDHTKCK FGEWYYGEGR KQAEAMLPAL KDELAAIEEP
HRKLHESAGL IKKAYNKEQG EQGRKDAEII FASQTQPNLQ LVQKHLAGLN ETSRKNILSD
EQMIANANST KTAVIALSIA ALVIGVVLAL LISRSISIPV LKGVEFALKI ADGDLRSTLD
IDRKDEVGQL VAALNDMVAK LRDIVTDVKN SADNVAAGSQ ELSSSSEVMS QGATEQAAAA
EEASSSMEQM AANIRQNADN ASQTEKIALK SATDAREGGK AVAGTVSAMK EIASKISIIE
EIARQTNLLA LNAAIEAARA GEHGKGFAVV AAEVRKLAER SQKAAGEISE LSASSVQVAE
EAGEMLARMV PDIQRTAELV QEISAACKEQ DSGAEQINKA IQQLDQVIQQ NASASEEMAS
TSEELAGQAE HLQSTITFFK TDEQGRAAGR SPAVRPAAVA KKPAALRLGH GNERRTEPVA
PRKAVAGKGV DLKMDGDYLD DQFEKF