Gene GSU1029 details


Gene Information

Locus tag: GSU1029
Symbol:
ID: 2685753
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 1107913
End bp: 1109562
Gene Length: 1650 bp
Protein Length: 549 aa
Translation table: 11
GC content: 62%
IMG OID: 637125699
Product: methyl-accepting chemotaxis protein
Protein accession: NP_952083
Protein GI: 39996132
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:
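
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. All variable names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1107913
end_bp = 1109562
gene_length_bp = 1650
protein_length_aa = 549

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A complete CDS is a whole number of codons. The last codon is assumed to be
# a stop codon, which does not contribute a residue to the protein.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, remainder, expected_protein_aa)
```

Under these assumptions the computed span and protein length agree with the Gene Length and Protein Length fields.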


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.392996
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGCAT GGCGGGACTT GAAGGTGAGA ACAAAAATTT TCGTCCTGGT GATTGCCGGA 
TGTCTGGGGC TCGTGGTACT GGGATCGGTG GCGCTTTACA ATATGCGCAA CCTGAGCGGC
AGCGTGAAGG AAGCCAACAT CGGCATGGAG CACGTGGCGG GGCTTTCCGG TATGAAGAGC
GACTTTCTCG AGATGAGGCT GGCGCTCGTC TACATGCTTG CCCTGAAAGA TGCGGAAAAG
ATCGGCGGCA AGGAACAGGA TTTCCTGAAG GCCGCTGACA GGATCAAAAA GACACTCGAC
GACCTGGGCA AGCAGGAACT GACTGACACC GAGAAAAAGT CCCTCGTCGA GTTCAGGGGT
GGCTTCGAGT CTTATGTCGA GAAGGGAACG AGACTCGCCG AGCTAATCAA GGACGCTACC
GCCAAGGGAG ACGAGGTGGG CCGGGCCGAT GCCATGACCT TCGCCACCCA GAGCGTGGCC
CCCCTCTACG ATACCCCGGC CAAGATCATT GCCTCGATGG TGCAGGAAAA TATCGGCGAA
GCTCATAAGA TGTATGAGCA GGACATGGCC TCGTACCGAG CTTCCTTCAT TATGATGGTG
GTGATCATTC TGGGGGTGAT CGGCGTCGCC GCTGCGGCGG GCCTGGCCAT CGCCGGTTCC
ATCAGCGGTC CTCTCAACAA GGTGCTCGAT GTACTCACCC GCGTGGCCGC CGGTGACCTG
ACAGCCCGGG CCGACGTCGT CAGTGCCGAC GAAATGGGGC TGCTGGCGCG TGAGGTGAAC
ACTACCGCGG CCAAGATCAA CGAGATCATC GGCCTTGTTG CCCACAATGC CTCCCAGGTG
ACTGCCGCGG CGACCCAGCT CCATGCCACC TCCACCCAGA TGTCCACCGG CGCTGAGGAG
GTGGCCCAAC AGGCCGCCAC CGTGGCCACG GCCAGTGAGG AGATGGCTGC CACCTCGGCC
GAGATCGCCC ATAACTGCTC CCTGGCGGCT GAAAGCTCCC GACACGCCAA CGATCGGGCC
GAGAACGGTT CGGATGTGGT GCAGGAAACC CTGACCGTCA TGAACCGCAT CGCCGAGCGG
GTGAAGGATT CGGCACGCAC CGTCGAATCT CTGGGCGAGC GGAGCGACCA GATCGGCGAG
ATCATCGGCA CTATCCAGGA CATCGCCGAC CAGACCAATC TCCTTGCTCT CAACGCGGCC
ATCGAAGCCG CCCGCGCTGG CGAACAGGGA CGCGGCTTCG CCGTGGTCGC CGACGAGGTT
CGGGCCCTGG CCGAGCGGAC AACCAAGGCC ACCAAGGAGA TATCCCAGAT GATCAAGGCG
ATCCAGGGGG AAACCAAGGG CGCCGTCACT TCCATGGAGG AGGGGGTCAA AGAGGTGGAA
AAGGGAACCT CGGACGCATC CAAATCGGGC GAGGCCCTGC AGGCGATCCT GGAGCAGATC
GGCGGCGTTA CCATGCAGGT GAGCCAGATT GCCACTGCCG CCGAGGAACA GACCGCGACC
ACGGGTGAGA TCAACAACAA CATCCAGCAG ATCACGGAGG TGGTCCAGCT CACCGCGCGG
GGCGCCGAAG AGTCGGCCCA GGCCGCCGAG CAGCTGGCGA AACTGGCCGA GGAACTGCAG
GACCTGGTGT ACAAGTTCAA ACTCGCCTGA
 
Protein sequence
MSAWRDLKVR TKIFVLVIAG CLGLVVLGSV ALYNMRNLSG SVKEANIGME HVAGLSGMKS 
DFLEMRLALV YMLALKDAEK IGGKEQDFLK AADRIKKTLD DLGKQELTDT EKKSLVEFRG
GFESYVEKGT RLAELIKDAT AKGDEVGRAD AMTFATQSVA PLYDTPAKII ASMVQENIGE
AHKMYEQDMA SYRASFIMMV VIILGVIGVA AAAGLAIAGS ISGPLNKVLD VLTRVAAGDL
TARADVVSAD EMGLLAREVN TTAAKINEII GLVAHNASQV TAAATQLHAT STQMSTGAEE
VAQQAATVAT ASEEMAATSA EIAHNCSLAA ESSRHANDRA ENGSDVVQET LTVMNRIAER
VKDSARTVES LGERSDQIGE IIGTIQDIAD QTNLLALNAA IEAARAGEQG RGFAVVADEV
RALAERTTKA TKEISQMIKA IQGETKGAVT SMEEGVKEVE KGTSDASKSG EALQAILEQI
GGVTMQVSQI ATAAEEQTAT TGEINNNIQQ ITEVVQLTAR GAEESAQAAE QLAKLAEELQ
DLVYKFKLA
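
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It is not part of the original record and uses only the Python standard library. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which start codons are permitted), and this gene starts with ATG, so no special handling of alternative start codons is included. The names `CODON_TABLE` and `translate` are illustrative, and only the first and last lines of the gene sequence are used to keep the example short.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
# The amino acid assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to format the sequence blocks.
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence above. Every full line holds
# 60 bases, so the last line also starts on a codon boundary.
first_line = "ATGAGCGCAT GGCGGGACTT GAAGGTGAGA ACAAAAATTT TCGTCCTGGT GATTGCCGGA"
last_line = "GACCTGGTGT ACAAGTTCAA ACTCGCCTGA"

print(translate(first_line))
print(translate(last_line))
```

The two printed strings correspond to the first 20 residues and the last 9 residues of the protein sequence listed above.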