Gene GSU1294 details


Gene Information

Locus tag: GSU1294
Symbol:
ID: 2686524
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 1410931
End bp: 1413819
Gene Length: 2889 bp
Protein Length: 962 aa
Translation table: 11
GC content: 59%
IMG OID: 637125968
Product: methyl-accepting chemotaxis protein
Protein accession: NP_952347
Protein GI: 39996396
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:
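
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function name `check_lengths` is hypothetical and is not part of the database.

```python
def check_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and a single terminal stop
    codon that is not counted in the protein length.
    """
    gene_length = end_bp - start_bp + 1      # inclusive span
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1    # drop the stop codon
    return gene_length, protein_length


# Values taken from the record above.
gene_len, prot_len = check_lengths(1410931, 1413819)
print(gene_len, prot_len)
```

Under those assumptions the computed values agree with the Gene Length and Protein Length fields of this record.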


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTACAGA ACATGCGGAT CGGTTTGCGG TTGGGGCTCG GGTTCGGATT AGTGGTTGTG 
TTGATGGTGA TCCTCGGCGG CATCTCCATC AATAGAATGG CTACCCTGAA CACAGACCTT
GACATGGTTG TCAAGGATCG CTGGCCCAAA GCGGAGACAA CCTTCGGAAT CAGTAGCCAG
ATCAACGTGG TGGCGCGGGC ATTGCGCAAC GCAATCCTTC TGGATGATCC AGCAGAGGTT
CAGAAGGAGA TTGCGCGAAT CAACGAAGCT TCGGTGAGCG TCAGCAAGTC GATGGATGAA
CTGTCCAAGA GCATCACCAG CGAGGAGGGG AAGGCCAAAC TGAAGGCGGT GGAGGCCTCC
CGCGCCGCAT ACCGGGAGGA TCTGCTGAAG CTGGTTGAGT ATATCAGGGC CGGCAACAAG
TCTGCTGCCC AGAAGATGCT CTTCGGCAGC TACCGCGAAA GGCAGCGAAG CTACTTCGAT
GCCGTTGACG GGTTGACACA GTATCAGGCC AAGCTCCTGG CGGTATCCGG CAAGGAGGCT
GAGCAGACCT TCGTTTCCTC CAGAAACATC ATCGTAGCGC TTCTCGTGCT GTCGGCCCTT
CTCGCATGTG CCTGCGCCTG GCTCGTGACC CGCTCCATCA CCAGACCCAT CGGCGCCTGC
ATGGCCGCTG CGGGCAGGAT CGCCTCCGGC GACACCGACG TGACCCTGGA CGTGACGGCG
CGTGACGAAA CCGGCCTGCT GCAGGTTGAG ATGCAGAAGA TGGTGGAAGC AATCAGGGCG
CTGATTGCGG ACGCCGACAT GCTCTCGCGG GCCGCGGTCG AGGGGCGTCT CGCGACCCGG
GCCGATGCGG CTAAGCACCA GGGCGATTTC CGCAAGATCG TCTCCGGCGT CAACGATACC
CTTGATGCGG TCATCACCCC ACTCAATGTG GCCGCCGACT ACGTGGATCG CATCTCCAGG
GGCGACATTC CGCCCCGGAT TACCGATACC TACAACGGAG ATTTCAATGA GGTGAAAAAC
AACCTCAACC GGTGTATCGA CGCGCTCAAC GGCCTGCTGA GCGACATGAA CGAGATGTCG
AAAATGCACG ATCTGGGCGA CATCGACGTG GTTATGCCCG CTGACAATTA CCAGGGCGCT
TACCGGATCA TGGCCAAGGG CGTGAACGAC ATGGTGAACG GTCACATCAG CGTCAAGAAA
AAAGCGATGG CCTGCGTGGC CGAGTTCGGC AAAGGGAATT TCGACGCCGA ACTGGAAAAG
TTTCCCGGCA AAAAGGCTTT TATCAACGAC ACGATCGAGG CAGTACGGAG CAATATCAAG
AACTTCATCG CCGACATGGG ACACATGTCC CAACAGCACG ACCTGGGTGA CATCGACGTT
AAAATGCCGG AAGACCAGTA CCAGGGGGCC TTCCAGGTCA TGGCCAAGGG TGTGAACAAC
ATGGTGGGCG GTCACATCAG CGTCAAGAAG AAGGCGATGG CCTGCGTGGC CGAGTTCGGC
AAAGGGAACT TCAATGCCGA CCTCGAAAAG TTCCCCGGCA AAAAGGCATT TATCAACGAG
ACGATCGAGG GCGTACGGAC CAATCTCAAG TCCTTCGAAG AGCAACTGGG AATCCTGATC
ACGGCCGCTG CCGACGGCCA ACTGGACAAG CGAGCCAACG CGGACCTCTT CGTGGGAGGC
TGGAACCAGC TTGCCCGGGG TGTCAACGAT ACCATCACCA ATATCGTCGA ACCGCTGATG
GTTACCGCCG ATTACGTGGA CCGGATCAGC AAGGGCGATA TGCCGCCCCT TATCACCAAG
GAATACCGGG GCCAGTACAA CATCATCAAA CAGAACCTGA ATACGCTCAT CGATGCCACC
AACGGCATCG TTGCGGCCGC GAAAGAGGTT GCCGGCGGCA ACCTGATGGT GGAGCTCAGG
GAGCGCTCGG CCAAGGATGA GTTGATGCAG GCCCTTTCCG CAATGGTAAA GAAGCTCTCC
GAGGTCGTGG CCGAGGTGAA GAGCGCCGCC AACAACGTGG CCGCGGGAAG TCGCGAAATG
AGTTCCGGCT CCGAGCAGAT GAGCCAGGGG GCCACCGAGC AGGCGGCTGC CGCGGAAGAG
GCGTCGTCAA GCATGGAGGA AATGTCTTCC AATATCAGGC AGAATGCCGA TAACGCCTCC
CAGACGGAGC GGATCGCCAT CAAGAGCGCC CAGGACGCCC GCGACGGCGG CAAGGCGGTG
GCCGAAACCG TCACCGCCAT GAAGGATATC GCCTCGAAGA TATCGATCAT CGAGGAGATT
GCTCGCCAGA CGAATCTCCT GGCACTGAAC GCGGCCATCG AGGCGGCCCG CGCCGGCGAG
CACGGCAAGG GATTCGCGGT GGTGGCGGCC GAGGTCAGAA AGCTTGCCGA GCGGAGCCAG
AAGGCTGCCG GCGAGATCAG TGACCTGTCG GCCTCCAGCG TGGAGGTGGC GGAGAAGGCC
GGCGAGATGC TGGGTCGAAT CGTGCCCGAC ATTCAGAAAA CCGCGGAACT GGTCCAGGAG
ATCAGTGCAG CCAGCAAGGA ACAGGACACA GGCGCCGAGC AGATCAACCG GGCCATCCAG
CAACTGGACC AGGTGATCCA GCAGAATGCC AGCGCAGCCG AAGAAATGGC ATCCACGGCA
GAAGAACTGT CTGCCCAATC GGAACAGCTC CAGTCGATCA TTTCCTTCTT CCGGGTCGAT
AGCTCGGCGC AGTCCTCCAG TGCGATTGCG GCTGCAAAGC CTGCGGCGAA AAAGCCGGCC
CTTGCCCACG CGCCGGCCAA CGGGTACCAC AAGGCAAATC AGGCTCCGGC CAAGAAGGTG
GCACATGCAG GACTCAACCT CAATCTTGAA GGGGGAGATC ACCTGGATTC CGAGTTCGAA
ACGTTCTAG
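
The GC content reported for this gene can be recomputed from the sequence block above. This is a minimal sketch, assuming GC content means the percentage of G and C bases over the full gene sequence; the helper name `gc_percent` is hypothetical. Whitespace is stripped first because the sequence is printed in space-separated groups of ten bases.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a DNA sequence.

    Spaces and line breaks (as used in the block layout above)
    are ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Example on the first two 10-base groups of the gene sequence.
print(gc_percent("ATGCTACAGA ACATGCGGAT"))
```

Passing the complete gene sequence to the function gives the figure to compare against the GC content field.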
 
Protein sequence
MLQNMRIGLR LGLGFGLVVV LMVILGGISI NRMATLNTDL DMVVKDRWPK AETTFGISSQ 
INVVARALRN AILLDDPAEV QKEIARINEA SVSVSKSMDE LSKSITSEEG KAKLKAVEAS
RAAYREDLLK LVEYIRAGNK SAAQKMLFGS YRERQRSYFD AVDGLTQYQA KLLAVSGKEA
EQTFVSSRNI IVALLVLSAL LACACAWLVT RSITRPIGAC MAAAGRIASG DTDVTLDVTA
RDETGLLQVE MQKMVEAIRA LIADADMLSR AAVEGRLATR ADAAKHQGDF RKIVSGVNDT
LDAVITPLNV AADYVDRISR GDIPPRITDT YNGDFNEVKN NLNRCIDALN GLLSDMNEMS
KMHDLGDIDV VMPADNYQGA YRIMAKGVND MVNGHISVKK KAMACVAEFG KGNFDAELEK
FPGKKAFIND TIEAVRSNIK NFIADMGHMS QQHDLGDIDV KMPEDQYQGA FQVMAKGVNN
MVGGHISVKK KAMACVAEFG KGNFNADLEK FPGKKAFINE TIEGVRTNLK SFEEQLGILI
TAAADGQLDK RANADLFVGG WNQLARGVND TITNIVEPLM VTADYVDRIS KGDMPPLITK
EYRGQYNIIK QNLNTLIDAT NGIVAAAKEV AGGNLMVELR ERSAKDELMQ ALSAMVKKLS
EVVAEVKSAA NNVAAGSREM SSGSEQMSQG ATEQAAAAEE ASSSMEEMSS NIRQNADNAS
QTERIAIKSA QDARDGGKAV AETVTAMKDI ASKISIIEEI ARQTNLLALN AAIEAARAGE
HGKGFAVVAA EVRKLAERSQ KAAGEISDLS ASSVEVAEKA GEMLGRIVPD IQKTAELVQE
ISAASKEQDT GAEQINRAIQ QLDQVIQQNA SAAEEMASTA EELSAQSEQL QSIISFFRVD
SSAQSSSAIA AAKPAAKKPA LAHAPANGYH KANQAPAKKV AHAGLNLNLE GGDHLDSEFE
TF
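
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is an illustration, not the database's own pipeline. It builds the codon table from the standard compact NCBI layout (bases ordered T, C, A, G). Translation table 11 assigns the same amino acids to codons as the standard table and differs only in the permitted start codons, so the same mapping is used here. The function name `translate` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence in frame 1 until the first stop codon."""
    bases = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":          # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence above.
first_line = ("ATGCTACAGA ACATGCGGAT CGGTTTGCGG "
              "TTGGGGCTCG GGTTCGGATT AGTGGTTGTG")
print(translate(first_line))
```

The first 60 bases translate to the first 20 residues of the protein sequence shown above, and the final codons ACG TTC TAG give the terminal TF followed by a stop.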