Gene GSU1033 details

Gene Information

Locus tag: GSU1033
Symbol:
ID: 2685619
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 1113816
End bp: 1115417
Gene Length: 1602 bp
Protein Length: 533 aa
Translation table: 11
GC content: 64%
IMG OID: 637125703
Product: methyl-accepting chemotaxis protein
Protein accession: NP_952087
Protein GI: 39996136
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:
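
The coordinate and length fields above can be checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive.
start_bp = 1113816
end_bp = 1115417

gene_length = end_bp - start_bp + 1   # inclusive span on the replicon
codons = gene_length // 3             # complete codons, stop codon included
protein_length = codons - 1           # the stop codon encodes no residue

print(gene_length, protein_length)    # prints "1602 533"
```

Under these assumptions the result agrees with the listed Gene Length (1602 bp) and Protein Length (533 aa).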


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAATCA GGACCAAATT CGTAGTCGTC AATCTGTTGA TCGTCTGCTG CGCCCTTGCC 
GCCGTGGCGG CCGCCTGCCT CGTGGAGTTC AACCGCGAGT TGCGCCGCCA GGCCGTGACC
TCCCAGGAGA TCAGACTCAA GACCTTCTGG GAGCTGTTAC GCCAGAAGGG GGACGGCTTC
ACCGTCGCCG ACGGTAAACT GATGGCAGGC AGCTATGTCA TCAACGATAA TTATGAGCTT
CCCGACAAGC TGAAGGAGCT GACCGGGGGG ACCGCCACCA TTTTCATGGG GGATACGCGG
GTCTCCACCA ACGTGCTCAA GCCGGACGGA AGCCGCGCCG TCGGAACCAA GCTGCAGGGC
GCCGCCTACG ACGCCGTAAT AAAGGAAGGC AAACCCTACC GGGGGGAGGC GGATATCCTC
GGCGTTCCCT ACTTCACCGC CTACGACCCG ATCCGCGATT CCCGGGGCGA AGTAATCGGC
GTACTCTACG TGGGGGTCAA GAAAGGCGAT TTCTATGCGT CCTACGAGAG TCTCAAACTG
ACAGTGGTCG GAATCGTGCT GGTCATCGTG CTGCTGGCCG CCGTTGCCAG TAAGGTAATC
ATCCACCGGC TCTTCACCCC CCTCAATCGT ATGCACGACG TGCTGCGGGA CGTGGCCCAG
GGTGAGGGTG ACCTGACTCA GCGGCTCGAT TACCTCGCAC AGGACGAGGT CGGCGACATG
AGCCGGTCGT TCAATTCATT CATGGACAAG CTCCACGGTA TCATCACCCA TGTTGCCCGG
ACCGTGGAGC AACTCGCCTC GTCCGCGTCC CAGGTACACG GCTCTGCCGA GCAGATGGCC
GCCGGCGCGG GAGAGGTGGC CTCCCAGGCG GGGACGGTGG CCACGGCTGG CGAGGAGATG
GCCGCCACGT CCACCGAGAT CGCCCAGAAC TGCGCCATGG CCGCCGAAGG GGCGCGCCGG
GCCAGCAGCA CGGCCACTGC CGGGGCTGAG GTGGTTGGGA ACACGGTAAC GGTGATGGAC
CGGATCGCCG AAAAAGTTAA GAACTCCGCC CGCACGGTGG AACGGCTGGG GGAGCGCAGC
GACCAGATCG GAGAAATAGT CGGCACGATC GAGGATATTG CCGATCAGAC CAATCTGCTG
GCGCTCAACG CTGCCATCGA GGCGGCCCGC GCCGGAGAGG CGGGGCGCGG CTTCGCCGTG
GTGGCCGACG AAGTGCGGGC CCTTGCCGAA CGGACCACTA AAGCGACCCG TGAAATCTCC
GGCATGATCA GGGCCATCCA GGCGGAAACC CTGGAGGCGG TTTCCTCCAT GGACGAAGGA
GTCAGGGACG TGGAGACCGG CACCGCCGAA GCTGCCCGTT CGGGGGAGGC TCTGCGGGAA
ATTCTCGACC AGATTACGGC GGTGTCGATG CAGGTGAACC AGATCGCCGT GGCCGCCGAG
CAGCAGACGT CCACCACCCG AGAGATCAGC GGCAACATCC AGCAGATCAC CGAGGTTGTG
GAGGGGACCG CCCAGGGCGC GGACGAGTCG GCATGCGCGG CCGGCGGGCT GAATCGGCTG
GCGGAGGACC TCCAGCGCAT GGTGGGGCAG TTCCGGCTGT AG
 
Protein sequence
MKIRTKFVVV NLLIVCCALA AVAAACLVEF NRELRRQAVT SQEIRLKTFW ELLRQKGDGF 
TVADGKLMAG SYVINDNYEL PDKLKELTGG TATIFMGDTR VSTNVLKPDG SRAVGTKLQG
AAYDAVIKEG KPYRGEADIL GVPYFTAYDP IRDSRGEVIG VLYVGVKKGD FYASYESLKL
TVVGIVLVIV LLAAVASKVI IHRLFTPLNR MHDVLRDVAQ GEGDLTQRLD YLAQDEVGDM
SRSFNSFMDK LHGIITHVAR TVEQLASSAS QVHGSAEQMA AGAGEVASQA GTVATAGEEM
AATSTEIAQN CAMAAEGARR ASSTATAGAE VVGNTVTVMD RIAEKVKNSA RTVERLGERS
DQIGEIVGTI EDIADQTNLL ALNAAIEAAR AGEAGRGFAV VADEVRALAE RTTKATREIS
GMIRAIQAET LEAVSSMDEG VRDVETGTAE AARSGEALRE ILDQITAVSM QVNQIAVAAE
QQTSTTREIS GNIQQITEVV EGTAQGADES ACAAGGLNRL AEDLQRMVGQ FRL
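
The sequences above are printed in blocks of 10 characters, 60 per row, so the spaces and line breaks have to be removed before computing anything from them. The following is a minimal sketch of that cleanup together with a GC fraction calculation; `clean_sequence` and `gc_fraction` are hypothetical helper names, not part of any tool associated with this record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C (0.0 if empty)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence above, used only as sample input.
sample = "ATGAAAATCA GGACCAAATT CGTAGTCGTC AATCTGTTGA TCGTCTGCTG CGCCCTTGCC"
dna = clean_sequence(sample)
print(len(dna), round(gc_fraction(dna), 2))
```

The GC content listed in the record (64%) refers to the full 1602 bp gene, so a single row will generally give a different value; passing all rows of the gene sequence to `clean_sequence` is what would be needed to compare against it.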