Gene GSU1141 details


Gene Information

Locus tag: GSU1141
Symbol:
ID: 2688456
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 1231999
End bp: 1233588
Gene Length: 1590 bp
Protein Length: 529 aa
Translation table: 11
GC content: 59%
IMG OID: 637125815
Product: methyl-accepting chemotaxis protein
Protein accession: NP_952194
Protein GI: 39996243
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:
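
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the terminal stop codon; the record does not state either convention, and the variable names are made up for this example.

```python
# Values copied from the Gene Information fields.
start_bp = 1231999
end_bp = 1233588
reported_gene_length = 1590      # bp
reported_protein_length = 529    # aa

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1590 0 529
```

Under those assumptions the computed values agree with the reported Gene Length and Protein Length.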


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0751853
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGAGAG TGCTCAGCCT CTGGCAGCGT TATCTCGATC TCTCCGTCAA TGCCAAATTG 
ATGCTGTATG TGGCCTGCTT CACCGTCTGG CTCATTTTGG TGGGGGCGGC CGGGTTGTGG
GGCATGGGAG TGTTGAGCAC CGGCATCAAC CGCGACGGGC TTGCATTGCG GCGTGTGCTT
CTGGTGTCCG GGCTCAAGAA CGATTTGCTC TACTTGCGCC ACGACCTTGA TCGCTATTTT
CTCGAAAATG GAAATGCCGC GAGCCGTGCC ATACACGATG CCGAACAACG GCTCAAAACC
ATTGCTGGTG GCATAGAGGC ACTTGAACGT CACGAGCCCG ACGCGGAGCA GCGCAGGCTC
CTGGGGGTCT TCGCGCAAGA GTTTGCACTG TATCGCGAAG CAGTAGTGCG GCTTATTGAC
CTGCAAAAGC GAGCCCTTGA GACGGGAGAT GTCACTGTGC GGTCAGCGGC GGCCTCCTAT
GCCCGAGAGG ATATTCTGCC CCTCTTTTTC GGGGCTTCCG ATGCGGTAAC CGATCTGGTG
GAGTTTGATC GCCAGCAGGC CTTGCAGACC GTAGATGCGA ACGGCCTGCA GTATGCCCGG
CTTTCACGGG TTCTACCGGC ACTGATCGTG GCAGCTGTTA CACTGGGGCT TTTTTTCGGG
ATTGTCATTG CCCGTTCCAT TACCAAGCCT CTCGCGCGTA TTCTCGCTGC GGTCGAATCT
CTTGCCACCG GCAACCTGAA CGTGGATGCT CCCGTTGGTG CGCGAGACGA TCTTGGCCGG
TTGGCCGTGG GAATCAATGC CATGGTTGGG CGGTTCCGGG ACGTTGTGAC GTCGATATGC
AGAGATAGCG AGGCTGTGGC CGGAGCTGCA TCCCAGCTGT CGGGAACCGC CTGCCAGTTG
TCCGAGGCCG CCACAGAGCA GGCTGCCGCT GCGGAAGATG CGTCGTCGAG CATGGAACAG
ATAAGCTCCG CCATCCGGGC GAATGTCCAG AACGCCCAAA CAACGGCAGA TGTTGCCAAC
CGGAGTTCCA TCGATGCTGC CGCGGGGGGG GAAACGGTGA CGGAAACCGT GGCGTTGATG
AAGGAGATCT CCAGAAAGAT CATGGTGATT GAGGAGATAG CTCGTCAAAC CAATCTGCTG
GCGCTCAACG CCGCCATTGA GGCGGCACGG GCCGGTGACC ACGGGAAGGG GTTTGCCGTG
GTTGCCGGTG AAGTCCGTAA ACTGGCGGAG CGGAGCCAGT CGGCCGCGGC CGAGATCGGC
CGGCTTTCGG TAACGAGCGT CGAGGTGGCT GAGCGGGCAG GCACCCTGTT CGGCGCCATT
ATTCCCGATA TCCGCCAGAC GGCTGAGCTT GTCCAGGGAA TCAGCTCCGC CTGCCACGAG
CAGGAAACCG GCGTTGGCCA GATCAATCGG GCCATACGTC AGCTAGATGC CGTGATTCAG
CAGAATGCTT CGGCTTCCGA GCAGATGGCG TCGACGGCGC AGGAATTGTC ATCTCAGGCG
GACATGCTGC TTGATGCCGT CAGTTTCTTT CGACTGGGTG AGACTAACCG GTATCAGGAA
TCCCGTAGCG AACTATCGGG AATTTCCTGA
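
The GC content field in the Gene Information section is presumably the percentage of G and C bases in this sequence. How the source database computes or rounds it is not stated, so the sketch below is an assumption. The helper name `gc_percent` is hypothetical, and the usage line runs it on the first 10-base block only, not on the whole gene.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C among the A/C/G/T characters in sequence.

    Whitespace and any other characters (for example the spaces between
    10-base blocks) are ignored.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence: 6 of its 10 bases are G or C.
print(gc_percent("ATGCCGAGAG"))  # 60.0
```

Passing the full gene sequence, with or without the block spacing, would give a value to compare against the reported 59%.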
 
Protein sequence
MPRVLSLWQR YLDLSVNAKL MLYVACFTVW LILVGAAGLW GMGVLSTGIN RDGLALRRVL 
LVSGLKNDLL YLRHDLDRYF LENGNAASRA IHDAEQRLKT IAGGIEALER HEPDAEQRRL
LGVFAQEFAL YREAVVRLID LQKRALETGD VTVRSAAASY AREDILPLFF GASDAVTDLV
EFDRQQALQT VDANGLQYAR LSRVLPALIV AAVTLGLFFG IVIARSITKP LARILAAVES
LATGNLNVDA PVGARDDLGR LAVGINAMVG RFRDVVTSIC RDSEAVAGAA SQLSGTACQL
SEAATEQAAA AEDASSSMEQ ISSAIRANVQ NAQTTADVAN RSSIDAAAGG ETVTETVALM
KEISRKIMVI EEIARQTNLL ALNAAIEAAR AGDHGKGFAV VAGEVRKLAE RSQSAAAEIG
RLSVTSVEVA ERAGTLFGAI IPDIRQTAEL VQGISSACHE QETGVGQINR AIRQLDAVIQ
QNASASEQMA STAQELSSQA DMLLDAVSFF RLGETNRYQE SRSELSGIS
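
The protein sequence can be checked against the gene sequence by translating codons. The sketch below is a minimal pure-Python example, not the database's own method. It builds the standard codon table, whose amino-acid assignments are the same as those of translation table 11 (the two tables differ only in which codons may act as start codons). The names `CODON_TABLE` and `translate` are hypothetical. Only the first and last lines of the gene sequence are translated; the last line is in frame because every preceding line holds 60 bases.

```python
from itertools import product

# Standard genetic code in NCBI order (first, second, third base each cycle T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; '*' marks a stop codon. Whitespace is ignored."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


first_line = "ATGCCGAGAG TGCTCAGCCT CTGGCAGCGT TATCTCGATC TCTCCGTCAA TGCCAAATTG"
last_line = "TCCCGTAGCG AACTATCGGG AATTTCCTGA"

print(translate(first_line))  # MPRVLSLWQRYLDLSVNAKL
print(translate(last_line))   # SRSELSGIS*
```

The two outputs match the first 20 and last 9 residues of the protein sequence, with the final TGA codon read as a stop.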