Gene GSU1140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1140 
Symbol 
ID2685536 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1229730 
End bp1231913 
Gene Length2184 bp 
Protein Length727 aa 
Translation table11 
GC content55% 
IMG OID637125814 
Productmethyl-accepting chemotaxis protein 
Protein accessionNP_952193 
Protein GI39996242 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.664897 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCAAGA ACATGAAGGT AGGCTTAAGG CTCGGTATCG GTTTTGGCGT GGTGGTGACT 
GTGTTCATGA TTGCGATCCT AGTTACACTT GTGCTGTTGC GTGAGGTCAA TCAGGAGTCG
CGTCAGGTCG CCGAAGAGTC GATACCATTC CTCATGAGTG CATACGAGAT GGATGTGGCC
TTGGCGGAGC TGACGGAGAA TCTCACAGAC GTGGCAGCTA CCCATAGTCC TGATGGTTTC
AAGGGGGCAG AGGAGGCAGC TGCTATTGTA AAGCGGGAAA TAACGAAGTT TCGTGAAATG
TTCCGCAAGG AGAACGACAC CGTTGCATTG AAGGAGTTGG ATGATGTGGA GGCTGCGTTT
ACCGCCTTTC ACCTCAGCGG CGTCAAGATG GCCAAGGTCT ATATGGAGCA GGGAATTGGA
GCGGGTAACC CTCTGATGAA AGAATTCGAT AATGCCCACG AGGTACTGAT CGAAAAGGTC
GAGAAGCTTC AAAAGAGCCA GGTGGATGAG GCTCTAGGCA ATAGCCGCGA CAATGTTGCC
GCAGTCGGTA AGGTGACCAT GGTCCTAATC GGATTCGGGA TTGTAGCCGT TTTGATAGGC
GTGGCCGTCG CGTTTTTCAT TACCCGGAGC ATTACGCTTC CGCTCGTCCG CGCCATGGAA
GCCAGCAATC GCCTTGCAGA GGGAGATTTG ACCATTGAGA TTGTTGCTGA CCGCGAGGAT
GAGGCGGGGC AACTGCTGAA GTCCATGAAG AACATGATTG ATTCCCTGCG GTCTCTGGCT
ACAACGGCAG AGCGTGTGGC AGAAGGAGAT CTGGCAGTCG AGGTTGTTGT CCGTTCAGAT
CGTGACGTTC TTGCCCGAAA CCTCCACGGC ATGCTTGAGA CGCTCAAAGG GCTGCGTCAG
GAAACAGATG AACTTATCGG TGCGGTTCGT GACGGCAGGT TAAGCGTGCG CGGCAATGCG
CGAACCTTCA GTGGAGGTTG GGGGGAGCTT CTGACCGGTA TCAACCAGTT GGTGGACGCT
TTTGTGCAGC CGATACAGGT TACTGCTACG GCGCTTAATC GTATCAGCCG CGGTGATATA
CCGGAGAAGA TAACGGCGGA ATACAAAGGA GATTTCAACG AAATCAAGAT CAATCTCAAC
AGTCTCATCG ATGCTATGAA TTCAATTACC GCCCTCGCCC AAGAGCTCTC TGCCGGCAAT
CTGACAGTTG AGGTTAAAGA GCGGTCTGAG CGGGACGAGT TGATGAAGGC ACTTGCGTCT
ATGGTGACCA AGCTGCGCGA TGTGGTGGCG GATATCATGA TAGCTGCCGA CAACGTCACG
TCAGGCAGTC AACAACTGTC GTCCACGTCC GAGGAGATGA GCCAGGGGGC CACCGAGCAG
GCTGCTTCGG CAGAGGAGGC CTCGTCGAGC ATGGAACAGA TGTCGTCCAA TATCCGCCAG
AATGCGGATA ATGCGGCGCA GACCGAGAGG ATTGCCATCA AGTCTGCGGC GGACGCCATC
GAGGGAGGGA AGGCGGTCGG CAATACCGTG TCAGCCATGA AGGAGATCGC ATCGAAGATT
TCCATCATTG AAGAGATTGC ACGGCAGACT AACCTGCTGG CGCTCAATGC GGCGATCGAG
GCGGCGCGTG CCGGCGAACA CGGGAAAGGA TTCGCCGTGG TGGCGAGTGA GGTACGCAAG
CTTGCTGAGA GAAGCCAGAA AGCAGCGGGC GAAATCAGTG AGCTCTCTTC TTCGAGCGTA
GAGGTTGCAG TCAGGGCAGG CGAATTGCTT GCCACCATCG TGCCCGATAT TCAGCGAACC
TCTGAACTGG TGCAGGAGAT CAGCGCCGCC TGCCGTGAAC AGGATACGGG TGCCGAGCAG
ATCAACAAGG CCATCCAGCA GCTTGATCAG GTGATCCAGC AGAATGCCTC GGCGGCGGAG
GAAATGTCGT CCACGGCTGA GGAGCTTTCG TCGCAGGCCG AGCAGCTTCA GGACACGGTC
GCTTTCTTCA GTATTGGTGG AGAGATGAAA CGTAAGATTG CGCCAAAGCC GTCTCGACCG
AACGCCAAGG CGAGCATCAG GCTCCCTGCG GCTCCTCACG GCACGGCCAA CGGTTATGGC
CGAACGAGTG CTTCTGTTAC CGGCGGCTTT GCCCTGGATA TGGCAGGCCA CGATCATCTG
GACAACGAGT TCGAAAAATT CTGA
 
Protein sequence
MFKNMKVGLR LGIGFGVVVT VFMIAILVTL VLLREVNQES RQVAEESIPF LMSAYEMDVA 
LAELTENLTD VAATHSPDGF KGAEEAAAIV KREITKFREM FRKENDTVAL KELDDVEAAF
TAFHLSGVKM AKVYMEQGIG AGNPLMKEFD NAHEVLIEKV EKLQKSQVDE ALGNSRDNVA
AVGKVTMVLI GFGIVAVLIG VAVAFFITRS ITLPLVRAME ASNRLAEGDL TIEIVADRED
EAGQLLKSMK NMIDSLRSLA TTAERVAEGD LAVEVVVRSD RDVLARNLHG MLETLKGLRQ
ETDELIGAVR DGRLSVRGNA RTFSGGWGEL LTGINQLVDA FVQPIQVTAT ALNRISRGDI
PEKITAEYKG DFNEIKINLN SLIDAMNSIT ALAQELSAGN LTVEVKERSE RDELMKALAS
MVTKLRDVVA DIMIAADNVT SGSQQLSSTS EEMSQGATEQ AASAEEASSS MEQMSSNIRQ
NADNAAQTER IAIKSAADAI EGGKAVGNTV SAMKEIASKI SIIEEIARQT NLLALNAAIE
AARAGEHGKG FAVVASEVRK LAERSQKAAG EISELSSSSV EVAVRAGELL ATIVPDIQRT
SELVQEISAA CREQDTGAEQ INKAIQQLDQ VIQQNASAAE EMSSTAEELS SQAEQLQDTV
AFFSIGGEMK RKIAPKPSRP NAKASIRLPA APHGTANGYG RTSASVTGGF ALDMAGHDHL
DNEFEKF