Gene GSU1041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1041 
Symbol 
ID2688717 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1127555 
End bp1129180 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content62% 
IMG OID637125710 
Productmethyl-accepting chemotaxis protein 
Protein accessionNP_952094 
Protein GI39996143 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.893443 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCATGA GAAACTGGAA AATCGGGACG AAACTGGCAA CCGGGTTCGG AGGACTTCTT 
CTGCTGCTGG TGATCTTCAC CACAGTAACC ATCATTTCCA TCCGGTTCGT GAAGACGAGC
ACGAGCCAGA TCAGGACCGA GAGCCTTCCC TATGCCTTGC TCGCCGAGGA AATGGCGTTC
GAGGTGGTTC AGGTCCAGCA GTTCCTGACC GACGTGGGGG CGACGCGGGA GCCCGATGCT
TATGCCGAGG CGGACGCGGC CGCGGCAAAT TTCCGCAAGT CTCTGAAGCA GTTCGAAGAC
ATGTACCGCC GCGAGAATGA TACCGCTGCC CTCCAATCCG TTGAAAAGAT GGAGAAGGAC
TTTGAGTCCT TTTATCAGCT GGGTCGACGG ATGGCCGAGG CTTATATGGC CGAGGGGACG
GAAGCGGGTA ACCGTTTGAT GGGCGATTTC GACAAGGTGT CAACGGTTCT TGCCGAAGAC
ATGAGAACAT TCAAGGAGGG GCAGGTCAGA GAAGCCAACC ACATGACCGC GTCTGTGGAC
GAAACGCTCG GCGGGCTGGA AAAGGTCATT ATCGCCCTTG CCGCTGCCGG GATCATTGTG
GGGCTTTTCG CCTCCTGGTT CATCGGGAAA GCCATTTCCG CCCCCCTCGG CAAGGCGGTG
TCCGCCATCG ACCGGATCGC TTCGGGCGAT CTCACCATCA GGATTCCGGT CACGGGAAGC
GATGAAACCG GGGCGCTCGC CGTATCCGTC AACCGCATGG CGGACGATAT GGGTACGGCC
ATGGCAGCCC TGGCCAATGC ATCGTCACAT CTGGCCTCGG CCTCGGTGGA GCTTGCGGTG
CAGGCGGACC AGATGGCCAA GGGGGCCGAG GAAGTGGCGG CCCAGACGGG AACCGTTGCC
GCGGCGAGCG AGGAAATGGC CGCAACGTCC CACGAGATTG CCATGAACTG TTCCCATGCC
GCCGAAAGCT CCCGGAGGGC CAACGACCGT GCATCGGCAG GCTCCGATGT CATCCGTCGC
ACTGTCGAGG GGATGCATCG CATTGCCGAA AAGGTTCAGC GCTCCTCCGA GAGCGTTGCC
GGACTGGGGG CCCGCAGCGA CCAGATCGGC CAGATTGTTT CGGTTATCGA AGACATCGCG
GACCAGACGA ACCTGCTGGC CCTCAACGCC GCCATCGAGG CGGCCCGGGC CGGGGAGCAG
GGGCGGGGGT TCGCCGTGGT GGCCGACGAG GTACGTGCGC TGGCAGAGCG AACCGGCAAG
GCGACGCGTG AGATCGCCCA GATGATCCGC TCCATCCAGC AGGAAACCGA GGGTGCGGTC
AAGGCCATGG AAGAAGGGGT GGCGGAAGTC TCTGCCGGCA AGGAGGACGC CCAGCAATCC
GCCGGTGCTC TCCGGGAGAT CGTCGAGCAG ATCGAGGCCA TGACGACACA GATCAATCAG
ATCGCCGTCG CCTCCGAGCA GCAGAACGCC ACCACCGATC AGATCACCAT GAACCTCCAG
CAGGTTTCGA GCGTTATCGA GGCGTCCTCC CGCGGTTCCG AGGAAACCGC CAACGCGGCC
CATACCCTCT CGGCGCTCTC CGAAGAACTC CAGAGCATTG TGGGGCGGTT TCGCACGGCG
GCTTAG
 
Protein sequence
MFMRNWKIGT KLATGFGGLL LLLVIFTTVT IISIRFVKTS TSQIRTESLP YALLAEEMAF 
EVVQVQQFLT DVGATREPDA YAEADAAAAN FRKSLKQFED MYRRENDTAA LQSVEKMEKD
FESFYQLGRR MAEAYMAEGT EAGNRLMGDF DKVSTVLAED MRTFKEGQVR EANHMTASVD
ETLGGLEKVI IALAAAGIIV GLFASWFIGK AISAPLGKAV SAIDRIASGD LTIRIPVTGS
DETGALAVSV NRMADDMGTA MAALANASSH LASASVELAV QADQMAKGAE EVAAQTGTVA
AASEEMAATS HEIAMNCSHA AESSRRANDR ASAGSDVIRR TVEGMHRIAE KVQRSSESVA
GLGARSDQIG QIVSVIEDIA DQTNLLALNA AIEAARAGEQ GRGFAVVADE VRALAERTGK
ATREIAQMIR SIQQETEGAV KAMEEGVAEV SAGKEDAQQS AGALREIVEQ IEAMTTQINQ
IAVASEQQNA TTDQITMNLQ QVSSVIEASS RGSEETANAA HTLSALSEEL QSIVGRFRTA
A