Gene GSU1030 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1030 
Symbol 
ID2685724 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1109751 
End bp1111400 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content62% 
IMG OID637125700 
Productmethyl-accepting chemotaxis protein 
Protein accessionNP_952084 
Protein GI39996133 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAATCA AGAGTTTCAA GGATTGGAGA ATACTGCCGA AGATCATCGG CGCGGCGCTC 
CTCGGAGTGG CGCTCCTGGC GGCGGTAGTG CTCTTCTACT TCCTGCCCAT GGTCGAGAAG
CAGGAGATGG AAAGCCGCAA GCGTGCCACC AGACAATCGG TGGAGCTGGC CTTCGGCATC
GTGGGGGCCT ATGAGGCACG GATCGCGGCG GGAGAGCTGA CCGTTGACGA AGCAAAGGAG
CGGGCCGCTG CTGATATCAA AAAACTGCGC TACGCCAAAA AAGAGTACTT CTGGATCAAC
GACTCCAGCG CGCGTCTCGT GGCCCATCCC CTCAGGCCGG AAAACGAAGG AAAGGACATG
GGGGATTTCA AGGATGCCGA CGGCAAGCTG ATTTACCGCG AGTTCGCCAA GGCGGCCGGC
GCAGAAAACG GCGAGTTGTT CGTGGACTAC CGCCAGATCA AGCCCAACGA GAAGACGCCC
CTGCCGAAGG TTTCCTTTGT GAAGTACCAC AAGCCGTGGG ATTGGGTGAT CGGTACCGGC
ATCTACGTGG ACGACGTGAA GCGGGATATC ACCATGCTGC GCTGGAAGAT CATCGGCGCC
ATCGTTGTGG CCGGCGCGGT TGCCTGTCTG CTGGTTCTCT TTGCCGGGGT CAGGATTACC
CGCCCGCTCA AGGTCGTCGT GTCGAGCCTT GAGGATATTG CCCAAGGCGA AGGGGACCTG
ACCCGGCGGA TCGACGTGGT GACCCGGGAT GAGGTGGGCG ACCTGGGGCG TGCCTTCAAC
CAGTTCATCG AGAAGTTGCA CAACATCATT TCCCAGGTTG TCCAGAACAG CATGCAGGTG
GCGTCGGCCG CCGCCCAGAT TCACAGCACC TCCGAGCAGA CCGCAACGGG CGCCGAGGAG
GTTGCCGCTC AGGCCGGCAC CGTGGCCACG GCCGGCGAAG AAATGGCCAG CACTTCGTCG
GAGATCGCCC GCAACTGCAT GGCTGCCGCC GAAAACTCGC GCCAGGCCAA TGACACCGCC
CTCAAGGGCT CCCATGTGGT GAAGGAGACC CTGACGGTCA TGACCCGCAT CGCCGACCGG
GTCAAGGAGT CCGCCCATAC GGTCGAGTCC CTCGGGTCGC GCAGCGATCA GATCGGCGAG
ATCGTCGGTA CCATCCAGGA TATTGCCGAC CAGACCAACC TGCTGGCGCT TAACGCCGCC
ATCGAGGCGG CCCGGGCCGG CGAGCAGGGC CGGGGGTTCG CCGTGGTGGC CGACGAGGTG
CGGGCGCTGG CGGAGCGGAC CACCAAGGCC ACCAAGGAGA TCGGCCAGAT GATCCGGTCG
ATCCAGCAGG AGACAAAGCT GGCGGTCTCT TCCATGGAGG AAGGGGTCAA GGAAGTGGAG
AGGGGAACGT CAGAGGCCGC CAAATCGGGT GAGGCGCTGG AGGAAATCCT GCACCAGATC
GGCGAGGTGA CCAACCAGGT CAATCAGATC GCCACCGCCG CCGAGCAGCA GACAGCTACC
ACCAGCGAGA TCAGCAGCAA CATTCATGAG ATCACCGAAG TTATTACCCA GACAACCCGC
GGCGCCCAGG ACTCAGCCTC CGCCACCAGT GACCTGGCGC GTCTGGCCGA AGAACTTCAG
CGTTTGGTGG GACAATTCCG CCTCTCCTGA
 
Protein sequence
MAIKSFKDWR ILPKIIGAAL LGVALLAAVV LFYFLPMVEK QEMESRKRAT RQSVELAFGI 
VGAYEARIAA GELTVDEAKE RAAADIKKLR YAKKEYFWIN DSSARLVAHP LRPENEGKDM
GDFKDADGKL IYREFAKAAG AENGELFVDY RQIKPNEKTP LPKVSFVKYH KPWDWVIGTG
IYVDDVKRDI TMLRWKIIGA IVVAGAVACL LVLFAGVRIT RPLKVVVSSL EDIAQGEGDL
TRRIDVVTRD EVGDLGRAFN QFIEKLHNII SQVVQNSMQV ASAAAQIHST SEQTATGAEE
VAAQAGTVAT AGEEMASTSS EIARNCMAAA ENSRQANDTA LKGSHVVKET LTVMTRIADR
VKESAHTVES LGSRSDQIGE IVGTIQDIAD QTNLLALNAA IEAARAGEQG RGFAVVADEV
RALAERTTKA TKEIGQMIRS IQQETKLAVS SMEEGVKEVE RGTSEAAKSG EALEEILHQI
GEVTNQVNQI ATAAEQQTAT TSEISSNIHE ITEVITQTTR GAQDSASATS DLARLAEELQ
RLVGQFRLS