Gene GSU3199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU3199 
SymbolcheA-3 
ID2688365 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp3506196 
End bp3507881 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content66% 
IMG OID637127892 
Productchemotaxis protein CheA 
Protein accessionNP_954240 
Protein GI39998289 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0643] Chemotaxis protein histidine kinase and related kinases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACATGT CCCAGTACCG GGACCTCTTC GTTGCCGAGG CCCGGGAACA CCTGGAGCGC 
CTGGGCGAAG AGGTACTGGC CCTGGAAAAG GACCCGGCCA ACGGCGAACG CCTCGACTCG
CTCTTCCGCA CCGCCCACTC CATCAAAGGC ATGGCGGGCT CCATGGGATA TGACGGCATA
GCCGACCTTT CCCACCGCAT GGAAGACCTG ATGGACCGCG TCCGCAAGGG CCGGATTCCC
TTTGGCCGGG ACATCGCCGA CCTGCTGCTG GCCTGCGCGG ACCAGTTGGG ACGAATGGTG
GAAGACGTGA CCGGCGGCGG GAACGGCTCC CTCGACGCGA CAGACCTCTG CGCCAGACTC
GCCCTGGTTG CCGGGCAGGA GGCCGCAGCC CCGGCGGCTC CAGCCGATGC AGAGACATCC
CCCTCCCCGC AGCCATCCGA CCAGCCGGAA CCGGCACGTC GCGACGAAAG CGACGGGGCG
CGAACCGTCC GGATCCGGTC AGAGCTTCTT GACCGATTTG TCAACATAAC CGGCGAACTG
GTCACCGGCA AGAACCGGAT CATGGAACTG GCGGCGGGAC TCGAATCCGA ACCACTGCGG
GATGCGGCGG CCGAACTGTC GAAACTGGTC CGCGACCTGC AGCGCGAGGT CATGTCGGCC
AGAATGATGC CCTTCGGCAC CATCTGCGAC CGTTTCCCCC GCATGGTGCG GGATCTGGCT
CGCCGTAGCG GGAAAGAGGC GACGCTGGCC ATCGACGGCA AGGATCAGGA ACTGGATCGC
GGCATTCTGG AAATTCTCCC CGACCCTCTG CTCCATGCCC TGCGCAATGC CGTCGATCAC
GGCATCGAGT CGCCGGAGGA ACGGAGTGCG GCCGGAAAGG GAGCGGGGGG TCGGATCGTC
CTGTCGGTTC GCAGGGAAAA AGACCATCTG GACGTGACAG TGACGGATGA CGGGCGGGGC
ATGGATCCGG CAGCTCTCGT CAACGCCGCC CTTGCCAAGG GAATCATCAC CCCGGAAGAG
GCGGCGACGC TCAGCCGGCA GGAGGCGTTG ATGCTCGTCT GCAGGCCGGG CTTTTCCACG
GCCAGGAGCG TCACCGAGGT ATCCGGAAGA GGGGTGGGGA TGGATGCGGT GCAAGCCGCT
GTAAGTCGGG CGGGTGGCAG CCTGTCCATC CAGTCCGAGC GAGGCCGGGG AAGCAGGATC
ACCCTTCGGC TCCCCCTGAG CGTGGCGATC ATCCAGGTGC TCCTGGTGGG CTGCGGCCCG
CTGACCATGG CGGTTCCCGT CAACGCCGTC CGCCGGACCG TCGAGCTGGA CCGGCGGCTC
CAGCGCATCG AAGATGGGCG GGCTGTTTTT GATCTGGGCG GGGAAACCCT CCCGCTGGTT
GACCTGGGCC TGCTCGTGGG GACCGGCCCG ACTGCCGGCG GGGATTTCTC GCCCGTTCTG
ACGGCCGACG TTGCAGGACG CACAATGGGG TTTGCCGTGG ACCGTTTTTT CGGACAGGCA
GAGGTATTCA CCAAGCCGCT CGGCACGCCG CTCAACCGTG CCAGGGGGCT TGCGGGAGGA
GCTATACTGG GAGACGGTCG GGTCATCTTC ATCCTCGACC TCCCCAATCT TGTCGACGGG
GCCACCAGCC GGCGCCGCGT TTTCATGCAC CCTGACGGTG CGCACAAAGG GGGAACGACC
GCATGA
 
Protein sequence
MDMSQYRDLF VAEAREHLER LGEEVLALEK DPANGERLDS LFRTAHSIKG MAGSMGYDGI 
ADLSHRMEDL MDRVRKGRIP FGRDIADLLL ACADQLGRMV EDVTGGGNGS LDATDLCARL
ALVAGQEAAA PAAPADAETS PSPQPSDQPE PARRDESDGA RTVRIRSELL DRFVNITGEL
VTGKNRIMEL AAGLESEPLR DAAAELSKLV RDLQREVMSA RMMPFGTICD RFPRMVRDLA
RRSGKEATLA IDGKDQELDR GILEILPDPL LHALRNAVDH GIESPEERSA AGKGAGGRIV
LSVRREKDHL DVTVTDDGRG MDPAALVNAA LAKGIITPEE AATLSRQEAL MLVCRPGFST
ARSVTEVSGR GVGMDAVQAA VSRAGGSLSI QSERGRGSRI TLRLPLSVAI IQVLLVGCGP
LTMAVPVNAV RRTVELDRRL QRIEDGRAVF DLGGETLPLV DLGLLVGTGP TAGGDFSPVL
TADVAGRTMG FAVDRFFGQA EVFTKPLGTP LNRARGLAGG AILGDGRVIF ILDLPNLVDG
ATSRRRVFMH PDGAHKGGTT A