Gene GSU1035 details


Gene Information

Locus tag: GSU1035
Symbol:
ID: 2685770
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 1117662
End bp: 1119311
Gene Length: 1650 bp
Protein Length: 549 aa
Translation table: 11
GC content: 65%
IMG OID: 637125704
Product: methyl-accepting chemotaxis protein
Protein accession: NP_952088
Protein GI: 39996137
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:
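
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are inclusive 1-based positions and that the last codon of the CDS is a stop codon; the variable names are illustrative and do not come from the database.

```python
# Values copied from the Gene Information fields; names are illustrative.
start_bp = 1117662
end_bp = 1119311
gene_length_bp = 1650
protein_length_aa = 549

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Each codon is three bases; the final codon is assumed to be a stop
# codon, which encodes no residue.
codons = span_bp // 3
residues = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("residue count matches protein length:", residues == protein_length_aa)
```

Under these assumptions all three checks agree with the record: 1119311 - 1117662 + 1 = 1650 bases, and 1650 / 3 - 1 = 549 residues.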


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.640997
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAATGC CTGCACGGCT GGCAGCACTG AAGTTGTCCC ACAAACTGAT CCTGGCCCTG 
GCGGTACTGA ATCTGTTCGT CATCGCCGCG GTGGCCGCCT CCAGCTACCA GGGACAGAAA
ACGGCGGTAC AGCATGCCGT GGACGAAAAG CTGCTCGCCT GCGCCCAGGG GGTCCGGCTT
CTGGGGGATG CCTTCCATGA CCGGCTCGGG CAGTCAGCTG ACATCAACCA GGAAGAGTAC
GTGGCCATGC TCGACAATCT TTCGGCGTTT GCCGAAGGGG CGGGCGTCAA GTACGTCTAC
ACCGTGGTGG TGAAGGACGG CAAGGTTGTC TTTACCACCT CAAGCCATAC CAGGGAGGAA
AAGGAAAAGG GAGATATCGC CGCCCTGTAC GACCCCTACG ACGACGCGAG CAGTGCTCTC
AAGGATGCCA TTGCCGACGG CAAGCCCCGC TACGATACCT ATTCCGACCA GTGGGGTACG
TTCCGGTCCC TGTTCCTTCC GGTTCGCTCC AGCGGCGGCG CGACGTACGT CATCGGCGTC
GACGTCTCAA CCGCGGACGT GAACGCCGTC CTCCGTTCAA GCCTGATCAC CACCGTTGTC
ATGGGCGCTG TCCTGTTCGT GGCCGGCACG CTGCTCATGC TCCTGGTGAT CAGACCCGTT
TCCGCCGCAG TCCGGATGCT TGCCGAAAAG GTCAACCATG TTGCCGATGG TGATCTGAAC
GTTACCGTCG ATTATGCGAG CGGTGACGAG CTCGGCATGC TGGCGGGCGA CATGAACCGT
ATGGTTGAAA AACTCCGGGA CATGGTTGCG GGCGTTGCGG GGGCGGCGGC CGAAGTGACC
ACGGCCGCCC GTCAGCTGTC GTCCACCTCA GAGGAGATGG CGGCGGGCGT CCAGTCTGCC
GCCGCGGAGG TCGTCGGCGT TTCCACGGCA GGCGAAGAGA TGGCGGCCAC GTCCTTCGAG
ATTTCCTTCA ATTGTTCCAC CGTGGCCGCG GATGCCCGTC AGGCCACTGA GTCGGCCACG
GCGGGCGAGG AGGTCGTGTC GGCCACGGTC TGCATCATGG CGAACATTGC CGCGCTGGTG
CGGGATTCGG CCCGGACGGT CGAAAGCCTG GGCGCGCGGA GCGATCAGAT CGGGGAGTTG
GCCGGTTCCA TCGAGGATAT CGCCGATCAG ACCAATCTCC TGGCCCTGAA CGCCGCCATC
GAGGCGGCCC GGGCAGGGGA GCAGGGGCGC GGGTTCGCCG TGGTTGCCGA CGAGGTAAGG
GCGCTTGCCG AACGGACGGC CCGGGCCACC CGCGAGATCA CGGCCGTGAT CCGCTCCATC
CAGCAGGAAA CCCAGGGGGC TGTTACCGCC ATGACCGCGG GTGTTGTCGA GGTTGAGCGG
GGGACGGCCG AGGCGTCCCG GTCGGGCGAG GCCCTGAGGG GCATCCTTGA ACGAATCCAT
GCCGTGGAGG AGCAGGTTGT CCAGATTGCC GCGGCCGCTG ACCAGCAGAC CGCCACCACC
ACCGAGATCA GCGGCAATAT CCTGCGGATC TCCGACGTAG TCCAGAGCAC TACCCGCGGC
GCCCAGGATT CGGCCGATGC CGCCGCGCAC CTGCAGGGTC TTGCCGAAGA GCTTCATGCC
GCCGTGGGCC GGTTCAGGGT CGCCGGGTAG
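
The gene sequence is displayed in space-separated blocks of ten bases, so whitespace has to be removed before any computation. A minimal sketch of a GC-content calculation follows; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first displayed row of the gene, so it is not expected to reproduce the GC content reported for the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First displayed row of the gene sequence only (60 of 1650 bases).
first_row = "ATGAGAATGC CTGCACGGCT GGCAGCACTG AAGTTGTCCC ACAAACTGAT CCTGGCCCTG"
print(len(clean_sequence(first_row)))
print(f"{gc_fraction(first_row):.1%}")
```

Passing the complete gene sequence to `gc_fraction` gives a value that can be compared with the GC content field of the record.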
 
Protein sequence
MRMPARLAAL KLSHKLILAL AVLNLFVIAA VAASSYQGQK TAVQHAVDEK LLACAQGVRL 
LGDAFHDRLG QSADINQEEY VAMLDNLSAF AEGAGVKYVY TVVVKDGKVV FTTSSHTREE
KEKGDIAALY DPYDDASSAL KDAIADGKPR YDTYSDQWGT FRSLFLPVRS SGGATYVIGV
DVSTADVNAV LRSSLITTVV MGAVLFVAGT LLMLLVIRPV SAAVRMLAEK VNHVADGDLN
VTVDYASGDE LGMLAGDMNR MVEKLRDMVA GVAGAAAEVT TAARQLSSTS EEMAAGVQSA
AAEVVGVSTA GEEMAATSFE ISFNCSTVAA DARQATESAT AGEEVVSATV CIMANIAALV
RDSARTVESL GARSDQIGEL AGSIEDIADQ TNLLALNAAI EAARAGEQGR GFAVVADEVR
ALAERTARAT REITAVIRSI QQETQGAVTA MTAGVVEVER GTAEASRSGE ALRGILERIH
AVEEQVVQIA AAADQQTATT TEISGNILRI SDVVQSTTRG AQDSADAAAH LQGLAEELHA
AVGRFRVAG
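
The protein sequence can be checked against the gene sequence by translating it. NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the start codons it permits, so the sketch below builds the standard assignments with the Python standard library. The function name `translate` is hypothetical, the inputs are short fragments copied from the start and end of the gene, and the gene is assumed to be displayed in its coding orientation.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base
# varies fastest). Table 11 shares these amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 21 bases of the gene, copied with the display spacing intact.
print(translate("ATGAGAATGC CTGCACGGCT G"))  # MRMPARL
# Last 12 bases of the gene, ending in the TAG stop codon.
print(translate("GTCGCCGGGTAG"))  # VAG
```

Both fragments agree with the corresponding ends of the protein sequence, which begins with MRMPARL and ends with VAG.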