Gene GSU0750 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU0750 
Symbol 
ID2687331 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp793255 
End bp794874 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content65% 
IMG OID637125422 
Productmethyl-accepting chemotaxis protein, putative 
Protein accessionNP_951807 
Protein GI39995856 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGTAA GTATCGGCAG ACGGCTAACC CTCAACATGG TCTGGGGGGT GTTTGTGGTT 
CTGGTGCTGG TGATCGGCAA CTGGATCGGC ATGGGCCATC TGGAGCAGCT GCAAGCGACC
AGTCACGAGG CCATGGCGCG CAGCAGGTCC GCCCAGGAGA CCAAGGTGAT CGGCGAAAAA
CTGTACCGGT TCGTCCTGGA ATCGGTCGCC AACCCCGACA TGGCCGGCTC CTCGTCCAAG
GGATGGCTGA ACCGGAAGGC GGAGGGAATG GCCAAGCTGA AGCAGCTGGC CGAGCAGACC
GGGGACGACG CTGCCCTGGG CGCCCTGGTA GCCTCGGCGG ACAAGGCGTT CCGGGGCACG
GTTACCCTCT ATGAAACCAA ATTGATCCCG GCGCTCGAAC GAGGGGCGAA CCACGAGGAT
ATCATGGACA TCGACGATGA GATCTCCATG GAGTCCGACA ACCTGAGCAT CAGCCTGCTC
AAGGTCGCCG AAACGCTGGA AAAACGGGCC ATTGCCGCAA GCGCCCAGTA CGACTCCTTC
AGCGCAAAAC TCAAGACCTT CTCCCTTGCC CTGGGTGGCA TCGGCATCGT GCTGCTGGTG
GTGTTCTCCT CCTGGCTCAG CCGTTCGATC ATGAGGCCGC TCCGTCAGGT CATCGCCATG
ATGGAAGACG TGGCCGAGGG CGAGGGGGAT CTGACCAAGC GCCTCGAACA CCGGAGCAAC
GACGAGTTGG GGAAACTCTG CACCGAGTTC AACTCCTTCG TGGGCAAAGT CCACGACACC
ATCTCCCGGA CCTCCAGCGT GGCTCGGGAC GTGACCGGAT CGGTGGCAGA GATCAGCCGG
ACCGCCGAGC GGTTGGCCGA GGGCGCCGAA GAGGTCGCCT CCCAGGCGGT CATGGCGGCC
ACGGCGAGCG AGGAGATGGC CGCCACCTCC TGCGAGATTG CCGGCAACTG CCAGACCGCG
GCCCAGTCGT CGAGCCGGGC GCGGGAAACG GCTGCCCGGG GCTTTGCCAT GGTGGAAAAC
ACCATTGCGG TCATGAACCA GATCGCCCGG CGGGTCAGGG TGTCGGCCGA ATCGGTGCAG
GGGCTCGGCG CCCGCTCCGA CCAGATCGGC GAAATCGTCA TGACCATTCA GGACATCGCG
GACCAGACGA ACCTGCTGGC GCTCAACGCT GCCATCGAGG CGGCCAGGGC CGGCGAACAG
GGGCGCGGCT TCGCCGTGGT GGCCGACGAG GTGCGGGCAC TGGCGGAGCG GACATCACGG
GCTACCCGCG AGATCGGCGA GATGATCAAG GGCATCCAGG GCGAAACCCG GACTGCGGTG
CTCACCATGG AAGAAGGCGT CAAGGAAGTC GAAGCGGGCA CCCGGGAGGC CGCCAAGTCG
GGCGAGGCGC TCAATGAGAT CATGCAGGGG ATCGAGCAGC TCAACCAGCA GATGGGCCAG
ATCGCCTGCG CCGCGGAGCA GCAGACCTCC ACGACCATGG AAATCAGCGG CAGCATCCAG
CGCATCAAGG ACGTCGCCCA GGAGACCGCC GGCGGCGCCC ACGACAGCGC CCGGACCTCC
ACCCGGCTCA CGGACCTGTC CCACGACCTG GATCGCCTGG TGAGCCAGTT CAGGGTGTGA
 
Protein sequence
MSVSIGRRLT LNMVWGVFVV LVLVIGNWIG MGHLEQLQAT SHEAMARSRS AQETKVIGEK 
LYRFVLESVA NPDMAGSSSK GWLNRKAEGM AKLKQLAEQT GDDAALGALV ASADKAFRGT
VTLYETKLIP ALERGANHED IMDIDDEISM ESDNLSISLL KVAETLEKRA IAASAQYDSF
SAKLKTFSLA LGGIGIVLLV VFSSWLSRSI MRPLRQVIAM MEDVAEGEGD LTKRLEHRSN
DELGKLCTEF NSFVGKVHDT ISRTSSVARD VTGSVAEISR TAERLAEGAE EVASQAVMAA
TASEEMAATS CEIAGNCQTA AQSSSRARET AARGFAMVEN TIAVMNQIAR RVRVSAESVQ
GLGARSDQIG EIVMTIQDIA DQTNLLALNA AIEAARAGEQ GRGFAVVADE VRALAERTSR
ATREIGEMIK GIQGETRTAV LTMEEGVKEV EAGTREAAKS GEALNEIMQG IEQLNQQMGQ
IACAAEQQTS TTMEISGSIQ RIKDVAQETA GGAHDSARTS TRLTDLSHDL DRLVSQFRV