Gene GSU0401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU0401 
Symbol 
ID2685789 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp432601 
End bp434220 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content60% 
IMG OID637125066 
Productmethyl-accepting chemotaxis protein, putative 
Protein accessionNP_951460 
Protein GI39995509 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.395676 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAACTGA CGATCAAGCA GAGAATGGGG ATGACGGTAG GAGTTACCCT GCTGGGTATG 
GTGGTCATCA TTGTGTTCAT GGTTGTCGGC TTCACCAAGG TCCACCGACA GCAGGAGCTC
ATGGACCGGC TCACCCTGAT CAATAATACG GCATTGCGCG GCAACATTGC CATGCTTAAG
GCCAGGGAAT ATGAGGCCGA GTTCTTTGAC CGCAAGCAGG ACAAGTGGGT TCCGCGGGTC
AAGCAAGCGG TTGACCAGGT CAACAAGGAA CTGGACGTCA TCCTCAAGAA TACGGATGAT
CCAAAAATCA AGGGGTGGGC CGAAAGCGCG CGCAAGCTCG CTACCCAGTA TGTGCAGCAG
TTCCAGGAGC TTGCATCCGT GGCCCTGGGA AGCAACTTCC AGGGGGCGGA ACTGGCCGAA
ACCCGGGAAG AACTGCGTGA TATCCTTAAC GAGTTCGAAC CCCTGCTCGA CAATTACATT
CCCAAGCAGG TGGGAGTCGC CTATCAGGCT GCCACCGAGG AGATGGACAG GAGCATTGCC
GCCATCCGCC TTCAGATCTT CGGCGCCGTG CTGCTGGTGG CCGTGGCCAT GCTCGTCAGC
ATTTCGTCCA CCGCCATTTA CCTTCTCAGG TCCCTTCGGT TGATCAATGA CCGTCTGCGC
GACATTGCCG ACGGCGACGG CGACCTGACC AAGCGAATTG AACTCCAGTC CAGGGACGAG
CTGGGAACCC TGGCCGTTTC GTTCAACAAT TTCGTGGGAA AGCTTCACGA CATTATTGCT
CAGGTTTCCC AGGGCACCCT CCAGGTGGCG TCGGCTTCCT ACGAACTCCA GGCCAACGCG
GAGCAAATGG CCCACGGTGC AGAGGCGGCA GCCACACAGG TGAACACCGT TGCCTCGTCC
AGCGAGGTGC TGGCGGCATC AACCTTTGAG ATATCAAGCA ACTGCGGCAC TGTTGCCGAA
AGCTCCCGGC GGGCCAACGA CTCCGCCCAG ACCGGTGCGG TCGTTGTCGA GAAGACTGTC
GATATCATGG CCAGGATCGC CGAGCGGGTC AAAGACTCGG CCCGGACCGT GGAAAGTCTG
GGTGCCCGCG GCAACCAGAT CGGAGAGATT ATTTCCACCA TCGAGGATAT CGCCGATCAG
ACGAACCTGC TGGCCCTGAA CGCCGCGATC GAGGCGGCTC GGGCCGGTGA GCAGGGCCGC
GGTTTTGCCG TTGTGGCGGA CGAGGTGCGG GCGCTTGCCG AGCGGACATC CCGGGCTACG
CGAGAGATTT CCCAGATGAT CAAAGGGATT CAGGGCGAGA CCAGGGGGGC GGTGCTTGCC
ATGGAGCAGG GGGTCAAAGA GGTGGAACTC GGCTCCGAGG AGGCCGCCCG CTCCGGCGAG
GCGATACGAA CCATTCTCGA GCAGTTCCGT ACGCTCGACT GTCAGGTGGG GGAAATTTCC
GCAGCTGCCG AGGACCAGAC CCGTGTCACC ACTGAAATCA GCACTAATGT CATGCAGATA
ACCGAGATTA TCGAGACCAC GGCAAAGGGT GCCGCTGACT CGGCCGAGGC GGCCCAAGGG
TTGGCTGAGC TTTCGGATCA GCTCAAGCAG ATCGTCGGAC GATTCAAACT CAGCGTCTGA
 
Protein sequence
MQLTIKQRMG MTVGVTLLGM VVIIVFMVVG FTKVHRQQEL MDRLTLINNT ALRGNIAMLK 
AREYEAEFFD RKQDKWVPRV KQAVDQVNKE LDVILKNTDD PKIKGWAESA RKLATQYVQQ
FQELASVALG SNFQGAELAE TREELRDILN EFEPLLDNYI PKQVGVAYQA ATEEMDRSIA
AIRLQIFGAV LLVAVAMLVS ISSTAIYLLR SLRLINDRLR DIADGDGDLT KRIELQSRDE
LGTLAVSFNN FVGKLHDIIA QVSQGTLQVA SASYELQANA EQMAHGAEAA ATQVNTVASS
SEVLAASTFE ISSNCGTVAE SSRRANDSAQ TGAVVVEKTV DIMARIAERV KDSARTVESL
GARGNQIGEI ISTIEDIADQ TNLLALNAAI EAARAGEQGR GFAVVADEVR ALAERTSRAT
REISQMIKGI QGETRGAVLA MEQGVKEVEL GSEEAARSGE AIRTILEQFR TLDCQVGEIS
AAAEDQTRVT TEISTNVMQI TEIIETTAKG AADSAEAAQG LAELSDQLKQ IVGRFKLSV