Gene GM21_0039 details

Gene Information

Locus tag: GM21_0039
Symbol:
ID: 8135338
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 51174
End bp: 52721
Gene Length: 1548 bp
Protein Length: 515 aa
Translation table: 11
GC content: 64%
IMG OID: 644867656
Product: methyl-accepting chemotaxis sensory transducer
Protein accession: YP_003019884
Protein GI: 253698695
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID: [TIGR00058] hemerythrin family non-heme iron proteins; [TIGR02481] hemerythrin-like metal-binding domain
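
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(51174, 52721)
print(length, protein_length_aa(length))  # 1548 515
```

Under these assumptions the result agrees with the Gene Length (1548 bp) and Protein Length (515 aa) fields of the record.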


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 62
Fosmid unclonability p-value: 0.0381098
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTTTAT TCTCTCGCTT CATAACGATC AACTTGATCG CTGTCGCCGC CACGATTGGC 
GCAGCCGTTG CCATGGGCAG CGGAGTTGGT CTGTTTGCCG CCGGCGCCGT GATAATCCTC
CTCTCCGCCG TGGCCTACGG TCTGTCGTCA CGTGGGGAAA CCAAGGCGTT GGAGGAAATG
GCCGTCGCCC TGGAAAGCGC CGCGGCTGGC GATCTTTCCT ACCGGGTCAC GGCCGGCGGC
AACGGCGAGA TAGGACGCAT AGGGGCCGCA TTCAACACCA TGATGGGCGA CTGGAACAAG
ACCATGCACA AGTTCTTCAC CGTCACCGAT CTGGTACGCG ATTCCGTCGC CCTGGTGAGC
GCGACTAACG ACGCCATGGC TGCCGCGGCC GAGGACGTCG CGCTGCAGGC CTCCACCATA
GCTACCGCCA GCGAAGAGAT GTCCGCCACC TCCGGCGACA TCGCGCGCAA CTGCCTCTAC
GCGGCCGAGA ACGCCCACAG GGCCACCGAG GAGACCACCT CCGGCGCGGA GATCGTCAGT
AACAGCGCAA GGCTCATGGA GAACATCGCC CAGCGCGTGA TGGCCACCTC CAGCTCCGTC
GCAGGCCTTG GCGAGCGTTC CGACCAGATC GGCGCCATAG CCGGGACCAT CGAGGACATC
GCCGACCAGA CGAACCTGCT CGCCTTGAAC GCTGCCATCG AGGCCGCCCG CGCCGGCGAG
ACCGGCCGTG GCTTCGCCGT CGTCGCCGAC GAGGTGCGCG CCCTTGCCGA GCGGACCACC
CGCGCCACCA AAGAGATCGA CGCCATGATC AAGTCGATCC AGACCGAGAC CAGAGCCGCC
GTCGGCTCCA TGGGAGAAGG GGTCGAGCAG GTGAACCAGG GGACCGCCGA AACCTGCCGC
TCCGGCGAGG CGCTGAACGG CATCCTCAGA ATGATCAACG ACCTGACCAT GCAGCTCTCC
CAGATCGCCA CGGCGGCCGA GGAGCAGACG GCGACCACCC ACGAGATCAC CAGCAACATC
CAGATGATCA CCAACGTGGT CAACAGCAAC GTGGAAAGCG CCCGCGACAC CAGGGCGGCC
ACCGGGAAGC TGGTCCAGCA GGTGGACGAG CTGCACCAAC TGGTGTCGCA CTTCCAGCTC
TCCGACGCCA TGGTCTGGGA CCAGAGCTTC GCCACCAGCA TCGGCACCTT CGACGATCAG
CACAAAAAGC TCTTCGCCAT GGTGAACGAA CTGAACCAGG CCATGCAGCA CAAGCGGAGC
AAGGAGGCGA TCGGATCGGT CTTGAACCGC CTGATCGAGT ACACCGGCAG CCACTTCGCC
GCCGAGGAAG AGGTCTTCCG CAAGACCGGC TACCCCGAGG AAGAAGCCCA CGTCAGGGCG
CACCGGGACC TGGTGCAGCA GGTAGTGGCG CTGCAGCAGA AATTCAACGC CGGCGAGACC
CTCCTTACCC ACGACGTCAT CGAATTCCTG CAGAACTGGC TGGTGAAGCA CATCAAGGGG
ACCGACGTCC GCTACACCTC CCACCTGACC AAGGCGGGGG TCCGTTGA
 
Protein sequence
MSLFSRFITI NLIAVAATIG AAVAMGSGVG LFAAGAVIIL LSAVAYGLSS RGETKALEEM 
AVALESAAAG DLSYRVTAGG NGEIGRIGAA FNTMMGDWNK TMHKFFTVTD LVRDSVALVS
ATNDAMAAAA EDVALQASTI ATASEEMSAT SGDIARNCLY AAENAHRATE ETTSGAEIVS
NSARLMENIA QRVMATSSSV AGLGERSDQI GAIAGTIEDI ADQTNLLALN AAIEAARAGE
TGRGFAVVAD EVRALAERTT RATKEIDAMI KSIQTETRAA VGSMGEGVEQ VNQGTAETCR
SGEALNGILR MINDLTMQLS QIATAAEEQT ATTHEITSNI QMITNVVNSN VESARDTRAA
TGKLVQQVDE LHQLVSHFQL SDAMVWDQSF ATSIGTFDDQ HKKLFAMVNE LNQAMQHKRS
KEAIGSVLNR LIEYTGSHFA AEEEVFRKTG YPEEEAHVRA HRDLVQQVVA LQQKFNAGET
LLTHDVIEFL QNWLVKHIKG TDVRYTSHLT KAGVR
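
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before any computation. The sketch below is one possible way to strip the whitespace, compute the GC percentage, and translate the coding sequence. It is an illustration and not the database's own method; the helper names (`clean`, `gc_percent`, `translate`) are hypothetical. It assumes the amino-acid assignments of NCBI translation table 11, which match the standard code, and it does not handle alternative start codons or ambiguous bases such as N.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# Assumption: table 11 uses the same codon assignments as the standard code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and convert to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGTCTTTAT TCTCTCGCTT CATAACGATC"
print(translate(first_blocks))  # MSLFSRFITI
```

The translated prefix matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` and `translate` gives values that can be compared with the GC content field and the protein sequence of the record.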