Gene GM21_3968 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3968 
Symbol 
ID8139342 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4553009 
End bp4554538 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content66% 
IMG OID644871584 
Productmethyl-accepting chemotaxis sensory transducer 
Protein accessionYP_003023742 
Protein GI253702553 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.00000000439597 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTAAGG CGGTCAACCA GGCGGTGAAG GAGTTGTCCG AGGCGATGAT GGCGGGGAAG 
CTCGACGTGC GCGCGGACCT CAAGGGGCTC AAGGGGGAAG ACGCGGAAAC GGTCCGGCTC
ATAAACGGCA TGATCGACGC CCTGATCGCC CCCATGAGGC TCGCCGGCGG CGCGCTGCGG
GAGATCGCCC ACGGCAACCT CCCCCCTTTC GTCATCGACG AGTACCAGGG GGAGTTCCAC
CAGATCAAGC AGGACATAAA CACCCTTTTG GCCATCCTCT ACGGCATCCA CGCCGAGGCG
GTGCACCTGA CCAACAGCAT CGGCGAGGGG AAGCTGAAGA CCCGGGGGAA CGACTGGGAT
TACCAGGGGG TCTGGAGGGA GCTGATCGCG GGGTTCAACG GGACGCTCGA CGCGGTCATC
GCCCCTATCC GCGAGGCGGG AGAGGTGCTG GAGCGCCTGG CGCGCTACGA CCTGAAAAGC
AGGATGAGCG GGAAGTACCG CGGCGAGCAC GCCGCGATCC GCAAGGCGAT GAACTCGACG
GCGGTTGCGT TGAACGACGC CATAGCCCAG GTCGGCGAGG CGGTAGGGCT CGTCTCCGAC
GTGGGGCGGC GCATTACCAG CGTCAGCTCC TCCTTCGCCC TTGGGGCCAG CGAACAGAGC
AAGGAGCTGG GGGAGACCTC GGTAAGCCTG ACGCAACTTT CCCGGAGCGC CGCCCAGAAC
GCGCGGAGGT CGAAGGAGGC TCATGCCGAC GCCAAGAAGG CGACCGACGC CATGCGCCTG
GCCAAGGAGG CGATGGGGCG GATGCTGGCG TCCATGGACG AGATCAGCGC TGCTGCCGAA
AGCACCGTCT CCATAGCCGG GGAAATAGAC GGCATCGCCC AGGAGACCGG CGTCCTGGCG
TGGAGCACCG TCGAGAAGGC GGCCCGCATG AGAATATCCG CGGGTGGGTT CGGTGTCGTG
GCCCAGGAGA TCCGCAAGCT TTCCCGGCAG TGCTCCCAGA CGGCGAACTC CATGAAGGAG
TTCGAGAAGA AGCTGGGTGC GGAGCACCAG GAGGAATTCG GCGCCCTGAT CGCGAGCCTG
TTGCAGATCG CCAGATTCTC GAACCTGTTG GGGGTGAACG CCGCCGTCGA AGCGGCCCAC
GTCGAGGGAG CCGGCAACGA GTTCCAGGCG ATGACCGACG AGATACACAC CCTGGCGGTC
AGGTCGGCCG ACGCGGCGAA AAGTACCGGG ACGCTCACCA AGTCCTCCCA GGACCTGGCG
CGGCAAGGGG TGGTGCTTTC GCGCGAGATC GACCTGGAGC TGGAAGGTGC TGTGGAGGCG
GCGCAGGCGA TAGCCCGTTT CGCCGACGAA ATCCTGGCCG GCATCGAGGG GCAGACGGCC
AGGATCGAGG AGATAAACGC GAGGGCGGTC CACATAACCG GTGTCACCGA GAAGAATGCC
TCCGGCGCGG CCGACTCGCT CGTGGCGGCG CAGGAGCTAG AGGCGCAGGT CGCCAAGCTC
TCCACCATGG TGAACCGGTT CAGCTTCTGA
 
Protein sequence
MSKAVNQAVK ELSEAMMAGK LDVRADLKGL KGEDAETVRL INGMIDALIA PMRLAGGALR 
EIAHGNLPPF VIDEYQGEFH QIKQDINTLL AILYGIHAEA VHLTNSIGEG KLKTRGNDWD
YQGVWRELIA GFNGTLDAVI APIREAGEVL ERLARYDLKS RMSGKYRGEH AAIRKAMNST
AVALNDAIAQ VGEAVGLVSD VGRRITSVSS SFALGASEQS KELGETSVSL TQLSRSAAQN
ARRSKEAHAD AKKATDAMRL AKEAMGRMLA SMDEISAAAE STVSIAGEID GIAQETGVLA
WSTVEKAARM RISAGGFGVV AQEIRKLSRQ CSQTANSMKE FEKKLGAEHQ EEFGALIASL
LQIARFSNLL GVNAAVEAAH VEGAGNEFQA MTDEIHTLAV RSADAAKSTG TLTKSSQDLA
RQGVVLSREI DLELEGAVEA AQAIARFADE ILAGIEGQTA RIEEINARAV HITGVTEKNA
SGAADSLVAA QELEAQVAKL STMVNRFSF