Gene GM21_2551 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2551 
Symbol 
ID8137893 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2980296 
End bp2981486 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content63% 
IMG OID644870160 
Productresponse regulator receiver sensor signal transduction histidine kinase 
Protein accessionYP_003022350 
Protein GI253701161 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value1.14529e-19 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCCCAAC AGGATAAAGA GCCGCGCGGC CGCGTGCTGA TAGTAGACGA CGAGAAGGTC 
ATCCTCGACC TTACCGCCAT AATCCTTAAA AACCGCGGCT ACCAGGTCTT CACCGCGCTT
TCGGCGCTGG AGGGGCTGGA GACCATCGAT AAGGAACGCC CCGAGCTCGT GCTGCTCGAT
TACATGATGC CGAACATGGA CGGCCTCACC GCGCTCAAGG AGATCAGGCG CAGCTACCCC
GATACCTACG TGATCATGTT CACCGGCAAG GGGAGCGAGG AGATCGCCGT CGAGCTGATG
AAGGCGGGAG CCTCCGACTA CATCCTGAAG CCGTTCAACA ACCAGGACCT GGTCGAGAGG
ATCGAGAGCG TCCTGAAGCT TCGGGGCATC GAGCTGCAAA ACCGCGCCCT TTTGAGCGAG
CGGGAGCGGC TTCTGGCCGA GATCGCGGAC TGGAACCGCG AGCTGGAGCG CCGGGTGCAG
GAAAAGAGCG AGGCGCTGCG CCGGGCCCAG GCAGAGGTGG TGCAGTCCGA GAAGCTCGCT
TCCCTCGGCT ACCTCTCCGC CGGGATGGCG CACGAGATCA GGAACCCGCT CAACTCCATA
GCGCTTTTCG TGCAGCTCAT CAAAAGCGGG CTGGACGAGC ACGAGCGCCT GGACTACGTG
GAAAAGATCC TCAAGGAGGT CGACCGGATC GACAACATCC TGGCGAAGCT CCTGGACGCC
TCCAAGCGCC CGAAGTTCGA GATCAGCGAG GTGCGGGTCG ACCGGGTCCT GGAGCACACG
CTCGACGCCT TCACGCCGCA GCTGCGGCAG AAAAGGATCC GGGCGGTCAC CGACTTCAAG
AGCATCCCCC CGGCCATCAA GGCGGACCCG ATGGAGATAG AGCAGATCTT CACCAACCTC
TTCCTGAACT CCATCTACGT GATGCCCGAG GAGGGGACCC TCGCGGTGGA GCTGGCAGGG
GACGAGCAGT GGATCACGGT GAGGGTCTCC GATACCGGCC CCGGCATACC GCCCGAGAAC
CTCCCCAACA TCTTCGATCC CTTCTTCACC ACCAACAGCC GCGGCACCGG GCTCGGGCTC
TCCGTGGTCC TGCGCATCGT GAAGACCTAC AAGGGGAAGA TCGAGGTGGA GAAAAGCGAC
AGCTCCGGGA CCACCTTCCT GGTCCGCCTG CCGCTTGCCC CCCCGAGGTA G
 
Protein sequence
MSQQDKEPRG RVLIVDDEKV ILDLTAIILK NRGYQVFTAL SALEGLETID KERPELVLLD 
YMMPNMDGLT ALKEIRRSYP DTYVIMFTGK GSEEIAVELM KAGASDYILK PFNNQDLVER
IESVLKLRGI ELQNRALLSE RERLLAEIAD WNRELERRVQ EKSEALRRAQ AEVVQSEKLA
SLGYLSAGMA HEIRNPLNSI ALFVQLIKSG LDEHERLDYV EKILKEVDRI DNILAKLLDA
SKRPKFEISE VRVDRVLEHT LDAFTPQLRQ KRIRAVTDFK SIPPAIKADP MEIEQIFTNL
FLNSIYVMPE EGTLAVELAG DEQWITVRVS DTGPGIPPEN LPNIFDPFFT TNSRGTGLGL
SVVLRIVKTY KGKIEVEKSD SSGTTFLVRL PLAPPR