Gene GM21_2778 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2778 
Symbol 
ID8138121 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3226226 
End bp3228496 
Gene Length2271 bp 
Protein Length756 aa 
Translation table11 
GC content54% 
IMG OID644870381 
ProductPAS/PAC sensor signal transduction histidine kinase 
Protein accessionYP_003022570 
Protein GI253701381 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones117 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGGCCA AGCAAGTCAT GAAGATGCCA TCATCGCTGC AGAGCTTTGC ACGACAGGTT 
ATTGCCACTA CTCTTCTCAT CAACCTTATT GTCCTGGGCG TCACGCTATG GTCGCTGCGC
GACAGTCGGA TGCATTACGA GGAACGGATT ACCTTGAATA CCGTGAACCT CTCTCTGGTT
CTGGAAAAGT ACTTGAACGG CGTTATGGAA AAAGTGAACT TGACGCTTCT GGCCCTGGGC
GATGAAGTAG AGCAGCAACT GACTGCCGGG CGCCTTGACG GCGACCGCTT GAATGCGCAG
ATGAAACGGC AGCTTTCTCG ACTCCCGGAG GTGGATGGCA TTCGCATGAC TGACGCCCAA
GGCCGGGTAA TATACGGAAC CGGTATGACC CCAGGGGCAC ACCCCAGCGT TGCCGACCGC
GATTACTTCC ATTATCTGCG CAGCAACCCA ACTGCTGGTC TTGTCATCTC CAAACCGCTT
ATCAGCCACA TCAGCGGCAA GAATGTTGTC GTCGTCGGCC GACGGATTAA TCGATCAGAC
CATTCTTTTG GCGGGGTAAT ATACGTTGCA ATAATTGTCG AGCACTTTGC AACGCTATTT
TCTACCATAA ATGTCGGTCC TCATGGCGCG ATCACACTCA CTGACACCAA GAGTATTGTC
ATCGCCCGCT CCCCGGTGCC TGGGCATACT GGCAGTTACC TTGGTAAAGA GCTGAAATCG
GCAGGGCTGA AGAAGTTGAT TGATGAGGGG CTGACAGTGG GAAGCTACCG AAGCAAATCA
GCGCTTGACG GGGTGCAACG CACTATCACC TTCCGGGTAA TCAATGGGTA CCCGCTCCTG
GTCTTTGTAG GGATGGCGCC TTCCGATTAC CTGCACGAGT GGCGATTGGA TGCTTTGAAG
ATGGGGGGAC TGGTCGTCTC TTTCATGCTG ATAAGCATAA TCACCAGTAG GCTGATCTAT
GAAAGGCGGA AACGCGAGAA ACTGGCAGAG GCGGAATTGT ATCAACATAA GGTATATCTG
GAGAGCATCG TGGTGCAACG GACCTCCGAC CTTGAAACCA GGAACCGGGA GTTACAGGAG
TCGGAGGGGG TGCTGAAAAC CATCTTGGAC AATGTCTATG ACGCTATCGT CATCCATGAC
GCTTCAGGGC GGATACTGCA GGTGAACAGG CGCTGGCGCG AGATGTATGG TGTCTCTGAA
AACGAGTCGG AGACTCTCAC CATTGCCGAT TTCTCCACTG ATCCGCCACC ACCTGAGGAA
CTCGCTTCCT TGTGGGGACA TGTCCTCGCC GGGAACTCAA ACTTCTTCGA GTGGCCCGCC
CGCCGGCCGC ATGATGGCTC CAAAGTCTGG GTGGAAGCCT TCCTTTGTCC GATAAGATTG
AAGGAGCAGA ATCTGATCAT GGGGTGCGTA CGAGACATAA CCGAGCGCAA GGCAACTGCA
CAGGAACTAC AGAAGTATCG GAATCATCTC GAGGATCTTG TACAGGAGCG GACTGAGGAG
TTGGCAAGAG CTGTCGAGAA GACGCGCAGG GAAACGGAGC AGCGGATTGC TGCGGTCGAG
GAACTGCGAC AAAAGGAGCG GCTTCTCATC CAGCAGAGCC GATTGGCTGC AATGGGGGAG
ATGATGGGCA ATATTGCACA TCAGTGGCGC CAGCCGCTAA ACATTCTGGG TCTCATCGTC
CAGGAGCTCC AGATATGCCA CCAGAAGGGC ACTTTGGATA ATAAGCTCGT CAATACCCTG
GTACCGAAAG CGATGAAGGT GATCGCACAT ATGTCGCAGA CGATTGACGA TTTCCGCAAC
CTGCTGAGCC CCGACACGTC AAGAACCGTC TTCAGTGTCA ACGAAGTTGT TGAAAGGGTC
CTGTCAATTA TGATTCTTGA GGCAAAGGTG GATGTCATCG CAGAGGAGGA GTGCTTCGCG
GAGGGTGCTA GAAACGAGTT TTCACAGGTC ATTATAAATG TTCTGGCCAA CGCGAACGAT
ATCTTCCGGG AGCGGCAAGT TTCGGCATCG CGGATCATCA TCCGAATTCT GCCTCAGGAC
CTCAAGTCGG TGGTAACCAT CGCCGACAAC GGCGGCGGGA TCCCTGAGGA AATAATGTGC
AAGATCTTCG ATCCATATTT CACCACCAAG GCGCCCGACA GGGGTACCGG CATAGGCCTG
TTCATGTCGA AGACCATCAT CGAACAGAGG ATGAAGGGGG CGCTGTCAGC CCGCAATACA
GCTGAAGGCG CAGAGTTCAG GATTGAGGTT CCTGCAGGTA CCAAGCCCTA G
 
Protein sequence
MGAKQVMKMP SSLQSFARQV IATTLLINLI VLGVTLWSLR DSRMHYEERI TLNTVNLSLV 
LEKYLNGVME KVNLTLLALG DEVEQQLTAG RLDGDRLNAQ MKRQLSRLPE VDGIRMTDAQ
GRVIYGTGMT PGAHPSVADR DYFHYLRSNP TAGLVISKPL ISHISGKNVV VVGRRINRSD
HSFGGVIYVA IIVEHFATLF STINVGPHGA ITLTDTKSIV IARSPVPGHT GSYLGKELKS
AGLKKLIDEG LTVGSYRSKS ALDGVQRTIT FRVINGYPLL VFVGMAPSDY LHEWRLDALK
MGGLVVSFML ISIITSRLIY ERRKREKLAE AELYQHKVYL ESIVVQRTSD LETRNRELQE
SEGVLKTILD NVYDAIVIHD ASGRILQVNR RWREMYGVSE NESETLTIAD FSTDPPPPEE
LASLWGHVLA GNSNFFEWPA RRPHDGSKVW VEAFLCPIRL KEQNLIMGCV RDITERKATA
QELQKYRNHL EDLVQERTEE LARAVEKTRR ETEQRIAAVE ELRQKERLLI QQSRLAAMGE
MMGNIAHQWR QPLNILGLIV QELQICHQKG TLDNKLVNTL VPKAMKVIAH MSQTIDDFRN
LLSPDTSRTV FSVNEVVERV LSIMILEAKV DVIAEEECFA EGARNEFSQV IINVLANAND
IFRERQVSAS RIIIRILPQD LKSVVTIADN GGGIPEEIMC KIFDPYFTTK APDRGTGIGL
FMSKTIIEQR MKGALSARNT AEGAEFRIEV PAGTKP