Gene GM21_2502 details


Gene Information

Locus tag: GM21_2502
Symbol:
ID: 8137844
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2928796
End bp: 2930067
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 60%
IMG OID: 644870111
Product: PAS/PAC sensor signal transduction histidine kinase
Protein accession: YP_003022301
Protein GI: 253701112
COG category: [T] Signal transduction mechanisms
COG ID: [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID: [TIGR00229] PAS domain S-box
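
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; neither is stated in the record. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is assumed to be a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2928796, 2930067)
print(length, protein_length_aa(length))  # 1272 423
```

Under these assumptions the computed values agree with the reported Gene Length (1272 bp) and Protein Length (423 aa).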


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 87
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCAAC AACTGCCGGC GGCTAATCGG GGCCACGCCC TGCAACCTCC GCCCGGATTC 
GAACCTGAAG ACGAGACAAT GCCTATTCGA CAGTTGCGCA ACTTCGCCAA ATTATCCGAA
AACCTCAACG GTGCCGAGGC GGCCAACCGG GTTTTGTCAG CCACCATCGA ACATACCCGC
GACGGCATCA TGGCAGTGGA CGGAAACGGG GAGATCGTCG CGTGCAACCG GCGTTTCCTG
GAGATGTGGG GCATAAGTAA CGATGCCCTG CCCTGCGGAG ATTCCGACAA ATTGCTCCTG
TCGCTCTTGG GTCAGGTACG CGACCCGGTG CTGTTCTTCG AGAACTTCAG CGAGATGCAG
TGGCAGCCGA ACCGGGAGAG CTACGACGTA GTGGAACTGA ACGACGGCAG GTGTTTCGAG
CGCTTTTCCA GGCCGCACTA TCTGGATTGG AAGACGGCCG TGCGGATCTG GACCTTTCAC
GACATCAGCG AACTGAAGAA GATGGAGAGC CAGCTGCTGC ACGCTCAGAA GATGGAGGCA
ATAGGGACCC TCGCCGACGG CATCGCCCAC GACTTCAACA ACATCATGAC CGCCGTGATC
GGTTACACGG ACCTGCTCAT GACCGAATTA TCCCCTCCCG CGCCTTACCG AGGATTTCTC
GAGAACATAA ACACCGCCAC TCATCGCGCC ATCACGGTGG TGAAGAACCT GCTCGCCTAT
TCGAGGCAGG AGCCGATGCA GACAACGAGG ATCCTTGGCA ACGACCTGAT CGAAGGGATT
TTTGTGCTTT TGAAGAGGGT GGCGGGGCAA GGTATCGAGC TGGCATGGGA GCCGGCCCCC
GACACCCTCC CGATAATGGT GGACCAGGCG CAGATGGAGC AGGTATTGGT AAACCTGACC
GCCAATGCCA GGGACGCCAT GCCTCAAGGG GGAACGCTCA CCATAACGGT CGACGCTGTA
GATCTCGCCC CCCACGAGGT CCAGGGGTAC GACCACACAC GGCCAGGGCC CCATGTGCGC
ATCGCCGTTT CCGATACCGG CTCCGGGATC GACCAGGAGA CACAGAGCAG GATCTTCGAT
CCTTTCTTCA CCACCAAGGA GGCGGGAAAC GGGACCGGGC TCGGGCTTTC CATCAGTTAC
GGCATAGTCA AGCGCCATGG CGGCTTCATC CGCGTCGCCA GCGAAAACGG CGAGGGGACC
ACGTTCTCCA TCTTGCTCCC CAAGGCTGAA GCTTCCCCCA ACCGACAGAA ACAGCCTGCC
CCGTCGAACT AG
 
Protein sequence
MSQQLPAANR GHALQPPPGF EPEDETMPIR QLRNFAKLSE NLNGAEAANR VLSATIEHTR 
DGIMAVDGNG EIVACNRRFL EMWGISNDAL PCGDSDKLLL SLLGQVRDPV LFFENFSEMQ
WQPNRESYDV VELNDGRCFE RFSRPHYLDW KTAVRIWTFH DISELKKMES QLLHAQKMEA
IGTLADGIAH DFNNIMTAVI GYTDLLMTEL SPPAPYRGFL ENINTATHRA ITVVKNLLAY
SRQEPMQTTR ILGNDLIEGI FVLLKRVAGQ GIELAWEPAP DTLPIMVDQA QMEQVLVNLT
ANARDAMPQG GTLTITVDAV DLAPHEVQGY DHTRPGPHVR IAVSDTGSGI DQETQSRIFD
PFFTTKEAGN GTGLGLSISY GIVKRHGGFI RVASENGEGT TFSILLPKAE ASPNRQKQPA
PSN
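
The relationship between the gene sequence and the protein sequence can be checked with a short sketch like the one below. It strips the spacing used in the listing, computes the GC fraction, and translates codon by codon until the first stop codon. The helper names are hypothetical. The codon table is the standard assignment, which translation table 11 shares; table 11 differs in its permitted start codons, which this sketch ignores.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (first base varies slowest,
# each position ordered T, C, A, G). "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1 until the first stop codon (stop not included).

    Raises ValueError on any character other than A, C, G, T.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        a, b, c = (BASES.index(x) for x in s[i:i + 3])
        aa = AMINO_ACIDS[16 * a + 4 * b + c]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the gene sequence listed above.
print(translate("ATGAGCCAAC AACTGCCGGC GGCTAATCGG"))  # MSQQLPAANR
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing its spacing with `clean`) gives a quick consistency check; `gc_fraction` can be compared with the reported GC content in the same way.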