Gene GM21_1801 details


Gene Information

Locus tag: GM21_1801
Symbol:
ID: 8137132
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2094441
End bp: 2096009
Gene Length: 1569 bp
Protein Length: 522 aa
Translation table: 11
GC content: 64%
IMG OID: 644869413
Product: PAS/PAC sensor hybrid histidine kinase
Protein accession: YP_003021613
Protein GI: 253700424
COG category: [T] Signal transduction mechanisms
COG ID: [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID: [TIGR00229] PAS domain S-box
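
The start, end, gene length and protein length fields in this record are consistent with one another, and the arithmetic can be checked with a short sketch. The sketch assumes the coordinates are 1-based and inclusive, and that the last codon of the gene is a stop codon that is not counted in the protein length (the gene sequence in this record does end in TAG). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


gene_len = gene_length_bp(2094441, 2096009)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)  # 1569 522
```

Both values match the Gene Length and Protein Length fields listed for this record.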


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 58
Fosmid unclonability p-value: 0.0233266
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGATA TCGAGAAGGG GTTTAAGACG CTGTTCCAGA ACATGGAACT CCCTGCCTAC 
CTGCAAAGCG CGGACGGCTC CATGGTCGAT GTCAATCATG CGGCCTTATC GATGTTCGGC
ATGACCAGGG AGGAGTTCCT TGCCGGTCGC GGCCCCTCGC CTTGCCGGCG CCTGACCGGT
GAAAGCGGCG AGGAGCTTTC GCCCGACCGG CACCCCGCCG CTGTCGCCCT TGAGAGCGCA
CGGGAGGTCC GGGATTTCAT CACCTCGGTG CAGCAGGAGG GAGAACCTCT TCCGCTCTGG
GTCAACCTGA ACGCGATACC GGTGCTGGAT GAGGGAGGGG GCGCCCCCCA CGCCGTCCTG
GTCACGCTGA GGGACATCTC GAAGCTCAGG CTCCTGGAGC AGGCTGTCGC CGCGACGGCC
GCCGAGCGGG AACAGGAACA AAAGCAGTTG CAGATGCACC ACGCCCAGAA GCTGGAAAGC
CTCGGAGTCC TCGCCGGCGG CATCGCCCAC GATTTCAACA ACATACTCAC CTCCATCATG
GGAAACACCG AGCTTGCGCT GATGCAGCTC ACCCCCGGCG CCCCTGCCTG CGAAAACCTG
CGCCGGGTGG AGCGGGCCTC CCACCGCGCC GCGGCCCTGT TGAAGCAGAT GCTTTTCTAC
CTGGGTAAGG GCACCTTCTC CTCCGAACCG ATAGATCTGA ACCGGTTGGT GGAGGAGATG
GCGGATATGC TGCAGGCTGC CGTTTCCAAG AAGGCGACGC TGCGCCTGGA GCTCTCCCGG
CCGCTGGGGC TTTTCAGCGC CGACCCGGTC CAAGTGCGCC AAGTGGTGAT GAACCTGGTC
CTGAACGCTT CGGAAGCGCT CGGAAACGAG GTGGGCAAGA TCAAGATCTC CACCGCGCAA
AGGCACTACC GGCAGGAGGA GCTTGCGGAG TTCCGAGGCA GCGAGGAGCT TGCCCCAGGC
CCCTACCTGA CCCTGTCGGT GAGCGACACG GGTTACGGCA TGGACAAGGA GACGAGGGCG
CGGTTCTTCG ACGGCCTGTT CCCCGCTACC GGACGAGGGT TGGGTATGGC GGCCATCCTC
GGCGTGGTCC GGGGGCTCAG AGGGGGGGTG CGGCTGCAAA GCGACGTGGG GAAAGGCTCC
GCCTTCACGC TGCTGATCCC TGTGGACGCG GATGTGTTGA CGGCCGCCAA GCCCGCCGAG
CGGGCCTCCG AACCGATGGG GAAGGGTCCC GTGCTTTTGG TCGACGACGA AGAGGAGGTG
TGCCTGTTGG TGGGCGCCAT GCTGGAGCGG CTCGGGTACG AGGTGATCGC CGCACGCGAC
GGCCATCAGG CTCTCGAGCT TTACCTGCAG CGCGACGACT ACGCCTTCGT CATGCTCGAC
CTCACCATGC CGGTCATGGA CGGCGAGGAG ACCTACGAGC AGCTGCGCAG TATCGACCCG
TCGGTGAGGG TGATCATCAC CAGCGGCTAC AGCGAAAACG AAGTGGCGCG CCGCTTCGAA
GGAAAAGGGG TGAAGGGATT GCTGCAAAAG CCTTTCGACA TGGACGCGCT GCGCAGGGTT
CTCAGGTAG
 
Protein sequence
MSDIEKGFKT LFQNMELPAY LQSADGSMVD VNHAALSMFG MTREEFLAGR GPSPCRRLTG 
ESGEELSPDR HPAAVALESA REVRDFITSV QQEGEPLPLW VNLNAIPVLD EGGGAPHAVL
VTLRDISKLR LLEQAVAATA AEREQEQKQL QMHHAQKLES LGVLAGGIAH DFNNILTSIM
GNTELALMQL TPGAPACENL RRVERASHRA AALLKQMLFY LGKGTFSSEP IDLNRLVEEM
ADMLQAAVSK KATLRLELSR PLGLFSADPV QVRQVVMNLV LNASEALGNE VGKIKISTAQ
RHYRQEELAE FRGSEELAPG PYLTLSVSDT GYGMDKETRA RFFDGLFPAT GRGLGMAAIL
GVVRGLRGGV RLQSDVGKGS AFTLLIPVDA DVLTAAKPAE RASEPMGKGP VLLVDDEEEV
CLLVGAMLER LGYEVIAARD GHQALELYLQ RDDYAFVMLD LTMPVMDGEE TYEQLRSIDP
SVRVIITSGY SENEVARRFE GKGVKGLLQK PFDMDALRRV LR
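
The sequences in this record are laid out in space-separated blocks of ten characters, so they need to be joined before any further analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation of the kind that could be used to recompute the GC content field from the full gene sequence. The helper names clean_sequence and gc_fraction are hypothetical, and only the first row of the gene sequence is used here as sample input.

```python
def clean_sequence(block: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence, copied from this record.
first_row = (
    "ATGAGCGATA TCGAGAAGGG GTTTAAGACG "
    "CTGTTCCAGA ACATGGAACT CCCTGCCTAC"
)
seq = clean_sequence(first_row)
print(len(seq), seq[:3])  # 60 ATG
print(f"GC fraction of first row: {gc_fraction(seq):.2f}")
```

Passing the whole gene sequence block through clean_sequence and then gc_fraction would give a value to compare against the reported GC content; the result for a single row is not expected to match the gene-wide figure.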