Gene GM21_0306 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0306 
Symbol 
ID8135613 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp375390 
End bp377141 
Gene Length1752 bp 
Protein Length583 aa 
Translation table11 
GC content53% 
IMG OID644867924 
ProductPAS/PAC sensor signal transduction histidine kinase 
Protein accessionYP_003020146 
Protein GI253698957 
COG category[T] Signal transduction mechanisms 
COG ID[COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones206 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCACAG AAGAGTTGTT ACAAGATCTC AAGAAGAGAG GCGAACTTGA CGAACGCCAC 
CTTTTGGCGT CGATCCGATT GCTGCACCTA CTCGCCTCGG GAAAATCCTC CCAAGAGGTC
ATGTCGACGT TGCTGCCCTT CTTTCAAGAA CTTTCCGGAT GTGAAGCCGT CGCTGTCCGA
CTACGCGAAG AAGAGGATTA CCCCTACTTC CAAACGGTGG GGTTCACGCA GGAGTTCGTG
AACGCCGAGA GCCATCTGTG CGTGAAGGAC TTGGCCGGCC AGATAAAAAG GGATGAGATC
GGGCATCCAG TTCTGGAATG CATGTGCGGT AACGTCCTGT GTGGGCGGTT TGATGATAGC
AAGCCTTTCT TTACTCCAAA TGGCAGTTTC TGGACAAACA GTACTAGCGA GTTGCTCGAA
AGTACTACTG AAGAGGAGCG GCAGTCGGGA ACCCGCAACA GGTGCAGCGC GGAGGGATAC
GAGTCGGTAG CCCTGGTTCC ACTGCACCAT AATGAAGATA TCATTGGGCT GATACAGTTT
AATGACCGAC GCAAAGACCG GTTCTCCAAA GGGTTTATCG CACTTGTGGA GATGTTGGCG
GACAGTGTGA CCGTTGCGGT ACTTCGGCGG TGGGAGGAAG AGAGCCTCAG GAAGCGTGAG
GAGTATTACC GCGCTATGGT GACAGCGTTC GACGGCCTCA TATACATCTG CTCGGCCAAT
TACCGGATCG AGTTTCTAAA TGAAGCTATG AAGCGGCGTA TTGGCCACGA TGCCATTGGC
GAATTGTGTT ATGAAGCTCT GCATGGGCTT GATGCTGTCT GCCCGTGGTG TAAAAATGAT
CGCATTTTCG CAGGCGAAAG CCATCGTTGG GTGGTCCAGA GCCCAAAGGA CAACCGTTGG
TACGAAGTGT CTAACACCCC TATAAGGAAA AACAACAAGA TCGTTTCCAA GCAGGCCATG
ATTGTGGACG TAACAGACCG GCAACAGTTG CACGAGGATC TTTTACAGAA ACAGCAAGAT
TTGATGTCGG CAAATGATCT ATTGGAAAGT CGGGTTGCCG AGCGGACTTC CAATTTGGAG
GCAGCAATGA GGGAACACGA GTCATTCAGT TACTCGGTTT CCCATGACCT GCGCGCCCCC
CTGCGTCACA TAAACAGCTT CAGCGCCATT GTCATGGAGG AATTCGGGAA TGAACTGCCT
CCCCCTGTGA AAGACTACCT GGTGCGGATT CGCTCAGCAT CAAACAGGAT GGGGGGATTG
ATCGACCACC TGCTTGAACT GTCCAGGGTA GGGAGGGCGG CGCTGAAACT GGAACCGGTG
GATTTGAGCG AGATGGCCGC GTCCATTTTG ACCGCCCTCC AGGAGACGGA GTTGAAGCGG
ACCGTGAAAA TCTTTGTTGA AGAGGATGTG CGGGTACTGG GGGACCGGAC GCTTTTGCAA
CAGTTGCTGC AGAATCTTCT TGGAAATGCC TGGAAGTACA CCTCCAAAAC CGTGAGTGCG
CGTATCGAAT TCGGCAGTTC AAAGCGAGGC GACAGCACGG TCAATTATGT CAAAGATAAC
GGCGCAGGGT TCGACATGCA GTACAAGGAC AACCTGTTCA TTGCCTTCCA GCGTTTGCAT
ACCGCGGAGT TCGAGGGCGA GGGGATCGGG CTTACGACCG CTCAGCGCAT AATCCATCGC
CACAGTGGGG ACATCTGGGC TGAGGGAGAA GTAGGAAAGG GGGCGACTTT CTACTTCACC
TTGCCGGATT AG
 
Protein sequence
MATEELLQDL KKRGELDERH LLASIRLLHL LASGKSSQEV MSTLLPFFQE LSGCEAVAVR 
LREEEDYPYF QTVGFTQEFV NAESHLCVKD LAGQIKRDEI GHPVLECMCG NVLCGRFDDS
KPFFTPNGSF WTNSTSELLE STTEEERQSG TRNRCSAEGY ESVALVPLHH NEDIIGLIQF
NDRRKDRFSK GFIALVEMLA DSVTVAVLRR WEEESLRKRE EYYRAMVTAF DGLIYICSAN
YRIEFLNEAM KRRIGHDAIG ELCYEALHGL DAVCPWCKND RIFAGESHRW VVQSPKDNRW
YEVSNTPIRK NNKIVSKQAM IVDVTDRQQL HEDLLQKQQD LMSANDLLES RVAERTSNLE
AAMREHESFS YSVSHDLRAP LRHINSFSAI VMEEFGNELP PPVKDYLVRI RSASNRMGGL
IDHLLELSRV GRAALKLEPV DLSEMAASIL TALQETELKR TVKIFVEEDV RVLGDRTLLQ
QLLQNLLGNA WKYTSKTVSA RIEFGSSKRG DSTVNYVKDN GAGFDMQYKD NLFIAFQRLH
TAEFEGEGIG LTTAQRIIHR HSGDIWAEGE VGKGATFYFT LPD