Gene GM21_3146 details


Gene Information

Locus tag: GM21_3146
Symbol:
ID: 8138497
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 3649660
End bp: 3651225
Gene Length: 1566 bp
Protein Length: 521 aa
Translation table: 11
GC content: 61%
IMG OID: 644870750
Product: PAS/PAC sensor signal transduction histidine kinase
Protein accession: YP_003022931
Protein GI: 253701742
COG category: [T] Signal transduction mechanisms
COG ID: [COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase)
TIGRFAM ID: [TIGR00229] PAS domain S-box
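
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions not stated in the record: that the start and end coordinates are 1-based and inclusive, and that the gene length includes the stop codon. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3649660, 3651225)
print(length)                  # 1566
print(protein_length(length))  # 521
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of this record.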


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 137
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGATA AAGATCTCAA TTACAACTCA GGCACGCGTA AGCAGCAGGA GCCCGGGCGA 
CTGCAGGGGG ACCTACCCTT CCGGCTGATG GTGGAGCAGG TGAAGGAGTA CGCAGTCGCG
CTTTTGGATC CGCAGGGAAA CATCTCCTCG TGGAACGCCG GGGCGGAGCA GATCACCAGG
TATCGGGGCA GCGAGGTGAT CGGGAAGCAT TTTTCTCTGC TCCATCCCAA GGAGGAACTC
CGTTCGGGGG CGCCGGAAAG GGAGCTCGCC ATAGCACGCT CGCTGGGGAG CCTTGACCTG
CAGGGGTGGC GGCTGAAAAA AGACGGCAGC CGCTTCTGGG CTGGAATCAC GCTCACCGCT
ATCTATGATC AAGAGGAGAA TCTCGCCGGA TTCGCGCTCT TCGCACACGA CGACAGCGAG
AAACGCGCCT CGGACGAGGC GCTGCTTAAA AGCCGCAACA TGCTGGAGCG GCTCTTCGAG
ACGGCTCCGG ACGGCATCGT GGTGGTCGAC GGCAACGGCG TCATCCGCAG GACGAACCAG
CAGGCGGAGA TCACCTTCGG CTACATGCGG GAGGAGATGC TCGGGCAGCG CATCGAGCTC
TTGATTCCGG AGCGGTACCA CAAGCGCCAT CGCCAACACC GGCGCAACTA TTTCGCCGAC
CCGCGCGCCC GCAAGATGGG GATCGGCCTC GAACTGTACG GACGCAACAA GGACGGTCAC
GAGATCCCGG TGGACATCAT GCTGAACCCC ATCGAGACGC CGGACGGGAC CTGGGTCTTC
GCCGTGATCC GCGACATAAC CAGCCAAAGA CAAGGCGAGG CGAAGATACT GGAGTTGAAC
CTTGCCCTGA GAAACCAGCT CGAGCAGTTG GGGGCAAGCA ACAGGGAGCT GGAATCCTTC
AGTTACTCCG TTTCTCACGA CCTGCGGGCC CCGTTGCGCC ACATCATCGG GTTCGTAGAC
CTGTTGAACG CCAAGGCCGC GAACGTTCTG GACGAGAAGA GCCGTCACTA TCTGGAGGTA
ATCAGCGACG CGGCCAACAA GATGGGATTG CTGATCGACG ACCTGCTGGC CTTCTCGCGC
ATGGGGCGCA GCGAAATGAT GAAGGGGTGG GTAGACCTGG GACTGCTGGT GAGAGAGATA
GTGAACGACC TGGAAAGCGA CAGCAAGGAG AGGGAGATCC AATGGGACAT AGCCCCGCTC
CCCATCGTGC TGGGTGATGC GGCAATGCTG CGCCAGGTGC TCATCAACCT CGTCGGCAAC
GCGGTCAAGT TCACCCGTTC GCGGGAAAAA GCAAGAATTG CCATCGGCGC CATCGACCGG
GAGCAGGAAA CGGAGATCTT TGTAAGGGAC AACGGGGTCG GGTTCGACGA GGCTTACGCG
AGCAAGCTTT TCGGCCTTTT CCAGCGTCTG CATGCCAATG AGGAGTTCGA GGGAACCGGG
GTCGGGCTGG CTATCGTGCA GCGGATCGTA CTGCGGCACG GCGGCAGGGT CTGGGCCGAG
GGCGAGGTCG ATGGCGGAGC GACCTTCTGG TTCTCGCTCC CGAAGGGAGT AAACCCGGTA
CCCTAG
 
Protein sequence
MTDKDLNYNS GTRKQQEPGR LQGDLPFRLM VEQVKEYAVA LLDPQGNISS WNAGAEQITR 
YRGSEVIGKH FSLLHPKEEL RSGAPERELA IARSLGSLDL QGWRLKKDGS RFWAGITLTA
IYDQEENLAG FALFAHDDSE KRASDEALLK SRNMLERLFE TAPDGIVVVD GNGVIRRTNQ
QAEITFGYMR EEMLGQRIEL LIPERYHKRH RQHRRNYFAD PRARKMGIGL ELYGRNKDGH
EIPVDIMLNP IETPDGTWVF AVIRDITSQR QGEAKILELN LALRNQLEQL GASNRELESF
SYSVSHDLRA PLRHIIGFVD LLNAKAANVL DEKSRHYLEV ISDAANKMGL LIDDLLAFSR
MGRSEMMKGW VDLGLLVREI VNDLESDSKE REIQWDIAPL PIVLGDAAML RQVLINLVGN
AVKFTRSREK ARIAIGAIDR EQETEIFVRD NGVGFDEAYA SKLFGLFQRL HANEEFEGTG
VGLAIVQRIV LRHGGRVWAE GEVDGGATFW FSLPKGVNPV P
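
The sequences above are laid out in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch, not the database's own tooling: it strips the whitespace, computes a GC percentage, and translates a coding sequence. The helper names are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares; the alternative start codons that table 11 allows are not handled, on the assumption that the sequence starts with ATG as this one does.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT symbols
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
fragment = clean_sequence("ATGACCGATA AAGATCTCAA TTACAACTCA")
print(translate(fragment))  # MTDKDLNYNS
```

The translated fragment matches the first block of the protein sequence. Passing the full gene sequence through `clean_sequence` and then `gc_percent` or `translate` would allow the GC content and protein sequence fields to be checked in the same way.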