Gene GM21_2070 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2070 
Symbol 
ID8137406 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2400102 
End bp2402039 
Gene Length1938 bp 
Protein Length645 aa 
Translation table11 
GC content59% 
IMG OID644869685 
ProductPAS/PAC sensor signal transduction histidine kinase 
Protein accessionYP_003021880 
Protein GI253700691 
COG category[T] Signal transduction mechanisms 
COG ID[COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.0000000000000789609 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGCAAG GTGACGACGC GGCCATGTCC ATGGACGCGG AAAAACTCGC AGCCATGGTG 
GAATCGTCCG GCGACGCGAT TATCGCAATG AACGTCGACG GCACCATCAC CAGTTGGAAT
CCGGCGGCGA CAAAGCTCTA CGGCTATACC GCGAAGGAAG CTGCAGGCTG CGACATCTCC
TTTCTGGAAT CGCCGGAGCG ACCCGGCGAG ATCTCGGCGG AACTGCAACA GGTGCAAAGG
GATAGGCGAT CCCGGCACTT TGACCGCTAC CGCCGGCGCA AAGACGGCAG CCTGGTTTTC
GTCTCGCTGA CGCTCTCCCC GATACTGGAC AAGCAAAATA CGCTGATCGG CCTTTCCTCC
ATCGCCCGAG ATGCGGAGGA ACTGGTACAG CAACGTACCA AGGAACTGAT GCAGGCCATC
CAGGCGCTGC AGGTTGAGAT CGCCGAGCGA CGCAGGGCAG AGGATGCGCT TACCCGGAGC
GAGGATATGC TCCGCTTCGC TGCATTGGCG GCGGACATCG GCATGTGGCA CTACGATCTG
TTGACAGGGG ACCTGGTCTG GAGCGACAGG TGCAAGGAGT TGTTCGGCTA CTCTCACGAC
TTTCAGATGA CCTATGAAGC TTTCCTCGAC GCCGTCGCGG AAGAGGACCG GCAGGGGGTA
GACCAAGCGG TTCAGAGATC CTTGCAGGAA AAATCCGAAT ACGCCGTAGA GCTGAGGGTG
ATGCTGCAGG ACGGCCAGGT GCGTTGGGTA ATGAGCAAAG GGCACGCCTT CTACGATAGC
CAAGGGAAAC CGTTGCGGAT GGCAGGCATA GCCTTGGACA TCACCCAGAG GAAAAAAACG
GAAGCGGCGT TACTGCGGGC CAAGGAGGAA TGGGAAAGCA CTTTCAACAG CGTCCCCGAT
CTCATCGCCA TACTGGACGA AAAGTGCCGC ATCGTCCGGG TCAACGAAGC CATGGCGCAG
CGGGTCCATA TCAACCCCGA CGGATGTGTC GGGCTTTTTT GCTACCAGGT CCTTCACGGG
GAGAATGCGC CGCCGCATTT CTGTCCCCAC GGCCAAAGCT TAATGGACAA TATGCAACAC
ATTGCCGAGG TCTACGATCC CCACCTGACC GGAACCTTTC TGGTCAGCAC CACGCCGCTT
GTGGCCGCCG ACGGCAAATC GATCGGAACG GTTCATGTCG CCAGGGACAT AACGGAGCGC
AAGAGGGCGG AGGAGGAGAT CGCCCGGCTG AACGCCGATT TGGGGGCGCA TGTCGCCGAA
CTGGAGGAGA GAAACCAGGA ATTGGACGCC TTCAACCGCA TGATATCCCA CGACCTGCGG
CAGCCCTTGA ACATCATGTC CCTTGCGGGC CAGCACATCG ATATGCTGTG CAGCAGCGAT
AACCCCGAGT GCCGGCAAAG CGTGCGGACG CTCGAACAGG CGGTATTGCG CATGAACGCC
ATGATTGAGA CGCTGCTCTC CTTCTCGCGT TCCACGCATG GGGATCTGTT GCGCGAGGAT
TTGGACATCA GCGAGACGGT GCAGGTGATA CTCGCTGAAT TATGTCTGGC CGAGCCTGTG
CGCCGGATAA GGACCGTGAT AGAGGAAGGG GTCATGGTCA ATGCCGACCC CCGGCTGTTG
CGGACCGCGC TGGAAAACCT TCTGGGAAAT GCCTGGAAAT ACACCGGCGG CCGTGAAGAG
GGATACATCG AGTTCGGTGT GAGGGGAGGG GAGGTAGAAC CGGTCTACTT CATCAAGGAC
AACGGGACTG GGTTCGACAT GGCCGATGCG GATAAGCTCT TCGTCCCGTT CCAGCGGCTG
GCAGGGGCCG ACGCGTTCAA AGGATCGGGC ATCGGCCTGG CGACCGTGGA AAAGATCATC
AAGCGGCACG GCGGAAGGAT CTGGGCAGAG GGGGAGCCGG ACAAGGGAGC CACCTTCTAC
TTCACGCTTA AAAGTTGA
 
Protein sequence
MKQGDDAAMS MDAEKLAAMV ESSGDAIIAM NVDGTITSWN PAATKLYGYT AKEAAGCDIS 
FLESPERPGE ISAELQQVQR DRRSRHFDRY RRRKDGSLVF VSLTLSPILD KQNTLIGLSS
IARDAEELVQ QRTKELMQAI QALQVEIAER RRAEDALTRS EDMLRFAALA ADIGMWHYDL
LTGDLVWSDR CKELFGYSHD FQMTYEAFLD AVAEEDRQGV DQAVQRSLQE KSEYAVELRV
MLQDGQVRWV MSKGHAFYDS QGKPLRMAGI ALDITQRKKT EAALLRAKEE WESTFNSVPD
LIAILDEKCR IVRVNEAMAQ RVHINPDGCV GLFCYQVLHG ENAPPHFCPH GQSLMDNMQH
IAEVYDPHLT GTFLVSTTPL VAADGKSIGT VHVARDITER KRAEEEIARL NADLGAHVAE
LEERNQELDA FNRMISHDLR QPLNIMSLAG QHIDMLCSSD NPECRQSVRT LEQAVLRMNA
MIETLLSFSR STHGDLLRED LDISETVQVI LAELCLAEPV RRIRTVIEEG VMVNADPRLL
RTALENLLGN AWKYTGGREE GYIEFGVRGG EVEPVYFIKD NGTGFDMADA DKLFVPFQRL
AGADAFKGSG IGLATVEKII KRHGGRIWAE GEPDKGATFY FTLKS