Gene Mext_4110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_4110 
Symbol 
ID5834270 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4571622 
End bp4573349 
Gene Length1728 bp 
Protein Length575 aa 
Translation table11 
GC content69% 
IMG OID641369901 
Productcircadian clock protein KaiC 
Protein accessionYP_001641551 
Protein GI163853508 
COG category[T] Signal transduction mechanisms 
COG ID[COG0467] RecA-superfamily ATPases implicated in signal transduction 
TIGRFAM ID[TIGR02655] circadian clock protein KaiC 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCGTCTG AATCCGCGAG CCCGCCCGAA GGCACTTCCA AGGGCGCTTC GAAGAGCGCT 
CCCGCGCTCG CGAAGGTCGC CACCGGCATC GACGGCTTCG ATACCATTAC TTTCGGCGGC
CTGCCGAAGG GGCGGCCCTC GCTGGTCTGC GGCGCGGCCG GCTGCGGCAA GACGCTGTTC
GCCACCACCT TCCTCGTCAA CGGCGCGACC CGCTTCGGCG AGCCCGGCGT GTTCATGAGC
TTCGAGGAGC GCGCCGAGGA TCTCGTCGCC AACGTCGCCT CCCTCGGCTA CGACCTCGAC
GGGCTGGTGG CGCAGGGCAA GCTCGCCATC GACCACGTCC GGGTGGAGCG CAGCGAGATC
GAGGAGACCG GCGAGTACGA CCTCGAAGGC CTGTTCATCC GCCTCGGCTT CGCGGTGGAT
TCGATCGGCG CCAAGCGCAT CGTGCTCGAC ACGATCGAGA CCCTGTTCGC GGGCTTTTCC
GACGAGACGG TGCTGCGGGC CGAACTCCGC CGCCTGTTCG GCTGGATCAA GGACCGGGGG
CTGACCGCGA TCATCACCGG CGAGCGCGGC GACGGCCAGC TCACCCGCCA GGGGATGGAG
GAATACGTCT CCGACTGCGT GGTGCTGCTC GACAATCGCG TCGAGGACCA GATCACGACG
CGGCGCCTGC GCGTGGTGAA GTACCGTGGC TCGGCCCACG GCACCAACGA GTACCCGTTC
CTGATCGATG CCGAGGGCAT CAGTGTCCTG CCGGTCACGG CGGCCGACCT CGACTACACC
ATCGCCGAGG GTGTGGCCTC GACCGGCATC CCCGGCCTCG ACGCGATGCT CGAACCCGGT
GGCTTTCACC GCGGCACCAG CATCCTCGTG TCGGGCGAGT CCGGCACCGG CAAGACCATG
ATCACGTCGA GCATGATCGC CGCGGCCTGC GCGCGCGGCG AGCGCTGCAT GTCCTTCGTG
TTCGAGGAGA GCGGCGACCA GATCATCCGC AACGCCCGCT CGATCGGCCT CGACCTCGCC
CGCCATGTCG AGGCGGGCCT GCTGCGCTTC GAGGCGGCGC GTCCGAGCCT CTACGGCCTG
GAAATGCACT TGGCACGCAT GCACCGGGAC ATCGACCGCT TCGCGCCCAC CGTGGTGGTG
GTCGATCCGC TCTCGGCCCT GCGCGGTCCG CCGGCCGAGC TTCAGGCGAC GATGCTGCGC
ATGGTCGACA TGCTGAAGAG CCGCGGCATC ACCGCGGTGT TCACGAGCCT GCGCGAGGAT
GGCAGCCTCG ACCACGACAG CAATATCGGC GTTTCCTCGC TGATGGATGC CTGGATCAAG
CTCCTCAACA TCGAGGCCAA CGGCGAGCGC TCGCGCACGC TCTACGTCAT CAAGGCCCGC
GGCATGCGCC ACTCGAACCA AGTGCGCGAG TTCAGCATGT CGGGCGACGG CATCACCCTC
GTCGAGGCCT ATATCGGCCC GGCGGGCGTG CTGACGGGCA CCGCCCGCGT CGTGCAGGAG
GCGGAGGAAG CCGCCGCCGT TCTGCGCCGC GAGCAGGAGA GCCGCCGACG GCGGCGCGAG
GCGGAGCGGC GGCGCCAGTC GCTGGAGCGC CAGATCGACG AACTGCGCGC CACCCTGGAA
GCCGCGGCAG AGGAAGAGGC GGTGCTGTTG AGCGAGGACG AGATGCGCGA GGCGATGCTG
ACGAGCGAGC GGCGCATCCT CTCCACCCGC CGGGGAGGCA CGCGATGA
 
Protein sequence
MASESASPPE GTSKGASKSA PALAKVATGI DGFDTITFGG LPKGRPSLVC GAAGCGKTLF 
ATTFLVNGAT RFGEPGVFMS FEERAEDLVA NVASLGYDLD GLVAQGKLAI DHVRVERSEI
EETGEYDLEG LFIRLGFAVD SIGAKRIVLD TIETLFAGFS DETVLRAELR RLFGWIKDRG
LTAIITGERG DGQLTRQGME EYVSDCVVLL DNRVEDQITT RRLRVVKYRG SAHGTNEYPF
LIDAEGISVL PVTAADLDYT IAEGVASTGI PGLDAMLEPG GFHRGTSILV SGESGTGKTM
ITSSMIAAAC ARGERCMSFV FEESGDQIIR NARSIGLDLA RHVEAGLLRF EAARPSLYGL
EMHLARMHRD IDRFAPTVVV VDPLSALRGP PAELQATMLR MVDMLKSRGI TAVFTSLRED
GSLDHDSNIG VSSLMDAWIK LLNIEANGER SRTLYVIKAR GMRHSNQVRE FSMSGDGITL
VEAYIGPAGV LTGTARVVQE AEEAAAVLRR EQESRRRRRE AERRRQSLER QIDELRATLE
AAAEEEAVLL SEDEMREAML TSERRILSTR RGGTR