Gene Emin_0928 details

Gene Information

Locus tag: Emin_0928
Symbol:
ID: 6262643
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 1028228
End bp: 1030513
Gene Length: 2286 bp
Protein Length: 761 aa
Translation table: 11
GC content: 40%
IMG OID: 642611407
Product: serine/threonine protein kinase
Protein accession: YP_001875818
Protein GI: 187251336
COG category: [K] Transcription; [L] Replication, recombination and repair; [R] General function prediction only; [T] Signal transduction mechanisms
COG ID: [COG0515] Serine/threonine protein kinase
TIGRFAM ID:
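The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions not stated in the record: that Start bp and End bp are 1-based and inclusive, and that Gene Length counts the terminal stop codon. The function names are hypothetical. Under those assumptions the listed values agree with each other (2286 bp, 761 aa).

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the gene length
    includes one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(1028228, 1030513)  # Start bp, End bp from the record
    print(length, "bp")
    print(protein_length_aa(length), "aa")
```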


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000612403
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 1.21149e-16
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker

Sequence

Gene sequence
ATGGCTGAAG AGACAAAAAA AGAAGACACT TTAATCGGGC AGGATATTTC CGGCTGTGAA 
ATTTTAGAAA AAATAGCGCA GGGCGGCATG GGCGCCGTTT ATAAAGCTAA ACACAAAGCT
TTGGAAAGAA TTGTCTGTGT TAAAATTTTA AGTCCTTCTT TAGCGGAAGA TGAAAAGGCC
GTTGAGCTTT TTATGACAGA AGCCCGTGCT ATCGCCGAGC TTGACCACCC CAACATTGTT
AACGTATATA ACGTAGGCCG TGAAAAAGGA CTTTACTTTA TAGTAATGTC TTTTATTTCC
GGCGACACGG TTTCAAATAT AGTCCGCAGA AGGCCCAATT TGCCGATAGG TTTTGTTCTT
AATATTTTCC AAGGCGTGCT TAAAGGCTTA TCGGTAGCGC ATGAAAAAGG CATTATTCAC
CGTGATATAA AACCATCTAA CATTTTAATT AACGAAAAGC TTGAAGCTAA AATCGTTGAC
TTCGGTATAG CCAAAAAAAT TGAAAAAGAT AAAACCGCCA CTAAAACAAC GGAAATGGCG
GGTACGGCTT ATTTTATTTC ACCTGAGCAG GCTTTAGGCG GGGAAATTGA CGTCCGTGCG
GATTTGTATT CCGCAGGCGC TACTTTGTTT TTTATGTTAA CGGGTCAGTT TCCGTATAAG
GGCAAAAACT CAATTGATAT TATTCAAAAA CATATTAATG ACCCTATTCC CAGCCCGGGC
GACATAAGGT CGGATATTCC GGCATGGCTT TCTCTTACGG TGCAAAAGCT TATGGCCAAA
AAACCTAATG ACAGGTTTCA GAGCGCGCAG GAAACGCTTG ACCATATAAT GAAAATGCGT
GAAGAAGAGC AATTTAAAAT AAAAAGAGGG GCGGACGCTG TTTTAGATAT CCGTTCGGAA
GTGCCTTTAA AGGTGCGTCC AGAAAATATA CGCAGCAATA CTGAAAGTAT AAGGCTTAAA
AGGTTTGCCC AAAGCAACAT AAAAAAACTT AAGCCCGATA CTATTACATC AACCGGTCCC
AGAATGCCAA TTATAGGAGC AGGCGGCAAA GCTTCCAAAG CGCCTCCGGC GCCGATACCG
GAAACCAAGG CCGCGGCGGA CCTTGCCTTC TTAGAGGATC CTTCAGTAAC AAGAAAAAAA
TTGGAAAAAG AAGCCCGCAG TTTAAAACGT TCCACCAGCA AACTTCCAAA AGCGTTTGCT
AAAGTTTTAT TTCATATTCC TGTGTTTTTT ATTATTTCAA TGTTTTCATC TTACGTGCTT
TATAACTTCG GTAAAGCGGC GGCTTCTGTT TTAGCGGAGG GTGAAAGCGC GACATTTTTT
GGTTCTTTTA AATTTTTTGC CAGCATGGAC GCTTTTACCG CCAATCCTTC AATAACGGTG
CTTACGCTTG CCGTATTTAT AATGATGTTT ATAATGCTTT GTTTAAAAGC ATACGCAAGG
CATACGCTTT CTTTGATAGC TGCTTTAGGT TTTGCATATC TGGCAGGTTT TTTTAACGTT
CCCGCCGGAT TCGGGCAGGC TTTTTCCCAC GGGTTGGAAA ATATGTCTTT TCAAGACTAT
ACTTTGATAT ACATGGTTAT AACGGGTGTG TTTGCCTGGG GTATTATGAT GAGCAAAACG
CCCTCTTTTC CCATGCGCGT ACTTTGCGCA ATGGCTGTTT TGCTTACTAT ATTATTTTGT
AAAACTTTTG TAGATCTTAA CGTTCCGCCT TCGGAAGATC CTGTGGTTTC TTTAATTTTC
TATTCCGCGG TTCTTTTAAC GGTGTCCTGC ATAGCCGTGG TTTTACCGAG AACATTTATT
TTCCACAGTC TTTTACCCAC TATTTTACTT TTTATGAGTA TAGCCGCTGT TTGGGGTTAT
ATGATATCGG GCAAAGTTTA TTCTGATGTT GACATATTAA TGGAAACAAA AGAAATAAGC
AAAATTACTC CCGCTAAGGA TAAACCCGCT CTGCAATTCA GTAAACCTGT GCTTGATTTA
AATTTTGGTT CAGGTCTTTT GGGGATAGCT AAAATGGTTG AATCCACTGC CGAAGCAACT
TCGGAGGACC CGAAAAAAGC CTCTGCGGAA AAAGACAGTC CGTTTAAAAA ATATGTCGAT
ATTTATACGG CAAAAGGCAA ACGCGCCATG GTAAGGGAAG TTTGGAAAGA TGACGCCGTG
TTGCCTTTTA CCAGAGTGTT GATGAATAAT GAAGAATCAC TTATTTTTAA ATACGCAGTG
GTTTTGATAC TGCTTTTGGG TAATATTTAT TTTGTTGTTC ATACCATAGC CAGAAAGGAT
TTATAA
 
Protein sequence
MAEETKKEDT LIGQDISGCE ILEKIAQGGM GAVYKAKHKA LERIVCVKIL SPSLAEDEKA 
VELFMTEARA IAELDHPNIV NVYNVGREKG LYFIVMSFIS GDTVSNIVRR RPNLPIGFVL
NIFQGVLKGL SVAHEKGIIH RDIKPSNILI NEKLEAKIVD FGIAKKIEKD KTATKTTEMA
GTAYFISPEQ ALGGEIDVRA DLYSAGATLF FMLTGQFPYK GKNSIDIIQK HINDPIPSPG
DIRSDIPAWL SLTVQKLMAK KPNDRFQSAQ ETLDHIMKMR EEEQFKIKRG ADAVLDIRSE
VPLKVRPENI RSNTESIRLK RFAQSNIKKL KPDTITSTGP RMPIIGAGGK ASKAPPAPIP
ETKAAADLAF LEDPSVTRKK LEKEARSLKR STSKLPKAFA KVLFHIPVFF IISMFSSYVL
YNFGKAAASV LAEGESATFF GSFKFFASMD AFTANPSITV LTLAVFIMMF IMLCLKAYAR
HTLSLIAALG FAYLAGFFNV PAGFGQAFSH GLENMSFQDY TLIYMVITGV FAWGIMMSKT
PSFPMRVLCA MAVLLTILFC KTFVDLNVPP SEDPVVSLIF YSAVLLTVSC IAVVLPRTFI
FHSLLPTILL FMSIAAVWGY MISGKVYSDV DILMETKEIS KITPAKDKPA LQFSKPVLDL
NFGSGLLGIA KMVESTAEAT SEDPKKASAE KDSPFKKYVD IYTAKGKRAM VREVWKDDAV
LPFTRVLMNN EESLIFKYAV VLILLLGNIY FVVHTIARKD L
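
The protein sequence above can be reproduced from the gene sequence by conceptual translation. The following is a minimal sketch, not the pipeline that generated this record. It assumes the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code, and it ignores table 11's alternative start codons, which is harmless here because the gene begins with ATG. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical.

```python
from itertools import product

# Codon order follows the NCBI convention: each base position cycles T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base in frame 1, stopping at the first stop codon.

    Assumption: the first codon is an ordinary ATG start, so no special
    handling of alternative start codons is needed.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence, as formatted in this record.
    print(translate("ATGGCTGAAG AGACAAAAAA AGAAGACACT"))  # MAEETKKEDT
```

Passing the full gene sequence to `translate` and `gc_fraction` allows a comparison against the listed protein sequence and the reported GC content.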