Gene Emin_1005 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_1005 
Symbol 
ID6263778 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp1093080 
End bp1094774 
Gene Length1695 bp 
Protein Length564 aa 
Translation table11 
GC content39% 
IMG OID642611485 
Producthypothetical protein 
Protein accessionYP_001875895 
Protein GI187251413 
COG category[R] General function prediction only 
COG ID[COG0661] Predicted unusual protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.0249187 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGGAT TTGTTTCAAA AACCATACGC CATACACACC GTTATACGGA AATTGTTACC 
ATTTTGGTTA AATACGGAAT GGGCGACATA ATGCGCAGCC TTAAAATAAC GGATATGTTT
CCCTTCACAA AAAAACTGCT TCCCAAAGTG GATAATAAAT CCGTTTCTTC TTTTAACAAA
TGGGAAAATA TAAGAATGGC ATTGGAAGAA CTTGGCCCAA CATTCATAAA ACTTGGGCAA
ATGTTAAGCA ACCGTCCGGA TATTATTCCC CTGCAATTAA TTAAAGAACT TGAAAAACTG
CAAGACACCG TACCCCCTTT TGAGCATGAA GAGGCTGTTA AAATAGTTGA AAGCGAACTT
GAAGGCAAAA TAAGTGAAAA TTTTTCCTAT TTCAGTAAAA AACCTATAGC CGCTGCCTCT
ATTGCACAAG TGCATAAAGC AAGGCTGCCT GATGCAACCA CCGTTGCTGT CAAAGTGCAG
CGCCCGGGTA TTGAAGAAAT TATTGGCGTA GACGTTGAAA TACTGCATAA TCTTGCCTCT
TTGGCGGGAA ATAATATCAC GGAATTAAAA TATTTTAACC CCGTGGGCAT AATTAAACAA
TTTGAAGAAC ATATAAAAGA AGAACTTGAT TTTAATAAAG AAAGACTTAA TATAGAGCGG
TTTCAAAGAA ATTTCCAAAA AGACGGACGC GTGCACGTAT TAAAAGCGCA CAAAAAATAT
TCCGCCAAAC GCGTGCTTAC CATGGAACTT ATAGAAGGCG TTAAAGTAAG CCGCATAGCT
GAGGAAAACC TTGAAGGTTA CGACCGCGGG CTGATAGCCA AAAACGGCGC GCAGATAATT
CTAAAACAGA TTTTCATAGA TGGCTTTTTT CATGCCGACC CACACCCGGG CAACATAATA
ATTTTAGAAA ACAATAAAAT ATGTTTTATT GATTTTGGTA TGATGGGCAG TTTGAGCCAA
TCGCAAAAAG ATGATTTGGG CACCTTAATA GTGGCGCTGA TGTACCGCAA CTCTTCACTT
GTGACAAGCA CAATACTTAC AATAGTTAAC AGGCCTGACC ACCTCCAAAC GCGCGAAATA
GAATACAGGG TTGAAAAGCT TATCGAGCGT TACATTGACT TGCCGCTTGA AGAAATAAAC
GTTGGCGAAT TGCTTTTATC TTTAACGCAA ATGCTGCCTG AATTTGAACT AAATATGCCG
CCAAACTTTT CTTTCATGGT AAAATCTTTA ATTACCATTG AAGGCGTGGG CCGCCAGTTA
GACCCCGAGT TTAGCGCAAT GGCGGTAATA AAAGAATTTT CCCAAACAAT AATAAAAAAC
CGCCTTAGCC CCAAAGGTTT TGCGATGTCT TCCATAATAA CCTTAATGGA AACAAAAAAA
CTTATTGAAA ACGCCCCCCG CGATATACGG GAGATTCTCA ACAAAGCAAA ACAGGGGCAT
ATAAAAATTG AGTTTGAGCA CAGACATCTG GGCAAATTAA GAAGAAGCTT GGAAGAAGCC
AGCAATCGTC TGGTATTTGG CATTGCGCTG GGCTCGCTTA TAATAGGTTC GTCTATCATG
GTGCACGCAA ACATAGCTCC CAGATGGAAT GATATTCCGG TAATAGGTTT AATAGGTTTT
TTAGTAAGCG GATTTATGGC GGCGTACATA CTGCTGTCCT CAGTTTACGA AAATCTGAAA
AAAAGAAAAA AATAA
 
Protein sequence
MFGFVSKTIR HTHRYTEIVT ILVKYGMGDI MRSLKITDMF PFTKKLLPKV DNKSVSSFNK 
WENIRMALEE LGPTFIKLGQ MLSNRPDIIP LQLIKELEKL QDTVPPFEHE EAVKIVESEL
EGKISENFSY FSKKPIAAAS IAQVHKARLP DATTVAVKVQ RPGIEEIIGV DVEILHNLAS
LAGNNITELK YFNPVGIIKQ FEEHIKEELD FNKERLNIER FQRNFQKDGR VHVLKAHKKY
SAKRVLTMEL IEGVKVSRIA EENLEGYDRG LIAKNGAQII LKQIFIDGFF HADPHPGNII
ILENNKICFI DFGMMGSLSQ SQKDDLGTLI VALMYRNSSL VTSTILTIVN RPDHLQTREI
EYRVEKLIER YIDLPLEEIN VGELLLSLTQ MLPEFELNMP PNFSFMVKSL ITIEGVGRQL
DPEFSAMAVI KEFSQTIIKN RLSPKGFAMS SIITLMETKK LIENAPRDIR EILNKAKQGH
IKIEFEHRHL GKLRRSLEEA SNRLVFGIAL GSLIIGSSIM VHANIAPRWN DIPVIGLIGF
LVSGFMAAYI LLSSVYENLK KRKK