Gene Emin_0082 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0082 
Symbol 
ID6263981 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp85825 
End bp87702 
Gene Length1878 bp 
Protein Length625 aa 
Translation table11 
GC content41% 
IMG OID642610543 
Producthypothetical protein 
Protein accessionYP_001874985 
Protein GI187250503 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.437143 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.000000932553 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGATACT GGATTTTGGA AAAGGGAGAT GTTATCGGCC CTTTTGAGGC GGAAGAATTA 
AAGAAAAGGG AAGATTTTTC TTTAGCCAAT CTTGTGTGCC CGGAAAGCCA CGGGGACGAC
CATACTCATT GGAGAGAAGC GTCCTTTTAC GCTGATTTCA CACCGCAAGA CTCAAAGCCT
AAAGCCCAAA ACACAGATAA ACCCCAAACC GTTACCCCGC CCCCTAAACA AGAAACTAAA
AAAGAGGAAA ACCCCCTTTC AATAGAAGAA TATTTTGATA AGATATACCA CACTGAAAGC
CATGAATTAA GCAATGTTTT AGGTATCCCC GACAATCTTG AAAACTCGGA TCTCTATTTA
GACCGTCTTT TGAAAAAAGA ACTGGGTAAG GAAAAGCGTT CCGCTAATAA AGAAAAATCT
TTTGGCGCTA AAGGAAAAAT TCTTTCTGCA AATGAAAAAG AACAGGTAAA AGAGGAAAAA
AAACAAGACA GGAAAACTAC GCCCGCACCG CCGCTGACAA CCCGGGAAAA AGCAACTCAT
AATGAAACAA AGGAGCAAAA GGCGGATATA CCTTTAATCC CCGCGTTAAA AGAACCTTCC
GCTAAAGAAG AAACGAAGGT TAATTCTTCT TTATCCGCCT CGGCCCAACC TGCGGAACAA
AAAAATATTC CTCCCAAAAA AGAGGAAATT AAAGAAACTC CGCCTTCTGC AATAAAAGAA
AAAAAACCTC TGCAGCCGCA AATACCCGTT ATTAAAGAAC CCGCGCCGAC GCAAAAAAAA
GAAGAGCTCG TTAAAATTGA AACAAAAAAA GAGCCTGCGC CCGCAGCAAC TAAACAAAGT
AAAGAAGCCT CTCAACCCGA AATAAAGAAA GAAACTGAAA CGCCGGTTGT GAAACAGGAT
AATCCTGTGA GTAAAGATAA CATTCGGCCC GGGGCTCCCC AACAAAAAAA AGAATCTCCT
AAAAAACCTG GGCAGCCAGA AGAAAAAAAA GAAATTACGC CTGCAACCGC CAAACAAGAA
AATCTACCTG TAATTTCAGA AGCTCAGCCC GCGCCTATTA AAGAAGATTT TTTGCAGCCT
GATATTTTAA AAATTATTGA AGAAGAAGAA AAAACTTTAC CACAACAAGA GTCCCTGAAA
GAAGCCCCTG TCTTAAAACA AGAATCACAT GTGGAAGGAC AGCCTTTAAT AGAGCCCATT
AAAACAGTGC GTCCGGAAGA TGCCTCCGAA GAAGAGGAAA ATCCTGTCGC TTTCGGTGTG
CAAAGAAAAC ATATTCCGGC AGATTCTTTT GTAGAACATA AAAAAGAAAG AAGCGCAATT
GACCCAAGAA CAAAAAAAGA AAGCGTAAAA CCTGTTATTT TAACCGCAGG CACCATTTTT
ATACTTTTAA CGGTTGGATT ATTTATTTTA CTTTCACAGG CAAAAAAATC TTCCGAACAG
CAATTCAAAG CCGCTCCGCT GCCTGTTAAA CAGGAAAGGC CTGAATTAAT AAAAACTGAT
CCGCAGCCAA AAATGCCGGC GCAAAAAGTG CAAAAAGCCC CGGCCGCCCC CGCACCTTTA
CCGCCGCCTC CCGCTAAAGC AGCCGTTGAC TTAACCGTAC AGGAAAGAGC TGTGGACATA
GCTAAAAACC ACATGCTTAA AAAGAAAGGT ATGTCTATAG ACGGTTTTTT AAATTCTTAT
TTTGAAGAAT ACGTAAAACA AGGATATACG GCATCCTGGT CGGCCGAACC GCTTCATAAA
GACATTTACA TAGTAAAATA CAGGCTTGTA AAACCCAGAA AAGAACCCGT TATTTATATA
TTTGAAGTAG ACACAAAAAA GAACATCGTG TCAGGGGCTT TAAATAACTG GTCACTTGAC
CTTCTAGATA TGCAATAA
 
Protein sequence
MRYWILEKGD VIGPFEAEEL KKREDFSLAN LVCPESHGDD HTHWREASFY ADFTPQDSKP 
KAQNTDKPQT VTPPPKQETK KEENPLSIEE YFDKIYHTES HELSNVLGIP DNLENSDLYL
DRLLKKELGK EKRSANKEKS FGAKGKILSA NEKEQVKEEK KQDRKTTPAP PLTTREKATH
NETKEQKADI PLIPALKEPS AKEETKVNSS LSASAQPAEQ KNIPPKKEEI KETPPSAIKE
KKPLQPQIPV IKEPAPTQKK EELVKIETKK EPAPAATKQS KEASQPEIKK ETETPVVKQD
NPVSKDNIRP GAPQQKKESP KKPGQPEEKK EITPATAKQE NLPVISEAQP APIKEDFLQP
DILKIIEEEE KTLPQQESLK EAPVLKQESH VEGQPLIEPI KTVRPEDASE EEENPVAFGV
QRKHIPADSF VEHKKERSAI DPRTKKESVK PVILTAGTIF ILLTVGLFIL LSQAKKSSEQ
QFKAAPLPVK QERPELIKTD PQPKMPAQKV QKAPAAPAPL PPPPAKAAVD LTVQERAVDI
AKNHMLKKKG MSIDGFLNSY FEEYVKQGYT ASWSAEPLHK DIYIVKYRLV KPRKEPVIYI
FEVDTKKNIV SGALNNWSLD LLDMQ