Gene Emin_0145 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0145 
Symbol 
ID6263137 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp152767 
End bp154668 
Gene Length1902 bp 
Protein Length633 aa 
Translation table11 
GC content40% 
IMG OID642610609 
Producthypothetical protein 
Protein accessionYP_001875047 
Protein GI187250565 
COG category[S] Function unknown 
COG ID[COG4907] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones78 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTAAAT ACTTTTCTTT TATTTTTTTA ATGCTTTCAC TTCCTCTGTG CGCGCAGGAG 
CATATAAAAA GTTTTAACGT TTTTGCCCAG GTCTATAAGG ACGGTACCGC AGTTATTACC
GAATATATTA CGGTAAATGT TGAACACGAG CAAATAAAAC GCGGCATATA CCGTGATATT
CCCAGAAAAT ACACTAATAA ACGCTTTTTG GAAGCCGAAT TGGGAATAGA ACCGCTGTCT
TTAAAAAGAA ACGGACTGCC CGAGCATTTT TTTACTGAAA GCCCCGACAG GTACACGCTT
AGAGTAAATT TCGGGGATGA TAATTTTATC CCCAAAGGCG AGCACACTTA CGAGTTTGAG
TATTTTGTAA AAAACGCCGT TGTTTTTGAA GCGGATTCTG ACGAATTTTA CTGGAACGTT
ACGGGCAATT ACTGGCGGTT TGCTGTTTTA TCCTCCCGTT TGGACTTAGT TTTACCCGAG
GAGGCGCAAA TAAATAAAGA TTTAATTTCT TTATATTCGG GACCCAAAGG TAATAAAATC
TGCGACTCCT GCGAAATTCA ATTTACAAAC CGGTTTGCGG CCTCGTTTAT TAACAACCGT
GTCCTTAACC CGAAAGAAGG TTTTACCGTG GCTGTGCCGT TCCAAAAAGG CGTTATTACA
AGGCCGCCCA CAGAAGATAT CTTGAAAGAT TTTATAATGA ACCCCACTCC TGTTATCTTA
TGTTTGGTTG TTTTAATTTT AGGCTCGGCA TATTTGGCTT TGGGCTGGTT TCTTTTTGGC
ATAGACCCTA AAAAAGGAAC TATTATCCCT TTATATGAGC CGCCTGCGGA TATTTCCCCC
GCTAAAGCGC TTTATTTATA TAGAAGAGGC AAAATATCAG ACGCGGCATT ATCTCAAACA
ATCTTAATAA GTTTGGCCTC TAAAGGCATT TTGGAAATAA AACACAAAAA ACATTCTTTG
CAAGACGTTA AAGTTTTTGG GAAACAGCTT TTCCCAAAGG AATTTTATTT AACAAAAAAT
TTTCATCCCG AAATAATTCT CTCCGAAGAA GAAAAATCTT ATTTCTCCGC TCTTCCTGCG
GGGCAGCTGG CTCTATCCAA TACATACTAT GAATATTTTA AAAAAGCTGC CGGATGTGCT
AAATCACAGT TAAAAAATTT TTTCAAGAAT GATTATTTTA AAAACAATTC GCTTTGGGCT
TTGCCTTACA AGTTAGCCTG TGCGGCGGCG GTTTTTTATA TGGCAATAGA CCTTTTTCCC
GTACAGCACA TAATCCCGGC CTTAGCTATA ACATTCTTTT TGTTACTGGC TGTTTTTAAT
AAAAGTTTGC AAAGCCTGGT TATTTTTTTG ATGCTGTTGA TATATTTTAG AACAATAGCG
GCAAACGCTC TACCCGAGAC GGCGGTATTA TTTATAATAT GCTTTTTAAT GCGGGGTGCT
GATTATATTT TTGACAAACT TATAGCCAAA TATACCCCCA AAGGCAGAAG CCTGATGGAC
CAAATTGAAG GTTTAAAACT ATACATTAAA GTAGGTGAAA AAGACAGGGT TAAACTTGCT
ACCCCCGAAA GCGCGGCCGA TGTTTTTTGC AATATCTTAC CTTACGCGAT AGCATTAGGC
CTGTCTAACG ATTGGGTAAA ATCATTTGAC GTTATGTTTA AAAACAACCA GGTTTCCACA
AAAAGAATAT CAACAAGGGG GTTTTCTTCT TTCATAAACT CAAAAAGCTT TTCAGCTAAA
ACTTTTTATA GCGCTTTAAA TAGTTTTAAC AGTTCAGCGA GACAGTCTTC TTCCCCAAAA
TCCTCAGGCG GCGGAAGAGG TGGCGGGGGG GGCGGTGGCG GTTCAGGAGG AAGGGGCGGC
TCCGGAGGCG GAAGAGGCGG CGGGGGCGGC GGCGGAAGAT GA
 
Protein sequence
MVKYFSFIFL MLSLPLCAQE HIKSFNVFAQ VYKDGTAVIT EYITVNVEHE QIKRGIYRDI 
PRKYTNKRFL EAELGIEPLS LKRNGLPEHF FTESPDRYTL RVNFGDDNFI PKGEHTYEFE
YFVKNAVVFE ADSDEFYWNV TGNYWRFAVL SSRLDLVLPE EAQINKDLIS LYSGPKGNKI
CDSCEIQFTN RFAASFINNR VLNPKEGFTV AVPFQKGVIT RPPTEDILKD FIMNPTPVIL
CLVVLILGSA YLALGWFLFG IDPKKGTIIP LYEPPADISP AKALYLYRRG KISDAALSQT
ILISLASKGI LEIKHKKHSL QDVKVFGKQL FPKEFYLTKN FHPEIILSEE EKSYFSALPA
GQLALSNTYY EYFKKAAGCA KSQLKNFFKN DYFKNNSLWA LPYKLACAAA VFYMAIDLFP
VQHIIPALAI TFFLLLAVFN KSLQSLVIFL MLLIYFRTIA ANALPETAVL FIICFLMRGA
DYIFDKLIAK YTPKGRSLMD QIEGLKLYIK VGEKDRVKLA TPESAADVFC NILPYAIALG
LSNDWVKSFD VMFKNNQVST KRISTRGFSS FINSKSFSAK TFYSALNSFN SSARQSSSPK
SSGGGRGGGG GGGGSGGRGG SGGGRGGGGG GGR