Gene Emin_1310 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_1310 
Symbol 
ID6263858 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp1412445 
End bp1413674 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content42% 
IMG OID642611789 
Producthypothetical protein 
Protein accessionYP_001876197 
Protein GI187251715 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4591] ABC-type transport system, involved in lipoprotein release, permease component 
TIGRFAM ID[TIGR02212] lipoprotein releasing system, transmembrane protein, LolC/E family 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGATTTG AACTTTTTGT TGCAAAACGC TATCTGAATT CCAAACGCAA AGGTCTTTTT 
GCGTTAATAA CCACAATTAT AGGCATAGCC GGCGTAACGG TAGGCGTAGC CGCGCTTATT
ACCACTCTTG CGGTTATGAC GGGCTTTCAA ACCGATATTA AAGAAAAGGT TATCGGTGCG
CAAAGCCATA TTCTTATTTT CGGGCATATG ACGGAAGCCG TTTACCAGGA TAAAATTAAA
AAGATTGAGC AATTACCCCT AGTTTATGCA GCGGCGCCTA ATATTTTCGG ACAAGGTATA
ATTACCCATA ACGGCAGTTC TTTGGCTATA GTTCTCCGTG GGCTTGAACC GGAAATGGAA
GATAAAGTAA ACCGCCTTAA CAGTTCTTTT GAGGAGGGTT CTTATGTTGC GCCTTTAAGA
GAGGGCGAAA CTTCGGCCCC GGCGCCTTTG GTTTTAGGCA CGGAGCTTGC CAATTCCTTA
AACCTTGAAG TAGGCGACGA TGTTGTTTTA ATCTCACCAT CATCAATATC CACAAGCGCG
GGCATGGTTC CTAAAATGAA AAAGTTTAGG ATTTCAGGCA CGATAAAAAC CGGATATTAT
GAATTTGACC GCACAATGGG CTACACTACG CTTGAGCATG CGAGTGAGTT TTTAAATTTA
CAAAAGGGCG CCACGGGTAT ATCCATACGT CTTAAAAATA TTGATAACGC CGAAAAAGCC
GCAAAACTTA TACGTCCTAT TATGGGCAAC GGTTTTAGTA TACGCACATT CGCGCAGTTA
AACGGCACCT TATACGCCGC GCTTAAATTA GAAAAAACTA TGATGTTTAT CATCCTTTCC
TTAATTATTT TAGTGGCATC GCTTAACATA GCGTCAAATT TAATCCTTTT AGGCACGGAG
AAATTAAAAG ATATCGGCAT TTTACGTGCC ATGGGCGCCA GCCCAGCCAG CATAAGAAAA
ATCTTTATCT ACGAAGGTCT TATGATAGGC ACGGCAGGCA TTGTGTGCGG CGTTATACTG
GCTATGATTT TATGCTGGAT TATCGCTACG TTTAATATTG TACAGTTGCC GGGGGATATT
TATTACCTTA CAAAAGTGCC TGTAAGAATA AGTTTAACGG ACATTCTGTC TGTAGTAGCG
GGCAGCTATT TACTTTGCTT TTTAGCGGCG GTTTACCCGG CTGTAAGAGC TTCTAAAGTT
AACCCGACGG ACGCGATAAG GTACGGATAA
 
Protein sequence
MRFELFVAKR YLNSKRKGLF ALITTIIGIA GVTVGVAALI TTLAVMTGFQ TDIKEKVIGA 
QSHILIFGHM TEAVYQDKIK KIEQLPLVYA AAPNIFGQGI ITHNGSSLAI VLRGLEPEME
DKVNRLNSSF EEGSYVAPLR EGETSAPAPL VLGTELANSL NLEVGDDVVL ISPSSISTSA
GMVPKMKKFR ISGTIKTGYY EFDRTMGYTT LEHASEFLNL QKGATGISIR LKNIDNAEKA
AKLIRPIMGN GFSIRTFAQL NGTLYAALKL EKTMMFIILS LIILVASLNI ASNLILLGTE
KLKDIGILRA MGASPASIRK IFIYEGLMIG TAGIVCGVIL AMILCWIIAT FNIVQLPGDI
YYLTKVPVRI SLTDILSVVA GSYLLCFLAA VYPAVRASKV NPTDAIRYG