Gene Emin_0047 details

Gene Information

Locus tag: Emin_0047
Symbol:
ID: 6263261
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 49347
End bp: 50960
Gene Length: 1614 bp
Protein Length: 537 aa
Translation table: 11
GC content: 43%
IMG OID: 642610510
Product: hypothetical protein
Protein accession: YP_001874952
Protein GI: 187250470
COG category: [S] Function unknown
COG ID: [COG1543] Uncharacterized conserved protein
TIGRFAM ID:
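
The length fields above agree with the coordinates if the coordinates are read as 1-based and inclusive. A minimal sketch of that arithmetic follows; the inclusive convention and the single terminal stop codon are assumptions, and the variable names are illustrative rather than part of the record.

```python
# Values copied from the record above.
start_bp = 49347
end_bp = 50960

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1614 537
```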


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.666843
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 94
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAGAAA AAGGTTATTT AGCTTTAGTC CTTCACGCGC ATTTGCCGTT TATCAAACAC 
CCGGAATATC CTGACTTTTT GGAAGAAGAC TGGTTTTTCG AAGCGATGGT TGAAACATAC
CTTCCGCTTT TAAATATGTA TGAAAAGTTA ACTGCTGAAG GTGTTGATTT CAGAATAACA
ATGTCCTTAA CGCCGCCGCT TTGCGCTATG ATGAGCGACC CGCTTTTAAT AAGCCGCTTC
AGATATTACC TTAACGCCAG AATAGAATTA AGCCAAAAAG AGCTTGTGCG CACAAAAAAC
ACAGAGTTCC AGTATGTGGC GCAAATGTAC GCAGATAAAT TTGCAAGATT TAAAGATTTG
TTCGAAAACT ATTACCACGG CAATATTTTG GAAGGTTTTA AAAAATTCCA AGATATGGGC
AAATTAGAAA TTATAACCTG CTGCGCTACG CACGGCTATT TGCCTCTGCA GGTGCATAAG
GAAAGCGTTA ACGCACAAAT TAAACTGGCG GCGGACGATT ATAAAAAACG CTTCGGCAGG
CAGGCAAGAG GTATTTGGTT GGCAGAATGC GCTTATAACC CAGGCGACGA TAGGTTTTTA
AAAGCCAATG GCATAAGGTA CTTTTTTACA GAAACGCACG GCATTTTACA CGGTGTTCCA
CGCCCTAAAT ACGGTATTTA CGCGCCGGTT TACACGCCCA GCGGAGTAGG CGTTTTCGCA
AGGGATATGG AAAGCGCCCA GCAGGTCTGG AGCGCGGAGT CCGGCTACCC AGGCGACCAG
TCTTACAGAG AATTTTACCG TGATTTAGGC TATGACCTTG ATTATGATTA CATTAAGCCA
TACCTTCACA GCGACGGCGT GCGCAGAAAT ATAGGCATGA AATACCACCG TATTACAGGC
AAAGTTTCTT TAAGCCAAAA AGACACTTAT TACCCTTCGG ACGCGAAAAG TAAAGCCGCA
GAACACGCGG GCAACTTTAT GTTTAACCGC CAGAAACAAA TTGAGTACCT ATCCACTTTA
ATGGACAGAA AACCTTTGGT AGTTTCAATG TATGACGCCG AGCTTTACGG CCACTGGTGG
TATGAAGGCG TTGATTTCCT TGAATATCTG TTTAAAAAAC TGCATTATGA CCAAAGTGAC
ATTAAGCTTA TAACACCTTC GGAATATTTA TCTAAATATC CGGAAAACCA GGTTGTTGGG
CCCAGCGCGT CCTCATGGGG CGACAAGGGG TACAACGATG TTTGGCTTAA CAGCGGTAAT
GACTGGGTTT ACAGGCACCT TATTAAAGCG GCTGAACGCA TGATGGAAAT GGCTAATTAC
TACCCTAACG CGGAAGGCCT TCTGAAAAGA GCCTTAAACC AATGCGCCAG AGAGCTTGTG
CTTATGCAAT CCTCCGACTG GGCGTTTTTA ATGACGGTAG GCACCGCGCA GCAGTATTCG
ACAAAGCGCA CAAAAGAACA TATACAGCGC TTTAATGAAT TATACGAGCA AATTAAAAAC
AACAGAATCG ACGAAGCTTA TATCTACGGA CTTGAAACAA AGGACAGTAT TTTCCCTGAG
ATTGACTACA AGGTTTATAT GTCTGAACTT AAAGATAAAG CTTTAGCTTC CTAA
 
Protein sequence
MEEKGYLALV LHAHLPFIKH PEYPDFLEED WFFEAMVETY LPLLNMYEKL TAEGVDFRIT 
MSLTPPLCAM MSDPLLISRF RYYLNARIEL SQKELVRTKN TEFQYVAQMY ADKFARFKDL
FENYYHGNIL EGFKKFQDMG KLEIITCCAT HGYLPLQVHK ESVNAQIKLA ADDYKKRFGR
QARGIWLAEC AYNPGDDRFL KANGIRYFFT ETHGILHGVP RPKYGIYAPV YTPSGVGVFA
RDMESAQQVW SAESGYPGDQ SYREFYRDLG YDLDYDYIKP YLHSDGVRRN IGMKYHRITG
KVSLSQKDTY YPSDAKSKAA EHAGNFMFNR QKQIEYLSTL MDRKPLVVSM YDAELYGHWW
YEGVDFLEYL FKKLHYDQSD IKLITPSEYL SKYPENQVVG PSASSWGDKG YNDVWLNSGN
DWVYRHLIKA AERMMEMANY YPNAEGLLKR ALNQCARELV LMQSSDWAFL MTVGTAQQYS
TKRTKEHIQR FNELYEQIKN NRIDEAYIYG LETKDSIFPE IDYKVYMSEL KDKALAS
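
The gene and protein sequences above can be cross-checked against each other and against the reported GC content. The following is a minimal sketch in plain Python, not the tool that produced this record. It assumes the sequence is pasted in the 10-base grouped layout shown above, and it relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Codon table built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base groups of the gene sequence above.
fragment = "ATGGAAGAAA AAGGTTATTT AGCTTTAGTC"
print(translate(fragment))  # MEEKGYLALV
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_fraction` on the same input can be compared with the GC content field.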