Gene Emin_0095 details


Gene Information

Locus tag: Emin_0095
Symbol:
ID: 6263629
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 99705
End bp: 100931
Gene Length: 1227 bp
Protein Length: 408 aa
Translation table: 11
GC content: 36%
IMG OID: 642610557
Product: hypothetical protein
Protein accession: YP_001874998
Protein GI: 187250516
COG category:
COG ID:
TIGRFAM ID:
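
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon (both assumptions are consistent with the numbers in this record but are not stated by it). The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(99705, 100931)
print(length, protein_length_aa(length))
```

For this record the two values come out as 1227 bp and 408 aa, matching the Gene Length and Protein Length fields.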


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value: 0.0764001
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGATA AAATATGGAT TTTTAACGTA GATGATTTTA TTTATAAATC ACAAGCCCTT 
GCCGTTTTAA AAGAGCATGC GCTTACCTAT TTGCAAATCT TAAAACCTAA CGATATGTGC
ATAGTTTCAC CTTTTACCGA AAAGGATGAA TATTTTACGG CTTATACAGC AGCTATAAAA
GGGCTTAAAC ATAAAAAATG GATGTTCCAG CCAAAGAAAC ATGCCAAGGA TGAAAGTCTT
ATCCAGGCTA TTATGAAGGA TAAGGCGCTT GTCGGTAAAA TAAAAAATTT CTGCAAACAC
GGTTTTGTGT TAATACCTTT AAAATATACG GAAGAGTTTC AAAAGCTAAG TAAAATGTGC
GGTAATAAAC TTCTTAACAA CAGCATAGCC ATACAGGAAG CCAATAATAA GCTGCGCTTT
AAAAAGCTTT GCAAGGAATT CGGCATCCGC ACCATGCAGC CCGTTTTTGA AAGAAAGGAC
GGTAAGGACA AATCTCATAT ACTTGCTTTT ATTAACGCCA ATGAAACTTA TTTGTTAAGG
CGGCCTTCAT CAATGGGCGG CTACGGCAAT ATTGCCGGTA AAATAACAGA TTTGCTGCCC
TTAATGCGCC TTTTTAATAA AGACGGCGAT TTTTTTATAG AAAAATATAA AAAAGTTGAA
AAAACTTTGG GCTCTCTTTG TATATTAAAA GACGACGGCG CAGCCTTCGT TGGCATTGAC
TGCCAGTTCC CGCACAGGGA AGGGTGGGAG GGAAGATTTT TTCCTTTTAA AAAATTTGAC
AAAAAGATTT TAGAAAGGAT AAAAGAAAAA TCAATGTTGA TGGCTGAATA TTTCCATAAA
AAAGGCGTGC GCGGGCAGGT TAATTTTAAT TGGGCTATAA CAATAAAAGA TGGTGAATAT
AAATTGAGAG GCCTTGAATG TAACTCCAGA TATAACGGTT TTGGTTTATG TTTAAGGCTT
GCAAAAACTG TTTTTGACAT ACCGCAGGAA AAGCTGCATT TTTATTTAGA CACTAATATC
GCTATTGACG AATCCTACAC CACAAAAGAT ATTATTAAAA TTATTTTTAA AATAAACACC
GGCGTTAAGT TTAAGGGAGG CATAGTTCTT ACTTCTGCCG TGAAAAACGG ACGGATGGCT
ATGTGTTTTA TTTCAACAAG CCCTAAAAAC GTTCAGCTGT TAAGGCTGGC TTTAAAAAGG
GCTGTTAAAA ATATCGATTT AAACTAA
 
Protein sequence
MADKIWIFNV DDFIYKSQAL AVLKEHALTY LQILKPNDMC IVSPFTEKDE YFTAYTAAIK 
GLKHKKWMFQ PKKHAKDESL IQAIMKDKAL VGKIKNFCKH GFVLIPLKYT EEFQKLSKMC
GNKLLNNSIA IQEANNKLRF KKLCKEFGIR TMQPVFERKD GKDKSHILAF INANETYLLR
RPSSMGGYGN IAGKITDLLP LMRLFNKDGD FFIEKYKKVE KTLGSLCILK DDGAAFVGID
CQFPHREGWE GRFFPFKKFD KKILERIKEK SMLMAEYFHK KGVRGQVNFN WAITIKDGEY
KLRGLECNSR YNGFGLCLRL AKTVFDIPQE KLHFYLDTNI AIDESYTTKD IIKIIFKINT
GVKFKGGIVL TSAVKNGRMA MCFISTSPKN VQLLRLALKR AVKNIDLN
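
The gene and protein sequences above are linked by translation table 11, and the GC content field is a simple base count. The sketch below shows one way to reproduce both from the 10-base blocks as printed, assuming the gene sequence is given in coding orientation. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). The helper names `clean`, `gc_percent`, and `translate` are hypothetical, not part of any database tool.

```python
from itertools import product

# Codon-to-amino-acid assignments shared by the standard code and table 11,
# listed in the conventional T, C, A, G order ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence as printed in this record (60 bases).
first_row = "ATGGCGGATA AAATATGGAT TTTTAACGTA GATGATTTTA TTTATAAATC ACAAGCCCTT"
print(translate(first_row))
```

Translating the first row gives MADKIWIFNVDDFIYKSQAL, which matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_percent` would give the value to compare against the GC content field.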