Gene Emin_0547 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0547 
Symbol 
ID6262750 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp599150 
End bp600358 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content40% 
IMG OID642611018 
Producthypothetical protein 
Protein accessionYP_001875439 
Protein GI187250957 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAAT TACTTGGATT ACTTCTGGTC CTTGCTTTAG TAGTACCTGC TAAAGCAGAT 
ATACTTAAAA ACTTAAAATC AACCGGTGAG ATACAAGTCA TCGGCGATTC CGTTAACAGA
GAATTTATGG GCCCCGGCGG TTCATACAGC AATATTACAC TCAGAGTTCT CTATGGTCTT
AATTTTGACC TGGCTGAAGA TGTTAAAGCT AACTTAACAA TGGCATACTA CAATATGTGG
GGCCAAAACA CGGGCAATGG TTTTATGAGT ACATACCATA CAGGCCGCCC GTTTCAGGAT
TATCTTAACG AAGTTGACCT TATTGAAGCA AATGTCGTTT TAAGCAACCT TTTTGACTGT
TTGGAAGCCA AGGTAGGCCG CCAGTTTTAT GGTGATGAAG ACAGCGCCAT AATATATTTG
GGTCCTAATC ACTACAATAC AAGACAGTTG GATTACAGGC AGGCTAAGTC AGTTGACGCG
GCTGTAATCT CCTACGCGGG TGAAATGGTG TCATGGGGCT TTATCTATTC TAAAGTTAAT
GAATTAGATA CTGCTGCGAA TTTGGACGCT ACCATTCTCG GTTTAGATGT GAAAGCAAAC
GTAAATGATA ACTTTAAAGC CCAAGGTTAC TTTTATAATT TCAGAAATGA CGATAGCTCT
AGCTTTAGCA GTGGTACTAC TGAAAAATAT TTAGGCATTT ACGGCGCGAA AGGCACATTT
AACGCTGATA TATTCACCTT ATCAGCGGAA TACGCCAGAA ACGTAGGCGG GGAAGATGCT
TTTGATCATG ATAAAGGCAG CTTACTTAAA ATTGATGCTT CCGTTGATCT CGGCGCGTTT
ACGCCCAGAG GCACAATCGT TCGTGCTGAA AACTTCAGAT CATACGGTAA CTACAGACCT
GGTATTATTG TAGGCCAGGA AATTGAAAGC ACTACTACAA TGCCACAGGA TTTATTTTTT
GAAGATATTT TTGTAGGCAA CTTAGGCGTT GACATGAAAT TCGCGGCTTT AGATAAATTT
GTATTCTCGA TAGACGGTTT TGCTTTTAAT GACAGAAATA TTAAAGACTC AACCTCCTAT
GAAGCGAATG CTATGGCGAA ATACAATATG AACCCTAACG TTGAACTTCA TGTAGCTGTA
GGCGGCTTCC ATGAACATTC AATAGATAAT GTTTATAAAG CGCAAGGCGG CATGCTGATA
AGATTCTAA
 
Protein sequence
MKKLLGLLLV LALVVPAKAD ILKNLKSTGE IQVIGDSVNR EFMGPGGSYS NITLRVLYGL 
NFDLAEDVKA NLTMAYYNMW GQNTGNGFMS TYHTGRPFQD YLNEVDLIEA NVVLSNLFDC
LEAKVGRQFY GDEDSAIIYL GPNHYNTRQL DYRQAKSVDA AVISYAGEMV SWGFIYSKVN
ELDTAANLDA TILGLDVKAN VNDNFKAQGY FYNFRNDDSS SFSSGTTEKY LGIYGAKGTF
NADIFTLSAE YARNVGGEDA FDHDKGSLLK IDASVDLGAF TPRGTIVRAE NFRSYGNYRP
GIIVGQEIES TTTMPQDLFF EDIFVGNLGV DMKFAALDKF VFSIDGFAFN DRNIKDSTSY
EANAMAKYNM NPNVELHVAV GGFHEHSIDN VYKAQGGMLI RF