Gene Emin_1237 details


Gene Information

Locus tag: Emin_1237
Symbol:
ID: 6263660
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 1336605
End bp: 1338206
Gene Length: 1602 bp
Protein Length: 533 aa
Translation table: 11
GC content: 41%
IMG OID: 642611715
Product: AMP-dependent synthetase and ligase
Protein accession: YP_001876124
Protein GI: 187251642
COG category: [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID:
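
The length fields above are consistent with the coordinates if Start bp and End bp are read as 1-based, inclusive positions on the replicon. That reading is an assumption, since the record does not state its coordinate convention. A minimal Python sketch of the arithmetic, with hypothetical helper names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS: codons minus the terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(1336605, 1338206)
print(length, protein_length_aa(length))  # 1602 533
```

Both results match the Gene Length and Protein Length values listed in the record.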


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 64
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTAGAA AAAAAGATTT ATACACCGCC CTTTCAGACA GCGCTTACAG ATATCCCGAA 
CGTGTGGCTT TTGTGTTTGA AGGCAAAAAA TATACATTTC ACGAGCTTCA AATTCTTGTA
GATAAATGCG CCGATATGTT TTGGACCTAC GGTATAAGAA AAGGCGATTC TGTAGCCATA
GCCCATAAAA ACTCAATTTG GTTTGTTATA ACTTCTTTTG CTTTGTATAA ATTAAGCGCA
ATAGCGGTGC CCATAAATTT TATGATTTCA AAACCGGAAG AAATCAAATT TATTATTGAG
GACAGCGGCG CCACAATGGT AATTTTACAA AACGAATTCC TGCGCGCTTA TAAAAAAACG
GCTGAAATAG CGCCCTCGCT TAAATATTTT TTCTGTTCAG ATTACCCCGA AAATAATGAG
GATGAAAGAG TAAAAGATTT ACAAAAAGAA ATTGAAAACA GCAAAATACA GTCTGAAATC
TTAGAACATA AACCCAGCCT TGAGGATAAC GCTTTTATTC TTTACACCTC AGGCACAACC
GGCGCCCCGA AAGGAGCGGT GGTAACACAC GGCAATTTAG CGGCAAATAT TATTTCCTGC
GCGCAGGTAT TTAGAATAGC GGGAGACGAC GCCATGATTT GCCTTCTTCC CATGTTTCAC
ACCTTCGCTT GGATGACCTG CGTTATCCTT CCCATTTACC TGGGCTTAAA ATCCGTTATC
GCGCCAAGCA TTACCCCTCC TTCAGCCTGG CTGCACTTAA TGGGAGTTGA AAGAGTAACT
CTTTTTATAG CAATACCTCA AATATTTTAT ATTCTTGCGA AAGAAGCGCG CGGCATTAAA
AGACTTTACC TGCAATATTG GGCGTTTAGA AAGGTGCGTT TTTGCATTTC GGGCGCGGCG
CCGCTTAACA AAGAATCCCA GGATCATTTT GAAAAAAACC TCGGTATTCA ACTTCTTGAA
GGTTACGGCC TAACGGAAAC AAGCCCCGTT ATAAGCGTTA ACCTTGAGGA AAAAAATAAA
AAAGGCTCCG TAGGTCCGGC TCTTCCCAGC GTTAAAGTGG TAATATTAGA CGACAATGAG
AATGAGCTTC CCAGAAATGC GGAAGGTGAA ATTTCGGTAA AAGGCCCCAA TGTTTTTAAA
CAGTACCATA ATAATCCCGA AGGGACAAAA GAGGCTTTTA GCAAAGAAGG CTGGTTCAAA
ACGGGCGATA TAGGCCTTGT TGACGATGAG GGTTTTATCT TTATTAAAGA CAGAAAAAAA
GACATGATTA TTATAAAAGG CCTTAAAGTT TTTTCCGCCC AGGTGGAGGC TACCATTATG
CAATTCCCCG GTATTGAGGA ATGCGCCATT ATAGGCGTGC CCGACGGCCG CGGCGGCGAA
TTTATTAAAC TTTACGCCGT AAAAGCACCG GGCGTTGATT TTAATGAAAC AGCTTTCAGG
AAGTTTTTGA AAACTAATTT AGACAACTAC AAACGCCCGC GTGATATTGA GTTTATGACG
GAGCTTCCTA AAAACTCTTT AAGAAAAATA TTAAAACGAG AGCTTAGAAA AGACGCCGTG
GAAAAACTAA AAGAACGTAC CGTCGCCCCG GCGGAAGAAT AG
 
Protein sequence
MIRKKDLYTA LSDSAYRYPE RVAFVFEGKK YTFHELQILV DKCADMFWTY GIRKGDSVAI 
AHKNSIWFVI TSFALYKLSA IAVPINFMIS KPEEIKFIIE DSGATMVILQ NEFLRAYKKT
AEIAPSLKYF FCSDYPENNE DERVKDLQKE IENSKIQSEI LEHKPSLEDN AFILYTSGTT
GAPKGAVVTH GNLAANIISC AQVFRIAGDD AMICLLPMFH TFAWMTCVIL PIYLGLKSVI
APSITPPSAW LHLMGVERVT LFIAIPQIFY ILAKEARGIK RLYLQYWAFR KVRFCISGAA
PLNKESQDHF EKNLGIQLLE GYGLTETSPV ISVNLEEKNK KGSVGPALPS VKVVILDDNE
NELPRNAEGE ISVKGPNVFK QYHNNPEGTK EAFSKEGWFK TGDIGLVDDE GFIFIKDRKK
DMIIIKGLKV FSAQVEATIM QFPGIEECAI IGVPDGRGGE FIKLYAVKAP GVDFNETAFR
KFLKTNLDNY KRPRDIEFMT ELPKNSLRKI LKRELRKDAV EKLKERTVAP AEE
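
The two listings can be cross-checked against each other and against the GC content field. The sketch below is one way to do that in Python, not the database's own method: it strips the 10-character block spacing, computes percent GC, and translates with the standard codon assignments. NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in its permitted start codons. The helper names are hypothetical, and only the first row of the gene sequence is used as input.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove block spacing and line breaks; normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Assumes the sequence starts in frame. An alternative start codon
    (for example GTG or TTG) is not converted to M here.
    """
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_row = "ATGATTAGAA AAAAAGATTT ATACACCGCC CTTTCAGACA GCGCTTACAG ATATCCCGAA"
print(translate(first_row))  # MIRKKDLYTALSDSAYRYPE
```

The printed residues match the first two blocks of the protein sequence above. Passing the full gene sequence to `translate` and `gc_content` would extend the same check to the whole record.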