Gene Emin_1329 details

Gene Information

Locus tag: Emin_1329
Symbol:
ID: 6263546
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 1430438
End bp: 1431478
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 42%
IMG OID: 642611808
Product: radical SAM domain-containing protein
Protein accession: YP_001876216
Protein GI: 187251734
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0502] Biotin synthase and related enzymes
TIGRFAM ID:
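
The start, end, and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the record does not state this, but it is consistent with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Coordinates copied from the record above.
start_bp = 1430438
end_bp = 1431478

# Assumption: 1-based, inclusive coordinates, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A complete CDS should be a whole number of codons.
assert gene_length % 3 == 0

# Assumption: the last codon is a stop codon and encodes no residue.
codons = gene_length // 3
protein_length = codons - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.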


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.00000017523
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCGTGAAA TAATAAAAAA GGCCGCCAAA ACAAATAACC TTACGGAAGA AGAAATAACG 
CTGCTTTTAG AAAATTCCTC TTTTAACGGG GAGCTTTTTG CCGCGGCCGA TTTTACGCGC
AAGCAAAATG TGGGTGACGG CGTGCATCTG CGCGCTTTAA TAGAATATGG GAATATCTGC
CAAAACAACT GTTTTTACTG TGGCATAAGG GCCGCTAAAA AAGATGTAAA AAGATACCGC
CTGGACACGG AAACCACGCT AAAAGCCGCC GCTTTGGCAA AAAACCTGGG TTACAAAACA
ATTGTTTTGC AATCAGGTGA GGAAAACGCC GCTCCTTTAA ATGAATTTTT GCAAATTATA
AAAGAAATTA AGAATATGGG CCTTGCCCTT ACTTTAAGCA TTGGTGAAAA AACTTACCAA
GAATATCTTG CTTACAGAGA AGTCGGCGCG GATAGGTTTT TACTGCGTAT TGAAACAACG
GACGAAAATT TGTACCAAAC ACTTCACCCT GGTATGAATT TGCAAAACAG GCTGCGCTGC
CTTAAGGATA TAAAAAAGCT GGGTTATGAA ACAGGCACAG GCATAATGGT AGGGTTGCCG
GGCCAGACGG CAAAATCAAT AGCGAAAGAT ATTTTATTTT TTAAAGAGCT AGACGCCGAC
ATGCTTGGCA TAGGGCCGTT TATCCCATGC CCCGGCACCC CTTTGGAAAA TGAAAAGGGC
GGCAGTTTGG AAACAGCTTT AAAAGTTATG GCGATATCAC GCCTTATTAT GCCAAAAATA
AATATCCCGG CCACAACAGC TATGGAAGCT ATTGAAAAAA ACGGACGGAT AAAAGCATTG
CAAAGCGGAG CAAATGTAAT AATGCCAAAT GTTACACCAC AAAACGAGCG CAAAAATTAC
GCCCTTTATC CCGGAAAACC GGGCATTTTG CAAACTCCTG AAGAGTTCTT AAATAGCCTT
AAGCAAACGC TTAGCCAAAT AGGCCGTTTT GTATCGCAAG ACGCGGGCAT GAGTTTAAAC
TACCGTCCTA TAGAAAAATA G
 
Protein sequence
MREIIKKAAK TNNLTEEEIT LLLENSSFNG ELFAAADFTR KQNVGDGVHL RALIEYGNIC 
QNNCFYCGIR AAKKDVKRYR LDTETTLKAA ALAKNLGYKT IVLQSGEENA APLNEFLQII
KEIKNMGLAL TLSIGEKTYQ EYLAYREVGA DRFLLRIETT DENLYQTLHP GMNLQNRLRC
LKDIKKLGYE TGTGIMVGLP GQTAKSIAKD ILFFKELDAD MLGIGPFIPC PGTPLENEKG
GSLETALKVM AISRLIMPKI NIPATTAMEA IEKNGRIKAL QSGANVIMPN VTPQNERKNY
ALYPGKPGIL QTPEEFLNSL KQTLSQIGRF VSQDAGMSLN YRPIEK
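
The gene and protein sequences above are related through translation table 11. As a minimal sketch, the code below builds the codon table from the standard NCBI amino-acid string for that table and translates a space-separated sequence block. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical and are not part of any database tooling. Table 11 shares its codon assignments with the standard code and differs mainly in which codons may act as starts; this gene begins with ATG, so no special start handling is needed here.

```python
from itertools import product

# NCBI translation table 11 amino acids, codons ordered with bases T, C, A, G
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and the final line of the gene sequence, copied from the record.
first_block = "ATGCGTGAAA TAATAAAAAA GGCCGCCAAA"
last_line = "TACCGTCCTA TAGAAAAATA G"

print(translate(first_block))  # MREIIKKAAK
print(translate(last_line))    # YRPIEK
```

The two printed fragments match the beginning and end of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_percent` gives a way to compare against the Protein sequence and GC content fields.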