Gene Emin_1508 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_1508 
Symbol 
ID6263938 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp1601122 
End bp1602270 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content41% 
IMG OID642611996 
Productlipid-A-disaccharide synthase 
Protein accessionYP_001876393 
Protein GI187251911 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0763] Lipid A disaccharide synthetase 
TIGRFAM ID[TIGR00215] lipid-A-disaccharide synthase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0000162544 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCTAACA TTGTGCCTAC AAGCAAAAAT ATTTTAGTTG TAGCGGGTGA TGTAAGCGGT 
GATTTGCACG CAAGCAATCT TGTGCGTGAA ATTAAACGCA TTAATCCCAA CGTAAAGATT
ACCGCTTTAG GCGGTAAAAG ATTGAAAGAA ACGGCGGATA ATTTTTTATT TGATTTAGCT
TCAAAAGGCG CAAGCGGCTT TGTGGCCCCT TTAGTTAAAC TGCCGCTATG GATAAAACTT
TTAAAAATGG TTCGCGGGTA TTTGGATTCA GAACAGCCAG CGTGTGTTAT AGTTGTTGAT
TTTTACGGTT TTAACTCGCA GGTTTTGGGC ATGGCAAAGC ACCGTAATAT TCCTTGTTAC
TATTACGTAG CTCCGCAAGT TTGGGCAAGC AGGCACAACC GTACCAAAAC CATAGCGTCC
AGCACAAAAA AAGTTATTAC AATATTTCCT TTCGAACCGG CGTTCCATGC CAAATACGGC
AGCAACGCCG TATTTTTGGG AAACCCGCTT TTAGATATTG TTCCGCAGCC TAAAGAACAT
GTTTTTGACG GCACTTTCCG TTTGGGTATT TTGCCCGGCA GCAGAGTAGG CGAGTTAACG
AAACACACTG ATTTATTTTA CAAAACATTT AAAGAAGTTC AAAAAATATT TCCCAATACA
AAGGCGTATT TGTTTTGCGT GCCTGAGTTT AGCGATGAGT TTTACCTGTC TTTAATTAAA
GACAGCAATC CGCAGGTTAC GCTTGTGCGC GAAACGGACT ATAAAGAGCG CGGCAACATG
GATTTCCTTA TAACCTGTTC CGGCACCGCC ACTTTGGAAA ACGCGCTTTT AGGCGTGCCT
ATGCTGGTAG CTTACAAAAT GTCCTCCATC ACTTTTAAGG TGGCTAAAGC GGTTATAAAA
GTGCCCTACA TATCGCTTGT TAATATTTTA GCGGGTAAAG AAGTGGTAAA GGAATTTATA
CAACATTTTG CCACGGCAAA AAATTTATCC GCGGAAGTTA TGTCTTATTT CCAAAATCCG
CAAAAAACAA AAAAAATGCG CGAGCAACTT TTGAACATAA GGAAAACATT GGGTGATCCG
GGTGTTGCCA AAAGAGCTGC CGAGCTTATT TTAAACGACG TATTCGCTAA CAAAAAACAG
GTTTTATGA
 
Protein sequence
MPNIVPTSKN ILVVAGDVSG DLHASNLVRE IKRINPNVKI TALGGKRLKE TADNFLFDLA 
SKGASGFVAP LVKLPLWIKL LKMVRGYLDS EQPACVIVVD FYGFNSQVLG MAKHRNIPCY
YYVAPQVWAS RHNRTKTIAS STKKVITIFP FEPAFHAKYG SNAVFLGNPL LDIVPQPKEH
VFDGTFRLGI LPGSRVGELT KHTDLFYKTF KEVQKIFPNT KAYLFCVPEF SDEFYLSLIK
DSNPQVTLVR ETDYKERGNM DFLITCSGTA TLENALLGVP MLVAYKMSSI TFKVAKAVIK
VPYISLVNIL AGKEVVKEFI QHFATAKNLS AEVMSYFQNP QKTKKMREQL LNIRKTLGDP
GVAKRAAELI LNDVFANKKQ VL