Gene Emin_0620 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0620 
Symbol 
ID6262828 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp675379 
End bp676503 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content41% 
IMG OID642611091 
Producttetraacyldisaccharide 4'-kinase 
Protein accessionYP_001875512 
Protein GI187251030 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1663] Tetraacyldisaccharide-1-P 4'-kinase 
TIGRFAM ID[TIGR00682] tetraacyldisaccharide 4'-kinase 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTTAG AAAAAACAAG AGATAATTTA AAAAACAATG TTTTCGGGCG GTTTTTTTTA 
TATGTGCTTT CAAAAGGATA TGAGCTCGGA ACCATAGTTA ATAAATTTTT ATATGAAAAC
GGCTGGCGTA AAAGTTACAG CGTAAACACG CGTGTTGTTT GTGTGGGTAA TATTACAGCG
GGCGGCACCG GTAAAACCAC GGCGGTGCTT CTTGCCGCGC GTACTTTGGC TGAGGCGGGT
ATAAGAACCG CAATAATTTC CCGTGGGTAT AAAAGAGATA AAAAAAATAA AAATCCCGTA
GTCTTGTTTG ACGATGAGCT TGAAAACAAC TGGGTAACCG CCGGCGATGA ACCTTTTATG
ATGAGCCGCG CATTGGCTGA CGTAAAAGTG CCCATAGTAA TTCACGAGGA CAGGCACCTT
GCCGCTACCG AAGCTCTAAA AAGATTTAAA AGCCAGGTTT TACTTCTTGA CGACGGGTTT
CAGCACTTCC GTTTAAAAAG GGATGCTAAC ATTGTTCTTA TTGACGCCAG AAATCCTTTT
GGAGGGGGGC AGCTGCTCCC GTACGGTACT TTAAGAGAGG GGCTCTCAGG TTTAAAAAGA
GCCAATCTTG TTTTATTAAC GCACAGCAAT CAGGCTGACC AGCGTAAAAA AGAAGATATA
AAGGACCAGA TACGCCTTCA AAACGAGGAT ATTGAGATTT TGGAAGCAGT GCACCAGCCT
GAGCATTATT TTGATATCTG CAATTCCGTA AAGGTGCCTT TAAACCATTT AAAAGGCGAA
GCGGGGGTAT TTTCAGCCAT AGGAGAACCC GGCGGCTTTG AAGATACGTT AAAAGATTTG
GGACTTAAAC TTGTTAAAGT CTGGCGTTAT CCCGACCACA GAAGATATAC TGAAGAAGAT
CTTAAAACTT TTGTTGATTT GGCGGGGGAA AACCCTTTGG TTACCACTTT TAAGGATTTT
GTTAAATTTC CGGAAAACTG GCGGGATATT TTAAAGAAAA ACGTGTATGT TCTTTCCGTC
AGCATGAAAA TAAAAGGTAA AAAAGAATTT GATATTTTTG CCGAAGCGCT ATATCCCAAA
TTTACAAATT TGAATGTTAA AAAGGAAAGC AAAAGCCGCA AATAG
 
Protein sequence
MDLEKTRDNL KNNVFGRFFL YVLSKGYELG TIVNKFLYEN GWRKSYSVNT RVVCVGNITA 
GGTGKTTAVL LAARTLAEAG IRTAIISRGY KRDKKNKNPV VLFDDELENN WVTAGDEPFM
MSRALADVKV PIVIHEDRHL AATEALKRFK SQVLLLDDGF QHFRLKRDAN IVLIDARNPF
GGGQLLPYGT LREGLSGLKR ANLVLLTHSN QADQRKKEDI KDQIRLQNED IEILEAVHQP
EHYFDICNSV KVPLNHLKGE AGVFSAIGEP GGFEDTLKDL GLKLVKVWRY PDHRRYTEED
LKTFVDLAGE NPLVTTFKDF VKFPENWRDI LKKNVYVLSV SMKIKGKKEF DIFAEALYPK
FTNLNVKKES KSRK