Gene Emin_1229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_1229 
Symbol 
ID6263795 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp1329024 
End bp1330274 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content42% 
IMG OID642611707 
Productthreonine synthase 
Protein accessionYP_001876116 
Protein GI187251634 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0498] Threonine synthase 
TIGRFAM ID[TIGR00260] threonine synthase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGA TACTAAACTT ACAATGTATA AATTGCGGTA AAGAATATAA AACAAGCGCA 
ACGGACTTTC TTTGTTTATC CTGCGGGAGC AACCTGGAAG TCAATTATGA TTATAAGTTA
ATTTCTAAAC GTTTTAAAAT AGAGAATTTT AAAGATAAAA AACGCTTTGA TATGTGGCGT
TATATAGATT TGCTGCCCGT TAACGATTTT GATAAACTGC CCTATGTGCA AGTAGGCTGG
ACGCCGATGT ACGACCACAA AGAACTTGCA GATGAACTTG GTATATCAAA ACTTTTAATT
AAGGACGAAG GACGCAACCC CACAGGTTCA ATCAAAGACC GCGGCAGCGC GGTGTCTGTG
GCAAGAGCTT TGGAACTTGG ATTAGATATA ATAGCGGACG CTTCCACAGG CAACGCAAGT
GATTCTTTGG CCTGTTTAAC CGCCGGTTTA GATATTAAAA CAATTGTTTT TACAACCAAA
GATGCGCCGT ATCCCAAACT TACTCAGCTT TTTGTGTATG GGGCGGATGT CTTTACCGTA
GACGGCACTT ATGACGATGC TTTTGAGCTT TGCAAAAAAG CGGTTGAAGA ATACGGCTGG
TATTCCAGGG CGGCGGGGTA TAATCCTTTT ACAAGGGAAG GCAAGAAAAC ATGCTCGTTT
GAAATATGCG AGCAGCTCAA CTGGGAAGCC CCGGATAAAG TGCTTGTCGC CGTGGGCGAT
GGGACTATTT TAAGCGGTAT GTGGAAAGGT TTTGTTGATT TTCAAAAACT TGGTATTTTG
GAAAAAATGC CGCAAATGAT AGCTGTGCAA GCAGAAGGCA GCGACGCTAT AAAAAGAGCT
TTTGAAAACA AAGGCGAGGT TACCGCCGTT AAAGCGCATA CGATAGCTGA CAGTATTTTA
GTTAATTACC CGCGTGACGC GCAGCTTGCG GTTCAGGCTT TGCAGGAATC AGACGGGTAC
GCCGTTACGG TAACGGATGA AGAAATACTT GCCGCTATAC CCGAGTTTGC CAGAAAGGCC
AACATTTTTG CCGAACCGGC GGGCGCGGCT GTTTACGCCG CTCTTAAAAA ATTAGCGGAG
GAAGGTAAAA TAGAACAGGA TGAAACCGTT GCTATTGTTA TAGGCGGCAA CGGACTTAAA
GACACGTATT CTTACGCTAA AAACATACAG AAAGCGGAAG TAATTTCAAA AGATTTTGAA
GCATTTAAAA TAACGGCCAA AGAAAAAGGG CTTATAAAAA CAGATAAATA A
 
Protein sequence
MKKILNLQCI NCGKEYKTSA TDFLCLSCGS NLEVNYDYKL ISKRFKIENF KDKKRFDMWR 
YIDLLPVNDF DKLPYVQVGW TPMYDHKELA DELGISKLLI KDEGRNPTGS IKDRGSAVSV
ARALELGLDI IADASTGNAS DSLACLTAGL DIKTIVFTTK DAPYPKLTQL FVYGADVFTV
DGTYDDAFEL CKKAVEEYGW YSRAAGYNPF TREGKKTCSF EICEQLNWEA PDKVLVAVGD
GTILSGMWKG FVDFQKLGIL EKMPQMIAVQ AEGSDAIKRA FENKGEVTAV KAHTIADSIL
VNYPRDAQLA VQALQESDGY AVTVTDEEIL AAIPEFARKA NIFAEPAGAA VYAALKKLAE
EGKIEQDETV AIVIGGNGLK DTYSYAKNIQ KAEVISKDFE AFKITAKEKG LIKTDK