Gene Emin_0487 details


Gene Information

Locus tag: Emin_0487
Symbol:
ID: 6262729
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 524183
End bp: 525367
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 37%
IMG OID: 642610957
Product: glycosyl transferase group 1
Protein accession: YP_001875380
Protein GI: 187250898
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
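
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the end coordinate includes the stop codon. Neither is stated in the record, though both are consistent with its numbers. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 524183
end_bp = 525367

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1185, matching "Gene Length"
print(protein_length)  # 394, matching "Protein Length"
```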


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000304875
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 59
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAATA GAGATAACAG AGTAAAAATT TTATATATAA TTACGCGTTT GGACGCCGGC 
GGCGCCCAAA AAAGCGTGTT GTATTCCGCC GCCAATCTGT CTAAAAATAA ATTTAAAGTT
TTCTTAGCGG CAGGACCGGG CGGAGTGCTG GATCCTTTCG CAAAAAAACT TCTTAAAAAT
AAATATTTTT ATATCAATAG CCTTAGGCAG CGCGTTTGCT TTTATAACCT TTTTTATGAT
TTGGTATCTT TATTTCAAAC GGCCTGGCTT ATAATAAAAA TCAGGCCGCA TATAATTCAT
ACAAACTGTC CTAAGGCGGG TGTAGTAGGG CGAGCGGCGG CTTTTTTAAC CGCTCCTAAA
ACAAAGGTTA TACACACTTA TCATGGCTTA GGTTTTAGCG TCTATGGCGG TATAAAAAGA
TATTTATTTT ATTCTAAAAT TGAAAAATAT TTTTCTTTTA TAACGGACCA GTTGGTTTTT
GTATCAAATT CAAATATGCA AGAAGCGCTT ACGCTTGGCA TAGGCAACGT AAAAAAAAAT
ATTCTTATTT ATCCCGGAGC TGAGTTTGAA AAGTTAAAAC CATCTTTTGA TTATAATGCC
AAACTTGAAT CGCTGCGTAT TCCTAAAGGG GCAAAGGTCA TATTAAGCAT AGGTAATTTT
AAACCTTTAA AAAACGCCCG CGATTTTGTG CTTGTGGCTA AACATGTTTT AAAAAAAATT
CCCGGAGCAT ATTTTCTTTA CGCTGGCTGT GGAGGGATGG AAGAACGCAA AGTAAAAACG
CTTGCTAAAA AATCAGGACT TAAAAATCAT TTGTTTTTTT TAGGAATGCG GCATGATACC
CGTGAATTGT TGGCTGTAAG CGATTTGTAT GTTTCAACTT CTCTGCGTGA AGGCATGCCT
GTTGCTTTGC TTGAAGCTTT GGGCGCGGGC GTGCCGGCTG TTTGTTATGA GGCTGACGGC
ACCGCCGAGG TTTTGATAAA CGGCAAAAAC GGTTTTATTT TAGGCCAGCG AAACAAAGAA
GGAATGTCAG ATAAAATAAT TGAGATTTTA AAAAACGATA AAATTTATTT CACTATCAAA
CAAGGCGTAA AAAGTTTTGA TAAAAATCTT TTTAGCGCGG TTTCCACCGT CAGAAAGCAA
GAAGAATTGT ATAATAAAAT ACTGCTTAAA AACCCGGGTT CTTAA
 
Protein sequence
MKNRDNRVKI LYIITRLDAG GAQKSVLYSA ANLSKNKFKV FLAAGPGGVL DPFAKKLLKN 
KYFYINSLRQ RVCFYNLFYD LVSLFQTAWL IIKIRPHIIH TNCPKAGVVG RAAAFLTAPK
TKVIHTYHGL GFSVYGGIKR YLFYSKIEKY FSFITDQLVF VSNSNMQEAL TLGIGNVKKN
ILIYPGAEFE KLKPSFDYNA KLESLRIPKG AKVILSIGNF KPLKNARDFV LVAKHVLKKI
PGAYFLYAGC GGMEERKVKT LAKKSGLKNH LFFLGMRHDT RELLAVSDLY VSTSLREGMP
VALLEALGAG VPAVCYEADG TAEVLINGKN GFILGQRNKE GMSDKIIEIL KNDKIYFTIK
QGVKSFDKNL FSAVSTVRKQ EELYNKILLK NPGS
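
As a rough check that the protein sequence follows from the gene sequence, the sketch below translates the first 60 and last 15 nucleotides of the gene. It uses the amino acid assignments of the standard genetic code, which translation table 11 shares for sense and stop codons. The `translate` helper and the variable names are hypothetical, written for this example, and the sketch does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

# Codon order follows the usual compact layout: first base varies
# slowest, third base fastest, each over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace is ignored so the 10-base blocks of the record can be
    pasted in directly.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First and last stretches of the gene sequence, copied from the record.
first_60_nt = "ATGAAGAATA GAGATAACAG AGTAAAAATT TTATATATAA TTACGCGTTT GGACGCCGGC"
last_15_nt = "AACCCGGGTT CTTAA"

print(translate(first_60_nt))  # MKNRDNRVKILYIITRLDAG
print(translate(last_15_nt))   # NPGS
```

Both outputs agree with the start and end of the protein sequence listed above.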