Gene Emin_1307 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_1307 
Symbol 
ID6263995 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp1408855 
End bp1409862 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content38% 
IMG OID642611786 
Productglycosyl transferase family protein 
Protein accessionYP_001876194 
Protein GI187251712 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones81 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAAAAG TAAGTATTAT AATACCTGTT TATAACTTAG AAAATTATCT TCCACAATGC 
CTGCAATCGG TAGAACAGCA GACTTTGGAA GATATAGAAG CGCTTGTGGT TGATAACGCC
AGCACGGACA ACAGCGCTGA AATAATAAAA CAATTTGCCG CTTTAAACTC CAAAATAAGA
ATTTTACACT GTAAAACAAA AGGCGCCGCT AACGCCAGAA ACTGCGCCCT TAAAGAAGCC
TCCGGTGAAT ACTTATTCTT TTTAGACGGA GACGATTGGC TTACCCCGCA ATGTCTTGCC
GCTTTATATA AAGAAGCTAA AGCAAATGAC GCAGATGTTA CTGTTTGCGA CAACGCCCTT
TATACAGAAA CAACCAATTT AATGTCTTTC CCGCAGGAAA ATATGTTTTT TTCAGCCCCT
AAGCTTGAAA CTTTAAAAGA AAAAAGCCTT CTGTTAAAAG CCCCGTTTAC AGCATATTCC
TGCGCGGGCA AACTTATAAG AAGAAGCTTT TTTGAGAAAA ATAGCCTGTC TTTCCCTTCA
GAAATGCCCC GGGGCGACGA CTGGCCCGTT TCCATGAAAA TCACCGTGCT TGCCAACAGA
ATAAAACTTG TGCCCAATGA ATACTATTTT TACAGAGTTG GCAGACAAAA CGCCGAAAGC
GCAAATCTGA GCGCTTTTAA CTCTTACATT TACGCTTCCA GGCTGAATTA TAAATTTTTA
AAAGAAGCGG ACGCCTACGA AACTTTTGCC CCGCAGTTTG AATATTTAAG AATGTATTAC
ATTTTGTCTT TTATGGCTTT GCATAAACTT GATAAAGAGC AAAAAGCCGC GCTTTTAACA
CTTCGTAAAG ATATTTTATC AATACCGCTT TCGGTGTTTG AAGGGCGCGA ACTTAAATTT
AAACTGTCTT TTTTAGGTTT AAAAATTTGT ATATTATGTA AAATCACTTT ATACGCGGAT
ATGATAAATT TTATATATGC GCGTTTAAAA GGTAAAAAAA TATCATAA
 
Protein sequence
MPKVSIIIPV YNLENYLPQC LQSVEQQTLE DIEALVVDNA STDNSAEIIK QFAALNSKIR 
ILHCKTKGAA NARNCALKEA SGEYLFFLDG DDWLTPQCLA ALYKEAKAND ADVTVCDNAL
YTETTNLMSF PQENMFFSAP KLETLKEKSL LLKAPFTAYS CAGKLIRRSF FEKNSLSFPS
EMPRGDDWPV SMKITVLANR IKLVPNEYYF YRVGRQNAES ANLSAFNSYI YASRLNYKFL
KEADAYETFA PQFEYLRMYY ILSFMALHKL DKEQKAALLT LRKDILSIPL SVFEGRELKF
KLSFLGLKIC ILCKITLYAD MINFIYARLK GKKIS