Gene Msil_1014 details


Gene Information

Locus tag: Msil_1014
Symbol:
ID: 7091842
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 1100757
End bp: 1101938
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 64%
IMG OID: 643464353
Product: acetylornithine deacetylase (ArgE)
Protein accession: YP_002361345
Protein GI: 217977198
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases
TIGRFAM ID: [TIGR01892] acetylornithine deacetylase (ArgE)
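
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that start and end coordinates are inclusive and that the coding sequence ends with one stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1100757
end_bp = 1101938
gene_length_bp = 1182
protein_length_aa = 393

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: one codon per residue plus a single terminal stop codon.
expected_aa = span // 3 - 1

print(span == gene_length_bp)            # gene length agrees with coordinates
print(span % 3 == 0)                     # length is a whole number of codons
print(expected_aa == protein_length_aa)  # protein length agrees with gene length
```

Under those assumptions all three checks print `True` for this record.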


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.00488739
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAATGTGA TGCAGGAGCC TTCCGCCGCA AGGCTTGAGG CCGCGATCGC CATGCTGGAG 
CGGCTCGTCG GCTTCGATAC CGAAAGCTCG AAATCGAATC TGGCGCTGGT CGCGGCGGTG
GAGACTTATC TGCGCGAGTG CGGGGCCGAT TACGTCAAGA TCCCCAATGC GATTGGCGAC
AAGGCGGCGC TGTTCATCAC CATCGGGCCA AAGATCGATG GCGGCGTCGT GCTCTCCGGC
CATACCGACG TCGTGCCGGT CGAGGGACAA AGCTGGAGCA GCGATCCGTT CCAGCTGCGC
CGCGAGGAGG GCCGTCTCTA CGGCCGCGGC GCTTGCGACA TGAAAGGCTT CGATTCGATC
TGCCTCGCGA TGATCCCTGA GTTTCAGAAG GCGGCGCTGT CGCGGCCGAT CCATATTCTT
TTGAGTTACG ACGAGGAGAC GACCTGCCGC GGTTCGCTCG ACACGATCCG GCGCTTTGGC
GCCGATCTGC CGCGTCCCGG CGCAATCCTC GTCGGCGAAC CGACCTTGAT GCAGGTGGCG
GACGCGCATA AGGCGATCGT CACCTATCGC ACCATCGTTC ATGGCCATGA GGCGCATTCG
TCAAAACCCT ATCTCGGCGC CAACGCCGTC GAGACGGCCT GCGATCTCGT TACCGAGCTT
TATCGTTTCA ATGAGGAGCT TGGCCGGCGC GGCGATCCCT CGGGCCGGTT CGACCCGCCG
GCCTCGACCA TCGAGGTCGG CGTCATCCAT GGCGGCACGG CGCGCAATAT TCTCGCCAAG
CAATGCGCCT TCGACTGGGA GTTCCGGAGC CTGCCGGACA CGCCACAAAA CCTCGCTTTG
GCGCATCTTG AGAGCTATAT CGCCCGCGTC GCGCTGCCGA ATTTGACCCG CAACGCCAAG
GACGCCTCGA TTGAGACCTT CGCCGAGGTC GAGGTGCCCG GCCTTGGACC CGCGCCGGGC
TCCGTGGCGG AGTCGCTTGC GCTAAAGCTT GCGCGATCGA ACAGCACCAT CAGCGTGCCC
TATGCGACGG AAGCCGGACA GTTCCAGGCA GCCGGCGCGC CGACCGTGGT GTGCGGCCCC
GGCTCGATCG ACCAGGCGCA TCAGCCGGAT GAGTTTTTGG AGATCGCGCA GGTCGAGGCC
GGCATAGAAT TCATGCGGCG GCTGGCGAAG GAGCTGAGCT GA
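
The GC content field can be recomputed from a gene sequence printed in 10-base blocks like the one above. This is a minimal sketch, assuming GC content is the percentage of G and C bases over all bases; `gc_content` is a hypothetical helper name, and the record does not state how its own figure was calculated or rounded.

```python
def gc_content(seq):
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (the 10-base block separators and line breaks) is ignored.
    """
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, pasted with its block spacing intact.
first_line = "ATGAATGTGA TGCAGGAGCC TTCCGCCGCA AGGCTTGAGG CCGCGATCGC CATGCTGGAG"
print(round(gc_content(first_line), 1))
```

Pasting the full gene sequence in place of `first_line` gives a value that can be compared with the GC content field.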
 
Protein sequence
MNVMQEPSAA RLEAAIAMLE RLVGFDTESS KSNLALVAAV ETYLRECGAD YVKIPNAIGD 
KAALFITIGP KIDGGVVLSG HTDVVPVEGQ SWSSDPFQLR REEGRLYGRG ACDMKGFDSI
CLAMIPEFQK AALSRPIHIL LSYDEETTCR GSLDTIRRFG ADLPRPGAIL VGEPTLMQVA
DAHKAIVTYR TIVHGHEAHS SKPYLGANAV ETACDLVTEL YRFNEELGRR GDPSGRFDPP
ASTIEVGVIH GGTARNILAK QCAFDWEFRS LPDTPQNLAL AHLESYIARV ALPNLTRNAK
DASIETFAEV EVPGLGPAPG SVAESLALKL ARSNSTISVP YATEAGQFQA AGAPTVVCGP
GSIDQAHQPD EFLEIAQVEA GIEFMRRLAK ELS
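
The protein sequence corresponds to the gene sequence translated codon by codon. The sketch below illustrates this on the first and last blocks of the gene. It assumes the amino acid assignments of translation table 11 match the standard genetic code and ignores the alternative start codons that table permits, which does not matter here because the gene starts with ATG. `translate` and `CODON_TABLE` are illustrative names.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAATGTGA TGCAGGAGCC TTCCGCCGCA"))  # MNVMQEPSAA
print(translate("GAGCTGAGCT GA"))                     # ELS
```

The two outputs match the first ten and last three residues of the protein sequence above.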