Gene Msil_2988 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_2988 
Symbol 
ID7093482 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp3299535 
End bp3300548 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content64% 
IMG OID643466298 
Productdihydroorotate dehydrogenase 
Protein accessionYP_002363261 
Protein GI217979114 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0167] Dihydroorotate dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.101094 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCTCC GCACCCGCTA TCTCGGCCTT TCTCTGCGCA CCCCGCTGAT CGCGTCGGCG 
TCGCCGCTCT CGGGCGATGT CGGACTCATT CGACAAATGG AAGATTCGGG CGCAGGCGCC
GTAGTGCTGC CATCGCTGTT CCAGGAGCAG ATCGAGGAGG AGGCGCGAGC AGCCGATGAG
CTCGCAAGAA TCGGCGCCGA CAGCTCTCCG GAAGCAAGCT CCTATTTTCC GGCGGTCGTT
ACGTATAATT CGGGACCGCA CGGCTACCTC GATCTCGTCG CCCGAGCGCG CGCCGCCGTC
GACATCCCCG TTATTGCAAG TCTCAATGGA ACAACCGTTG CCGGCTGGGT CGATTATGCA
AGGCTGATCG AACAGGCCGG AGCGACAGCC CTCGAACTCA ACATCTATCG GATCGCGTCC
GGGCCCGGCG TGACGGGCGG GCAGGCCGAG GCCGATTGCG TGGCGCTGCT CGAAGCCGTC
CGCAGCCGGG TCAAACTTCC CGTGGCCGTC AAGCTGCATC CCTACTTCTC GGCGTTCGGC
GATTTCGCCC AGCAGCTCGA TCACGCAGGC GCCGACGGGC TGGTTCTCTT CAATCGCCTC
TACCAGCCCG ATATCGACCT CCTTCGCCTG GCCTGGAAAA ATGACGCGAC GCTGAGCGGC
GCGGGCGAGA TCCGGCTTGG CCTGCTCTGG CTCTCCGTTC TCTCGGGCCG GTTGCCGCAT
GCCTCGCTTG CCGCGGGCAC GGGCGTCGAT ACCGCCGAGG AGGTGATTAA ATACATTCTT
GCGGGCGCGA ATGCCGTGAT GACGGCCTCG TCGCTACTGC GGCATGGACC CGGGCATCTG
CGCACGCTTG TCTCGGGTTT GGAGACGTGG CTGAGCACAA GAGGCTTTGC TTCGGTCAGC
GCGATCACGG GATTGATGCG GCCTTCTCAT CCGGACTCAG AGGCCGAGGC GGACGAGCGC
GGCAGCTACA TCGAGAGTTT GTCGAGCTAT CAGGGTCCGT ATGTTCGCCA TTGA
 
Protein sequence
MDLRTRYLGL SLRTPLIASA SPLSGDVGLI RQMEDSGAGA VVLPSLFQEQ IEEEARAADE 
LARIGADSSP EASSYFPAVV TYNSGPHGYL DLVARARAAV DIPVIASLNG TTVAGWVDYA
RLIEQAGATA LELNIYRIAS GPGVTGGQAE ADCVALLEAV RSRVKLPVAV KLHPYFSAFG
DFAQQLDHAG ADGLVLFNRL YQPDIDLLRL AWKNDATLSG AGEIRLGLLW LSVLSGRLPH
ASLAAGTGVD TAEEVIKYIL AGANAVMTAS SLLRHGPGHL RTLVSGLETW LSTRGFASVS
AITGLMRPSH PDSEAEADER GSYIESLSSY QGPYVRH