Gene Msil_3807 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_3807 
Symbol 
ID7090735 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp4169671 
End bp4170795 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content69% 
IMG OID643467092 
Productdihydroorotate dehydrogenase 
Protein accessionYP_002364051 
Protein GI217979904 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0167] Dihydroorotate dehydrogenase 
TIGRFAM ID[TIGR01036] dihydroorotate dehydrogenase, subfamily 2 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.0649715 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCCCG CGCTCAACCG ACTGGTCGCC TGGATCGGCG CGGCGGCGAC GCCGCTCCTG 
CGGCAATTGG ATGCCGAGAC CGCGCATCGG CTGACGATCC GGGCGCTGGC GCTCTATCCC
GCAACAGGCG CAGCGCCGGA CGATCCGCGC CTCGCCGTCA CGGCGTTCGG CCTCCATTTT
CCCAATCCGG TCGGCCTTGC CGCTGGCTTT GATAAAAACG CCGAGGCGGT CGATGCGATC
CTCGCGCTTG GCTTCGGCTT TGCCGAAGTC GGCACGATCA CCCCGCTGCC GCAGCCGGGG
AATGCGCGGC CGCGCCTGTT CCGGCTGACG GCGGATGAAG CGGTGATCAA CCGCTTCGGC
TTCAACAGCG AAGGCGCCGC GGCCGTACGG GCGCGCCTGG CCAAGCGTGG CGTCCGCCGG
GCGGGGGTGC TCGGCGTCAA CGTCGGCGCC AACAAGGATT CAGCGGACCG CACGGCGGAT
TATGTGCGGG CGATTGCGCA GCTGGCGGCG CCCGCGGATT ATCTCACCGT CAATATTTCG
TCGCCGAATA CGCCGGGCCT GCGCGATCTC CAGCACGCCG CCGCGCTCGA CGATCTGCTG
GCGCGGATTC TTGACGCGCG CGATGAATTG ATCTCGGCCT GCGGCCGCAA GCCGGTGCTT
CTCAAAATCG CGCCCGACCT GACGCTCGAC GAACTCGACG CGATCATAGT TTGCGCCAGA
CGCCGCGCCA TCGACGGGCT GATCGTGTCC AACACGACGC TGTCGCGTCC CTCCGGCCTG
CGCGAGGCCG CTCTGGCGCG AGAGCAGGGC GGCCTGTCCG GTCGGCCGCT GTTCGATCTG
TCGACGCGGA TGCTGGCGGC GGCCTTCTTG CGCGCCGAGG GGGCGTTTCC GTTGGTTGGC
GCAGGAGGCG TCGACAGCGC CGAGCGCGCC TTCGCCAAGA TCGAAGCGGG CGCGAGCCTC
GTGCAGCTCT ATTCGGCGCT GGTTTTCAAG GGGCCGGGGC TTGCCGACGC CATCAAGCGC
GGCCTCGTCG CAACGCTGGA GCGCCGCGGC CTGCCCGCTA TTTCGGAGGC AATCGGGCGG
AGGGCGAAGG ATTTTGCCGC GGGAACGGCG GGGCCCGTTC CCTGA
 
Protein sequence
MAPALNRLVA WIGAAATPLL RQLDAETAHR LTIRALALYP ATGAAPDDPR LAVTAFGLHF 
PNPVGLAAGF DKNAEAVDAI LALGFGFAEV GTITPLPQPG NARPRLFRLT ADEAVINRFG
FNSEGAAAVR ARLAKRGVRR AGVLGVNVGA NKDSADRTAD YVRAIAQLAA PADYLTVNIS
SPNTPGLRDL QHAAALDDLL ARILDARDEL ISACGRKPVL LKIAPDLTLD ELDAIIVCAR
RRAIDGLIVS NTTLSRPSGL REAALAREQG GLSGRPLFDL STRMLAAAFL RAEGAFPLVG
AGGVDSAERA FAKIEAGASL VQLYSALVFK GPGLADAIKR GLVATLERRG LPAISEAIGR
RAKDFAAGTA GPVP