Gene Msil_3533 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_3533 
Symbol 
ID7092390 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp3881217 
End bp3882779 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content66% 
IMG OID643466824 
ProductCHAD domain containing protein 
Protein accessionYP_002363784 
Protein GI217979637 
COG category[S] Function unknown 
COG ID[COG3025] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.0421561 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAAA TCGAACTGAA GCTCGCCGTT TCGGCCGATC CCGCGGCCCG CGCCCAAACG 
CTGGACGCCC TGAAGGACGC GATGGCCCGC GCCCGCGCGC AAACGCTCGA CTCCTGGTAT
TTCGACACCG CCGATGAAGC GCTGCGCGAT GGCGGATTCA CTCTGCGGGT CCGGCGAGCC
GGAAAACAGC TGCTGCAAAC GATCAAGCAG GACAGCGGAT CGGTTTCCCA GCGTGGCGAG
TGGGAGCGCC CCATCGACGG CAAGGCGCCG ACGCGCAGAC AAGGCGCGAG CGCCGCCTGC
GTCGATTTTG CGATGATTAG CGAGACGCCC CTTGCCGAAT TGGCCGAGGC GGCGCGTCCG
GATCTGCGCC AGGCGTTTCA CATCTCGGTC AAACGCGCCT TTCTCTCGCT CAGCGAAGAG
AATGCGCAGA TTGAGGCTGT CCTCGACAGC GGCGAGATCA CGGTCCCCGG ACCAACCGCC
GCCGAGGCTA TCTTCGAGGT CGAGCTCGAA CTGAAAAGCG GCGGCAAGAG CGCCGTCTAC
ACCTTGGCGC GCCGGCTGGC GGCGAGGGCC CCCTTGTCGA TCAGCCTGAT CAGCAAGGCC
GAGCGCGGCT ATCGCCTAGC GGCCGGAGCG TCGATGCGTC CGGCCAAGGG ATCGCAGCCG
CGCCTGGGCG ACGCGATGAC GGCCGGCGCC GCGTTCGAGG CTATCTGCAA TGTCTGCCTG
CACGACTTCA TGTTGAATGC GCGCCTCCTT ACGGCGCGGC CCGCGCCGGC CCATCCCGTC
GAGGCAATCC ATCAGGGCCG CGTCGCGCTG CGGCGCCTCA GGGCCGCGCT CGCTCTGTTC
CAGCCCATTG CTGGCGACGA ATATTTTGCT GCGGCGAACG ACGAATTGAA ACGCATGGCG
CGATTGTTTG GCGCTGCGCG AGATCGCGAC GTCATGCATG AAGCGGAAAT CAAGGCCGCC
CGCGGCGAGC TCACAGGCGA GGCGCGCGAA TTTGCCGCTT GGCGCGACTC CAAACGCCTG
GCCCTGCGCG CCGCCCTGAT CGAGGCGATC GAAGCAAAAC CCTGGCGGAT CTTTTTGATC
GATTTCTGCG AATGGCTGGG AAGCGGCGGC TGGCGGGCGA AAAAGGCCGA GCGCGCCGAC
ACGGCGAAAT TCATTCGCAA GCGGCTCGCC AAGCGGCGTA AGGCGCTCCT GCAACAGGGG
GAAAACCTCG AGGGCCTCGA TCCGGAGGCG CGCCACAAGG TGCGGATCGA CGCCAAGAAG
CTGCGCTATA TGGCCGAGTT CTTCATCGAC TGCCCGGAGG TCGCTGACAA AAAGAGCCTC
GGCGCGCTTT TGAAGCGCCT TGAGACGATC CAGTGGTCGC TCGGCGAGAT GCATGACGCC
GAAACGAGGC TGGATGCGGA CGAAGCGGAT CTTCGCCTTT GGCGTCAAGA AACCGGTCGA
GTTGAATCCG GCGAACTCGC TCTCGCCGAC GCGCCCCTCG CCGCGCCGGC CGAGGACGGC
CAGAAATGGC TCGGCGAGGC GCTGCGGGCC TTCGCCAAAC TCGCGAAGGA CGACCCGTTC
TGA
 
Protein sequence
MTEIELKLAV SADPAARAQT LDALKDAMAR ARAQTLDSWY FDTADEALRD GGFTLRVRRA 
GKQLLQTIKQ DSGSVSQRGE WERPIDGKAP TRRQGASAAC VDFAMISETP LAELAEAARP
DLRQAFHISV KRAFLSLSEE NAQIEAVLDS GEITVPGPTA AEAIFEVELE LKSGGKSAVY
TLARRLAARA PLSISLISKA ERGYRLAAGA SMRPAKGSQP RLGDAMTAGA AFEAICNVCL
HDFMLNARLL TARPAPAHPV EAIHQGRVAL RRLRAALALF QPIAGDEYFA AANDELKRMA
RLFGAARDRD VMHEAEIKAA RGELTGEARE FAAWRDSKRL ALRAALIEAI EAKPWRIFLI
DFCEWLGSGG WRAKKAERAD TAKFIRKRLA KRRKALLQQG ENLEGLDPEA RHKVRIDAKK
LRYMAEFFID CPEVADKKSL GALLKRLETI QWSLGEMHDA ETRLDADEAD LRLWRQETGR
VESGELALAD APLAAPAEDG QKWLGEALRA FAKLAKDDPF