Gene Msil_3857 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_3857 
Symbol 
ID7092553 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp4224857 
End bp4226125 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content71% 
IMG OID643467142 
ProductDNA protecting protein DprA 
Protein accessionYP_002364101 
Protein GI217979954 
COG category[L] Replication, recombination and repair
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake 
TIGRFAM ID[TIGR00732] DNA protecting protein DprA 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.113237 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGTCGG CGGCTGCTTT GACCCCGCAG GAGCGCTTCG CCTTTCTGCG GCTTTATCGC 
AGCGAAACGA TTGGCCCCCG CACTTTCGTG GCGTTGCTGG CCCGATACGG GTCGGCGCAG
GCGGCGCTTG AGGCCCTGCC GGGCGTCGTC GCCAGCGGCA AGGCGGGGCG GCCGATCGCC
CTGGCGCCCG TTCATGAGAT TGAGGCCGAG CTGGAGGCGA TCGAGCGCGC CGGGGCGACG
CTGATCTGTC TGCCCGAGCC CGACTATCCG GCGCTGCTGC GGCAGATCCA CTCGGCCCCG
CCGCTGCTTA CAATGCGCGG CGATCGCGCC TGCCTGAAAC GTTCGAAAAT TGCAATCGTC
GGCGCGCGCA ACGCCTCCGC CGCCGGCCTC GCCTTCACCG AACAGCTGTC GCGGGGAATC
GCGCGAGCGG GGTATGTCAT CGTCTCCGGG CTCGCGCGCG GGGTCGACGC GCGGGCGCAC
CAGGCCGCGC TGGCGACGGG AACGATCGCC GTCCTTGCCG GCGGACTCGG CAATATTTAT
CCGGCGGCGC ATGCCGAACT GGTGGAGCGC CTCATCGAGA CTGGCGCTGC GGTGAGCGAA
ATGCCGTTCG GATGGGAGGC GCGCGGGCGC GATTTTCCCC GCCGCAACCG CATCGTCTCG
GGGCTTTCGC GCGGCGTCGT CGTGGTCGAG GCCGCGCGCC GCTCGGGCTC GCTGATCACT
GCGGGCTTTG CCGCGGAGCA GGGGCGGGAG GTGTTCGCCG TGCCGGGATC GCCGCTCGAT
CCGCGCGCGG AAGGGCCGAA CCAGTTGTTG CGCGACGGCG CGACCCTCTG CACCGGGCCC
GAGGACGTGC TCGACGCGCT GGCCCGGCAG GATCTTTCGC CTCCTGCCGA TTTCAGCTTC
GCCGAGGCGC AGCCGCAATC CTACGAGTCG TTCTGGGACG AGCTCGATCT GCCGGATATT
TTCGCGGCGC AGGGCGGCGC GGCAAATGGC GCGGCGGAGC AAATCTCGTC GCGGCCCGCG
CCGGCTTCCT CGCGCCGCGC CGCTTTGCCG CCGCCGGCTG ATGAGCCGCC GCGCGAAGAC
GCGCCTGCGC CCTCCCGAGA AGCCGCCTTC GCCCGCGTCA TTGCGCTTTT AGGGCCCTCG
CCGGTTTCGG TCGACGAACT CGTTCGCGCC TCGGAAGCGC CGGCGCGGGA GGTGCGGGCG
ATCCTGTTCG AGCTGGAGCT TCAAGGCCGG CTGGAGCGCC ACGGCGCCGA TCTCGTGTCG
AAGATCTGA
 
Protein sequence
MTSAAALTPQ ERFAFLRLYR SETIGPRTFV ALLARYGSAQ AALEALPGVV ASGKAGRPIA 
LAPVHEIEAE LEAIERAGAT LICLPEPDYP ALLRQIHSAP PLLTMRGDRA CLKRSKIAIV
GARNASAAGL AFTEQLSRGI ARAGYVIVSG LARGVDARAH QAALATGTIA VLAGGLGNIY
PAAHAELVER LIETGAAVSE MPFGWEARGR DFPRRNRIVS GLSRGVVVVE AARRSGSLIT
AGFAAEQGRE VFAVPGSPLD PRAEGPNQLL RDGATLCTGP EDVLDALARQ DLSPPADFSF
AEAQPQSYES FWDELDLPDI FAAQGGAANG AAEQISSRPA PASSRRAALP PPADEPPRED
APAPSREAAF ARVIALLGPS PVSVDELVRA SEAPAREVRA ILFELELQGR LERHGADLVS
KI