Gene Msil_3098 details


Gene Information

Locus tag: Msil_3098
Symbol:
ID: 7092776
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 3403128
End bp: 3404414
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 61%
IMG OID: 643466408
Product: integrase family protein
Protein accession: YP_002363369
Protein GI: 217979222
COG category: [L] Replication, recombination and repair
COG ID: [COG4974] Site-specific recombinase XerD
TIGRFAM ID:
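
The coordinates and lengths listed above are consistent with one another, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (the reading that matches the stated 1287 bp) and that the protein length excludes the terminal stop codon. The function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(3403128, 3404414)
print(length)                     # 1287
print(protein_length_aa(length))  # 428
```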


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 76
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAAAAA ACGTTCGCGA CAACGCCCTC GGGTCCCGTG CTTCGCGTGA AAAACTGAAG 
GCTTCCGGCA AGCCCTATTA TCGCTCGCTC GACGCCGGGC TTCTGCACCT CGGCTATCGG
AAAGGGCGGC ACGGCGGCAA ATGGGTTATG CGGCGCTATC TCGGAAACGA GAAGTACGAG
GTGGAGACCA TCGCCGTTGC GGACGATCGC GACGACGCCG ATGGAAAAGA CATCCTCACC
TTCAATGAGG CCCAGGCCAA AGCGAGAGAG ATCGCCAAAG CCCGGCGTCA GGAAGCCAGC
GGCGCGGCGC CGATCACGAT CTCGACGGTC CTCGACGCCT ATCTCAAGCA AGCCGAGGCT
CAGCATTCGA AATCGGTTTC CGACTCTCGC AATCGGATCG AAAACCATAT CCGCCCGGCT
TTCGGCGCCA TGCTGGCATC CGATCTGACA CAGGAAGCGA TCCAGAAATG GCTGAAGGCC
CTCGCCGACA GTCCGCGCAA TGTCCGCGGC AAGGCCGGAA CAGTGTCGAG AGCACTGGCC
AAGCCGAAGA CCGATGATGA AAAGCGTCGA CGCCGCGCCA GCGCCAATCG GACGCTGACG
ATCCTGCGAG CCGCGCTCAA TCAAGGTTTC CGTTCGGGCA AGATCACTTC GGACACCGTA
TGGCGGACCA TCCAGCCTTT CCGCGAGGTC GATGCGCCGA GGGTGCGCTA TTTCACCCAG
GATGAGGTCC GGAGGCTTGT TAATGCGGCT CAGGGCGAGT TTCGATCGCT GGTCAATGCC
GCGCTGTTCA CCGGCTGCCG ATACGGCGAG CTATGTCGCC TGCAGGTCGG CGATTTCAAT
CCAGACGCCG GGACCGTCTT CGTCGGGCAG AGTAAATCGG GCAAGGCGCG GCACGTCGTC
TTGACCGAGG AAGGACAAGG TTTCTTCCGT CAGCTAACTG CCGGCCGGCC GACCAACGCT
TTGATGCTTT CGAGGGCCGA TGGCGCTCCA TGGGGCGCGT CGCATCAGAT CCGGCCGATG
GCTGAGGCCT GCAAGGCCGC CAAGATCGCC AAGGCGGGCT TTCATATCCT CCGCCACACC
GCGGCGAGTC ACAATGTCAT GGGCGGCGTG CCGATGCCGG TCGTGGCGAA GAACTTAGGT
CACGCTGATT CGCGGATGAC GGAGAAGCAT TACGCGCACC TCGCGCCGTC CTATGTCGCC
GATCAGATCC GGCAATTTGC GCCGACGTTC GGAACGGTGC AGCAGACGAA CGTGGCGTTA
CTTCATAAAT CGACCAAGGC GAACTGA
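
The GC content reported in the gene information can in principle be recomputed from the gene sequence above. The following is a minimal sketch, assuming the sequence is pasted as a string with the 10-base groups and line breaks left in; `gc_percent` is a hypothetical helper name, and the short input shown is only the first 20 bases of the gene, not the full sequence.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First 20 bases of the gene sequence (9 of them are G or C).
print(gc_percent("ATGGCAAAAA ACGTTCGCGA"))  # 45.0
```

Applied to the complete 1287-base sequence, the result would be expected to round to the listed GC content.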
 
Protein sequence
MAKNVRDNAL GSRASREKLK ASGKPYYRSL DAGLLHLGYR KGRHGGKWVM RRYLGNEKYE 
VETIAVADDR DDADGKDILT FNEAQAKARE IAKARRQEAS GAAPITISTV LDAYLKQAEA
QHSKSVSDSR NRIENHIRPA FGAMLASDLT QEAIQKWLKA LADSPRNVRG KAGTVSRALA
KPKTDDEKRR RRASANRTLT ILRAALNQGF RSGKITSDTV WRTIQPFREV DAPRVRYFTQ
DEVRRLVNAA QGEFRSLVNA ALFTGCRYGE LCRLQVGDFN PDAGTVFVGQ SKSGKARHVV
LTEEGQGFFR QLTAGRPTNA LMLSRADGAP WGASHQIRPM AEACKAAKIA KAGFHILRHT
AASHNVMGGV PMPVVAKNLG HADSRMTEKH YAHLAPSYVA DQIRQFAPTF GTVQQTNVAL
LHKSTKAN
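
The protein sequence above corresponds to a codon-by-codon translation of the gene sequence. The sketch below shows one way to reproduce that, assuming the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, and this gene begins with ATG, so that difference is not handled here). `translate` and `CODON_TABLE` are hypothetical names introduced for illustration.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) paired with amino acids.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene give the first 10 residues of the protein.
print(translate("ATGGCAAAAA ACGTTCGCGA CAACGCCCTC"))  # MAKNVRDNAL
```

The final line of the gene sequence, which is in frame because every preceding line holds 60 bases, translates the same way to the protein's last eight residues followed by the TGA stop codon.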