Gene Msil_2203 details


Gene Information

Locus tag: Msil_2203
Symbol:
ID: 7093424
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 2381281
End bp: 2382495
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 63%
IMG OID: 643465523
Product: integrase family protein
Protein accession: YP_002362499
Protein GI: 217978352
COG category: [L] Replication, recombination and repair
COG ID: [COG0582] Integrase
TIGRFAM ID:
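
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


length_bp = gene_length(2381281, 2382495)
print(length_bp)                  # 1215
print(protein_length(length_bp))  # 404
```

Both values agree with the Gene Length and Protein Length fields reported for this locus.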


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 56
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTTAATA AGATCAACAG GTTATCCGCG CGCACCGTCG CGGCCCTAAC CAAACCCGGC 
CGTCATGCCG ATGGCGGAAA TCTCTATCTA AGAATCGAAC GCAGCGGCTC GAAACGCTGG
ACTTTCATGT ATGTCCAGGG CGGCCGGCAA AGGGAAGCCG GGCTCGGCTC GGTCGCCAGG
ATGCCGCTCG CAAAGGCGCG GGTTAAAGCC GGAGAGTTGC GCCAAATGCT CGCCGACGGG
ATTGATCCGC TCGCGGCCAA GCAGGCCGAG CGGGAGGCCC GGCAAGCAAT CGTCGAAGCG
GAACAAGCTC GGCGCACATT CGGCCAGGTC GCCGACAGCC TCCTCGCTGC CAAAGAGGCC
GGCTGGCGCA ACGCCAAACA TCGCGCGCAA TGGCGCATGA CCCTCGAAAC CTATGCGGCC
TCCCTTTGGA ATATGCCCGT CGAGGAGGTC GATACGCAGG CCGTTCTCGC CGCCCTGCAA
CCCGTATGGC AAGCAAAGCC TGAGACCGCA TCGCGGCTGC GCGGCCGCAT CGAGGCCGTG
CTCGACGCCG CGCGCGTGGC GGGCCATTCG GGAGCCGATC GGCCGAACCC GGCCCGATGG
AAAGGCCACC TCGACAAGCT GCTCCCCGCC CCCAAGAAGC TTTACCGCGG CCATCACGCC
GCAATGCCTT ACGGTGAGCT GCCCGAGTTC CTTGCGCGCC TTCGAAAGCG CCCCGCTGTC
GCTGCACTGG CGCTCGAATT TTTGATCCTG ACGGCCGCGC GCTCAAGCGA AGTTCTCAGC
GCGGAGTGGA GCGAGGTCGA CCTTGCAGCG AAGGTTTGGG TGATCTCGGC GCGACGCATG
AAAGGCGGCC GGGAGCATCG CGTGCCGCTT TCTAGCAGGG CGTTGGAGAT CCTCGAAAAC
CTCGCCAAGA CAAAAACGGG CGCCTTCATT TTTTCCGGCC AAGATTTCAG GCGTTCGTTA
TCATCCCATG CGTTTGTCAT GTTGCTGCGC CGCATGAAGG CCGATCATGT GACTGCGCAC
GGTTTTAGAA GCTCTTTTCG CGATTGGGCC GGCGACGCGA CAAGTTTTCC GCGGGAGATC
GCCGAAGCGG CGTTGGCGCA TGTAGCCGGC GATGCGACAG AGCTCGCCTA CCGTCGCGGC
GATGCGCTTG AGAGGCGGCG CCCGCTCATG GAGGATTGGG CTGCTTTTTG CCTAGGCCAT
AAACGCACTC AGTGA
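
A GC content figure such as the one in the Gene Information fields can be recomputed from a sequence laid out in space-separated blocks like the one above. This is a minimal sketch, not the database's own method; `gc_percent` is a hypothetical helper, and the short input is just the first block of the gene sequence.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace and line breaks are ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in bases)
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence only, for illustration.
print(round(gc_percent("GTGGTTAATA")))  # 30
```

Passing the full gene sequence, blocks and line breaks included, gives the percentage for the whole gene.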
 
Protein sequence
MVNKINRLSA RTVAALTKPG RHADGGNLYL RIERSGSKRW TFMYVQGGRQ REAGLGSVAR 
MPLAKARVKA GELRQMLADG IDPLAAKQAE REARQAIVEA EQARRTFGQV ADSLLAAKEA
GWRNAKHRAQ WRMTLETYAA SLWNMPVEEV DTQAVLAALQ PVWQAKPETA SRLRGRIEAV
LDAARVAGHS GADRPNPARW KGHLDKLLPA PKKLYRGHHA AMPYGELPEF LARLRKRPAV
AALALEFLIL TAARSSEVLS AEWSEVDLAA KVWVISARRM KGGREHRVPL SSRALEILEN
LAKTKTGAFI FSGQDFRRSL SSHAFVMLLR RMKADHVTAH GFRSSFRDWA GDATSFPREI
AEAALAHVAG DATELAYRRG DALERRRPLM EDWAAFCLGH KRTQ
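
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below illustrates this, assuming the standard codon assignments (which table 11 shares with the standard code) and treating the first codon as an initiation codon that is read as methionine; the names `CODON_TABLE` and `translate_cds` are hypothetical, and start-codon validation is deliberately omitted.

```python
from itertools import product

# Codon table in TCAG order; table 11 uses the standard amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(sequence: str, starts_at_initiation: bool = True) -> str:
    """Translate a coding sequence; whitespace is ignored."""
    bases = "".join(sequence.split()).upper()
    if len(bases) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[bases[i:i + 3]] for i in range(0, len(bases), 3)]
    if residues and starts_at_initiation:
        # Simplification: an initiation codon such as GTG is read as Met.
        residues[0] = "M"
    protein = "".join(residues)
    # Drop a single trailing stop codon, if present.
    return protein[:-1] if protein.endswith("*") else protein


print(translate_cds("GTGGTTAATA AG"))  # MVNK
print(translate_cds("AAACGCACTC AGTGA", starts_at_initiation=False))  # KRTQ
```

The two fragments are the first four codons and the last five codons of the gene sequence; their translations match the start (MVNK) and end (KRTQ) of the protein sequence.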