Gene Mnod_5109 details


Gene Information

Locus tag: Mnod_5109
Symbol:
ID: 7302286
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium nodulans ORS 2060
Kingdom: Bacteria
Replicon accession: NC_011894
Strand:
Start bp: 5167893
End bp: 5169155
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 70%
IMG OID: 643602739
Product: integrase family protein
Protein accession: YP_002500258
Protein GI: 220924956
COG category: [L] Replication, recombination and repair
COG ID: [COG0582] Integrase
TIGRFAM ID:
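
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are inclusive and that the reported protein length excludes the stop codon. The variable names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5167893
end_bp = 5169155

# Assumption: coordinates are inclusive, so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1263
print(protein_length)  # 420
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields of the record.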


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTGGGGC GCCGATGGGG CGAATGGTGG GGTGCGGTGG GGCGGGAGGT CAACAAGCTC 
TCGGCGCGGC GTGTCCAGAC GCTGACGGAG CCGGGCCGAC ACGCTGATGG CGGCGGCCTC
TATCTTGTCG TGGATCCGGC CGGCGGCAAG CGCTGGGTCT TTCTCTATCG GATGGCCGGC
AGGCGCCGCG AGATGGGCCT CGGGCCCGTC CTGTCCGTGC CGCTCGCCCG CGCCCGCGAG
CTCGCCGCTG AAGCCAGGGC TCAGGTCGCT GGTGGGGTCG ATCCGATTGA CGCCAGGCGC
GCCCCACCAA GCGATGAGCC GCCACCCCAC CGGATCACGT TCGCCGAGGT CGCCGAGGTC
TACATGACCG ATCGCGAGCG GGCGTGGCGC AACGCCGCGC ATCGCCGGCA GTGGCGGCAG
ACCCTTGAGG TGCAGGCGGC ATCGCTCTGG GCAATGCCGG TCGCGGATGT CAACACGGAA
GCCGTGCTGG CGGTGCTGCG ACCGATCTGG CACAGCAAGG CCGAGACGGC GCGCCGGCTG
CGCGGCCGCA TCGAGCGCAT CCTGGATGCC GCCCGGGTGG CAGGACACCG CGGGCCCGAA
AACCCCGCCC GGTGGAAGGG GCATCTCGAC GTCCTGTTGC CGCGAGCCGG TAAGCTCCAG
CGCGGGCATC ACAACGCGCT CCCATATATG GAAGTGCCGG CCTTTGTCGC CGAGATCCGC
CAGCGCGAGG CGCAGACCGC GCGTGCGCTT GAGCTGCTGA TCCTCACGGC GGCCCGGTCC
GGCGAGGTCC GCGGCATGAC CTGGGCCGAG GTGGATCTGG TGGGCGCGCT CTGGACTGTG
CCGAAGGAAC GGATGAAGGC GAAGCGGCCG CATCGCGTGC CGCTCTGTGC CCGCGCTGTT
GAGATCCTGT CCAAACTACA CTTGGAGGCC CCGGACGCGG AGGGTCTGAT TTTCCCAAGC
CGCAATGACA CGGTGCTGTC CGATATGGTG TTCGCGGCCC TGCTGCGCCG GGCGAAATAT
CCAGACATCA CCGCCCATGG CTTCCGGTCG TCATTTCGAG ATTGGGCCGC GGATGAGACT
GACCATCCGC GCGAGGTGAT CGAAGCGGCC CTGGCGCACA TGGTGGGAGA CGCCACCGAA
CGGGCCTATC GGCGGGGCGA CGCGCTGGCG AAGCGCCGCC TGCTGATGGA TGATTGGGGC
GCCTATGTCT GCGGCGGAGC ATCCGCGGCC GAGCCGCCAA TGAGCGCGAC GAGATCGCCG
TGA
 
Protein sequence
MVGRRWGEWW GAVGREVNKL SARRVQTLTE PGRHADGGGL YLVVDPAGGK RWVFLYRMAG 
RRREMGLGPV LSVPLARARE LAAEARAQVA GGVDPIDARR APPSDEPPPH RITFAEVAEV
YMTDRERAWR NAAHRRQWRQ TLEVQAASLW AMPVADVNTE AVLAVLRPIW HSKAETARRL
RGRIERILDA ARVAGHRGPE NPARWKGHLD VLLPRAGKLQ RGHHNALPYM EVPAFVAEIR
QREAQTARAL ELLILTAARS GEVRGMTWAE VDLVGALWTV PKERMKAKRP HRVPLCARAV
EILSKLHLEA PDAEGLIFPS RNDTVLSDMV FAALLRRAKY PDITAHGFRS SFRDWAADET
DHPREVIEAA LAHMVGDATE RAYRRGDALA KRRLLMDDWG AYVCGGASAA EPPMSATRSP
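
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. The following is a minimal sketch, not the database's own pipeline: the helper names `clean`, `gc_content`, and `translate` are hypothetical. The codon table is built by hand from the amino acid assignments of the standard genetic code, which translation table 11 shares; table 11 also allows alternative start codons, which this sketch does not handle. That limitation does not matter here because the gene starts with ATG.

```python
# Amino acid assignments of the standard code, with codons ordered by T, C, A, G.
# Translation table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence shown above.
first_blocks = "ATGGTGGGGC GCCGATGGGG CGAATGGTGG"
print(translate(first_blocks))  # MVGRRWGEWW
```

The printed peptide matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the reported GC content.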