Gene Mext_4889 details

Gene Information

Locus tag: Mext_4889
Symbol:
ID: 5832508
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 5468358
End bp: 5469683
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 58%
IMG OID: 641370687
Product: integrase family protein
Protein accession: YP_001642328
Protein GI: 163854285
COG category:
COG ID:
TIGRFAM ID:
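
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is an untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 5468358
end_bp = 5469683

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "1326 441"
```

Under these assumptions the results agree with the listed gene length (1326 bp) and protein length (441 aa).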


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.467938
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value: 0.240701
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTCACTCA GAATCGCGCG ACCGTGGAAG AACCCTAAGA CGGGCGTTTT CCATCTACGG 
CAGCGAACGC CCCTCGATCT CGTCGGACGC TTGCAGGGAC AGTCCGTTAG CCTTCCTATT
GGCGATACGT GGGCGTCCAT CACCGTCGGT TCGATCGTCC AAGCATCATT GCGGACCAAA
GAACCGCGTG AGGCCCGTCA CCGTCATTCG ATCGCGGACG GCGCCCTTAA GCGATTCTGG
CAGGAGCAAC GCGCCGGTCC CTCTCTGAAC TTCGGTAACA GCGACGCTGA CGGGAGGACG
GGCGAGCATG TTCGGGCGAC TCTGCTCCAA CCGATGGCCC TCACTGCCGA ACGCGAATTG
ATCCAGGTGG CGCCTGTCTC GCAGCCGGCG CTTCTAACCG TGGACGATCT GTTCGAACGG
TGGGCAGCCT ACAATGCCGA CAAGAGGGCA CGGAACACCA TCAAACGGTA TCGTGCCAGC
TTCCGCTCCC TAGCCGCCTT TGCTCGCCGA CGTGACGCTC GAAGCCTTAA TGCCGACGAT
CTTTTCGCTT GGGCCGAGCG GCGCCGTGAT TTCGAGGGCG TCTCACCGCG TGCAATCAAC
AAAAACGATT TGGTTGCGGT CAGTTCAGTC CTTCAATGGG CGACGGGCCG ATCAGGTGGA
CGAATCCTGC CTGACAATCC AGCCAGGGGC CTATCTTTGG ATGAACCACG CGTCGTCGCA
CAACGTGAGA GAACGTTCCG AGAGCACGAG ATTACCGCCA TCCTAAGAGC GGCTTCTGCG
GTGGTGCAGG AACATGACAA CATCACCCGC TCGGCTGCGC GACGGTGGTG TCCGTGGTTG
GCGGCCTACT CAGGAGCACG GATCGCTGAA TTAACTAGTC TTGTCAGGCA GGACGTCCGC
ATCGACGGTG GCGCTCCTGT GATGGACATA CGCATCACGA AGACCGGCGA ACCGCGCACA
GTCCCGCTAC ATAACCACCT GATCGAACAG GGGTTCTTGA GATTTGTGGA AGCGTCAGAG
GTCGGACCGC TCTTTTTCGA CTTCAAGCGA CACAAGGCGA ATGCAGAAAC TTCTCCCGCA
GAGCAACAGG CTAAGGCGGT CGCGAAGTGG GTTCGAGCCA CTGTGAAGCT TGACCCTGGC
GTCGATCCCA ATCACGGCTG GCGACACACA TTCAAGACAA GGGCACTCGG AGCAGGGATA
GAAGAACGCT TACGTGACGC TATCACCGGT CATCGAGTCG CATCAGTCGG GCGAAGATAT
GAGACGCCTT CACTTTCAAT GCTTAGTGAT GCTATGAATA GATTTCCAGC ATATAATATT
CATTGA
 
Protein sequence
MSLRIARPWK NPKTGVFHLR QRTPLDLVGR LQGQSVSLPI GDTWASITVG SIVQASLRTK 
EPREARHRHS IADGALKRFW QEQRAGPSLN FGNSDADGRT GEHVRATLLQ PMALTAEREL
IQVAPVSQPA LLTVDDLFER WAAYNADKRA RNTIKRYRAS FRSLAAFARR RDARSLNADD
LFAWAERRRD FEGVSPRAIN KNDLVAVSSV LQWATGRSGG RILPDNPARG LSLDEPRVVA
QRERTFREHE ITAILRAASA VVQEHDNITR SAARRWCPWL AAYSGARIAE LTSLVRQDVR
IDGGAPVMDI RITKTGEPRT VPLHNHLIEQ GFLRFVEASE VGPLFFDFKR HKANAETSPA
EQQAKAVAKW VRATVKLDPG VDPNHGWRHT FKTRALGAGI EERLRDAITG HRVASVGRRY
ETPSLSMLSD AMNRFPAYNI H
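
The sequences above are printed in blocks of ten characters, so the spaces and line breaks need to be removed before any analysis. The sketch below shows one possible way to do that in Python and to compute a GC percentage. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the gene sequence is used as sample input, so the printed percentage describes that line alone, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence as shown on this page (six blocks of ten).
first_line = "TTGTCACTCA GAATCGCGCG ACCGTGGAAG AACCCTAAGA CGGGCGTTTT CCATCTACGG"
bases = clean_sequence(first_line)

print(len(bases))                    # number of bases in the sample line
print(round(gc_percent(bases), 1))   # GC percentage of the sample line only
```

To apply this to the full gene, the complete Gene sequence block could be pasted into a multi-line string and passed through the same two functions.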