Gene Mext_4026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_4026 
Symbol 
ID5833628 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4474001 
End bp4476268 
Gene Length2268 bp 
Protein Length755 aa 
Translation table11 
GC content66% 
IMG OID641369817 
ProductTonB-dependent siderophore receptor 
Protein accessionYP_001641467 
Protein GI163853424 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4774] Outer membrane receptor for monomeric catechols 
TIGRFAM ID[TIGR01783] TonB-dependent siderophore receptor 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.865466 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.196913 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAAACG CTCGCCAGAG CCGCGCTGTG CTGCGGCCGA TGCTGCTTGG CTCCGTCGCC 
TGTACAAGCA TCGCTGCGGC TTTCATCCAT CCGGCGCGGG CGCAAGTCAC GGACATCAAC
ACCGGCCTGC CGACCGCTCA GGCGGTCCAT CCGCTCACCG CGTTTCCCTC GGTCGGCGGT
GTCACCCTCG ATATGATCAG CGTCGCCGGT TCCGGATCCG GGCGCGGCCT CGTCGTCGAC
AGCTCGGGTT CGCAGGTCGG CTACCTCGCC CGACGCCTGC GCTCCTCGAC CAAGACCGAC
ACGCCGCTGG TCGACACGCC GCAGGCGATC TCGGTGGTGA CGGAGGCGCA GATCCGCGAC
CAGAACGTGC AGAGCATCGG CGAGGCGCTC CGCTACGTCC CCGGCGTCGC CATCGCCCAG
GGCGAGGGCC ACCGCGACGA GATCCTGATC CGCGGCCAGC GCACGACGGC CGACTTCTTC
GTCAACGGCA TCCGCGACGA CGCGCGCTAC TTCCGCGACC TCTACAACAC CCAGCGCATC
GAGGTCCTGA AGGGCCCCAA CGCGATGATC TTCGGCCGCG GCGGCGGCGG CGGCGTGGTC
AACCGCGTGC TCAAGGAAGC CGACGGGGTG CCGGTCCGCG AGGTGCTGGT GCAGGGCGGC
CAGTTCGGGA ATAAACGCAT GGCGGTGGAT CTCGGCGACC GCGTCTCCGA CAGCGCCTTC
TTCCGCCTCA ACGGCGTGTT CGAGGATACC GGCACCTATC GCGACTTCAT CGACATCCGC
CGCTACGGCG TGAACCCGAC GATGACCTTC CTGCTCGGGC CGCAGACGAC GCTGCGCCTG
TCCTACGAGT TCTTCTCCGA CAACCGCATC GCCGACCGCG GCATCCCCTC GCAGTTCGGG
CGCCCCTGGC GCTATCGCGA GAACACCAGC ACGCTGTTCG GCGCGCCCTT GATCTCGAAC
GGCTTCGTCG ATGCCCATAT CGGCAACGCG CAGTTGGACC ACGTCTTCGA GAGCGGCGTC
GTGATGCGCA GCCAGACGCG GATCGCCGAC TACGCGAAGT ACCATCAGAA CGCCTACCCC
AACAGTCCGG TCAGTGCCGA CGAGACCTCG TTCGTGATGC GCGGCTACGG CAGCCAGACC
GACCGCACGA ACACCTTCAA CCAGACCGAC TTCACCTACA AATTCAACAC CGGCCCCCTC
GCCCACACCG TGGTGGCCGG CCTGGAACTC GGCTTTCAGG AGGGGATCGA CTTCCGGCGC
GACTTCATCT GGAATTCGAC CGGCACCCGC AACCTCCCCG TCAATCCCTT CGCGCCGACC
ACGACCGAGG GGGCAACGCT GCGGAATCTC GCCTCGGGCC GCAACAACAC CTATCGGCTC
GGCGTGTTCT CCGCCTTCGC CCAGGACCAG ATCGAGATCG ACGAGCATCT GCAGTTCATC
GTCGGCGCCC GCTTCGACCG CTTCGATTTC CAATCGCGCG ACCGTCGCCC CGACGCCGCG
ACCGGCCTCC CTGCCCAACC GAATAGCCGC ATCGACAATC TTGTCTCGCC GCGGGTCGGC
GTTGTGGTGA AACCGCTGCC GAATCTGGCC TTCTACGGCA GCTATTCCGT GTCCTTCCTG
CCGAGCGCGG GTGATCAGTT CCGCGTGCTG GACCCGATGA CCGCTTTGTC CGCACCCGAG
CGGTTCGAGA ACGCGGAAAT CGGCGTGAAA TACGAGATCA CGCCGGCCCT GATCCTCACC
GCTGCGCTGT TCAACCTCGA TCGCGACAAC CAGCCGATCC CGTCATCCAC CGAAGCGGGT
TTCTCCGCCG GCCCGGGCAA GACGAACACA AGGGGTGCCG AGATCGGCAT TGCCGGCTAT
GCGACCGATT GGTGGCAGAT CTCCGGCGGC TACGCCTATA CCGAGCCGCG CATCGTCGCC
GACATCGATG ACGACGGCGA CGTCATCCGG GCCGGCAACC TCGTCGGCGG CGTTCCGCTC
AACACCTTCA GCCTGTGGAA CAAATTCGAC ATCGGCGAGC GATTCTCCGT CGGCGTCGGC
TACCTCTACC AGGATGCCAG CTTCGCATCG TCGGACAACG CGGTGCGCCT GCCGAGCTTC
TCGCGCTTCG ATGCCGGCGT GTTCTACGCG TTCAGCGAGA CTATGCGCGC GCAGGTCAAC
ATCGAGAATC TGTTCGACCG GCGCTACGTC ATCTCCGCGC ACAACAACAA CAACATCCTG
CCCGGCGCAC CCCGGACCGT TCGCTTCCAG CTCATCGCAC GGTTCTGA
 
Protein sequence
MRNARQSRAV LRPMLLGSVA CTSIAAAFIH PARAQVTDIN TGLPTAQAVH PLTAFPSVGG 
VTLDMISVAG SGSGRGLVVD SSGSQVGYLA RRLRSSTKTD TPLVDTPQAI SVVTEAQIRD
QNVQSIGEAL RYVPGVAIAQ GEGHRDEILI RGQRTTADFF VNGIRDDARY FRDLYNTQRI
EVLKGPNAMI FGRGGGGGVV NRVLKEADGV PVREVLVQGG QFGNKRMAVD LGDRVSDSAF
FRLNGVFEDT GTYRDFIDIR RYGVNPTMTF LLGPQTTLRL SYEFFSDNRI ADRGIPSQFG
RPWRYRENTS TLFGAPLISN GFVDAHIGNA QLDHVFESGV VMRSQTRIAD YAKYHQNAYP
NSPVSADETS FVMRGYGSQT DRTNTFNQTD FTYKFNTGPL AHTVVAGLEL GFQEGIDFRR
DFIWNSTGTR NLPVNPFAPT TTEGATLRNL ASGRNNTYRL GVFSAFAQDQ IEIDEHLQFI
VGARFDRFDF QSRDRRPDAA TGLPAQPNSR IDNLVSPRVG VVVKPLPNLA FYGSYSVSFL
PSAGDQFRVL DPMTALSAPE RFENAEIGVK YEITPALILT AALFNLDRDN QPIPSSTEAG
FSAGPGKTNT RGAEIGIAGY ATDWWQISGG YAYTEPRIVA DIDDDGDVIR AGNLVGGVPL
NTFSLWNKFD IGERFSVGVG YLYQDASFAS SDNAVRLPSF SRFDAGVFYA FSETMRAQVN
IENLFDRRYV ISAHNNNNIL PGAPRTVRFQ LIARF