Gene Mext_2122 details

Gene Information

Locus tag: Mext_2122
Symbol:
ID: 5831864
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 2380090
End bp: 2382270
Gene Length: 2181 bp
Protein Length: 726 aa
Translation table: 11
GC content: 68%
IMG OID: 641367919
Product: TonB-dependent siderophore receptor
Protein accession: YP_001639588
Protein GI: 163851545
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG4774] Outer membrane receptor for monomeric catechols
TIGRFAM ID: [TIGR01783] TonB-dependent siderophore receptor
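
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only and is not part of the source database. It assumes that Start bp and End bp are both inclusive positions and that the reported gene length includes a single terminal stop codon; the function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2380090
END_BP = 2382270
REPORTED_GENE_LENGTH = 2181    # bp
REPORTED_PROTEIN_LENGTH = 726  # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive (assumption)."""
    return end_bp - start_bp + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon (assumption)."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the untranslated stop codon.
    return cds_length // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print("gene length matches record:", length == REPORTED_GENE_LENGTH)
    print("protein length matches record:",
          protein_length(length) == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions both checks agree with the record: 2382270 - 2380090 + 1 = 2181 bp, and 2181 / 3 - 1 = 726 aa.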


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.892393
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.0378186
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAACG CAGCGCAGAG ACGACGTCCC GTCACACCGG GATCGGCCGC GCTCATCACC 
GGCTCACGCG CTCTGCTCGG CCTCGTCCTG TTCTGCCCGC CGAGCGTCCA GGCGCAAACC
GTCCCCCCGG AGGGCGCCGA GGCTGTTTTG TCCGAACTCT CGGTCACCGG CACCGGCGAG
CGAGCGGGCG GTCCCGTCGT CGGATACCGC GCCACGCGCT CGGCCACGGC GACGCGGACG
GACACGGCTT TGCGCGACAC GCCGCAATCG ATCCAGGTCG TCCCGCGCGA GGTTCTGGTC
GATCAGCAGA ATGTCCGCCT GACCGACGCG CTCACCAATG TCAGCAACGT CCAGCCGGGG
GGCACCATCC AGGGCCGCTC CGACACCTAC ATCCTGCGCG GCTTCCGCAC CCAGACCTAC
GCCATCGACG GGCTGGTGCT GAACCCGGCC AACGCCTTTC AGCCGACGCA GCGCGACCTC
GCCAATGTCG AGCGGATCGA GGTGCTCAAA GGCCCGGCCT CCGTGCTCTA CGGCCAGGGC
GATCCCGGCG GCCTGATCAA CATCGTCACC CGCCAGCCGA CCCTTACGCC ATCCGCCGAC
CTGACGGTCC AGGGCGGCTC GTTCGGCTTC CGGCGGGTGC AGGGCTCGGT CTCGGGCGCG
ATCCCGAGCG TGGAGGGGCT TGCCGCCCGC TTCAGCTTCG GCACGCAGAA CGAGGCGACC
TTCCGCGATT TCGGCGGCCC GGAAAACTCC CGGCACTTCT TCGCGCCGGC CTTCGTCTGG
ACGCCCGATG CCTCGACGCG GGTCTACCTC AACGCCGAGT TCACCCGTCA GCACAGCCAG
TACGACGAGG GCCTGATCGC CTTTGGCGGC CGGGTGCCGC TCGACAACAT CAGCCGCTTC
TACGGCGAGC CGTGGTCGCG CTACTATGGC GAATCGAACT CGATCACCCT GCTGGCCGAG
CACGACGTCA ACGAGAACCT GACCCTGCGG CAGGCGATCA ACGGCCAGTG GGGATCGTTC
AATCTCCTGG CGACCCGGGC GACAGGCGTG AATGCCGCCG GCACCACGGT AACGCGGCGC
CTCACCGAGG GCGATTCGAT CTATCACTCG ATCGATAGCC GGACCGAGGC GCTCGGGCGC
TTCATCGATC CGCTCGGCTT CCGCCACACC GCCCTGGCCG GCTTCGAGAT CGTGGACGGC
TTCCGCCATC CGTTCACCAC GCAGGGGACC GCGACCTCGG TCTCCTTCCT CAACCCGATC
CGGGGCTCGG TGCCGCAGAT CGGCACGCTC ACCCTGCAGA GCGACCTGCG GCAGAAGCTC
AGCCTGTTCG GCCTCTACAT GCAGGACCAG ATCGAGTTCT TTCCCGGCCT GCAGCTCGTG
CTCGGCGTGC GCTTCGATAC GGCCGACCAG CTCTATTTCC AGCGCACGCC GACCACGCGG
ACGATCCCGC CGGAGCAGAA CCTCACCGGG GTCTCGCCGC GGGTCGGCCT CGTCTGGCGG
CCGCTGGAGC CGCTCACGCT CTATGGCAGC TACACCACCT CCTTCGTGCC GCAGACCGCC
AACGTCCTCA ACGTCGCGAG CCCACCTCCG GAGACCGGCG AGCAGGTCGA GGTCGGCGCC
CGCTTCGACC TGATCCCCGA CCGGCTCACC GTGAGTGCGG CCGCCTTCCG CATCCTGCGG
ACGAACGTCG CCGCCTCCGA TCCGGTCAAT ACCGGCTTCT CGATCATCAC CGGCGAACAG
CGCTCGCAGG GGTTCGAGGG CGACATCGCC GGCGAGATCC TGCCGGGCTG GAAGGTCATC
GGCGGCATCG GCTATCTCGA TGCGGAGGTC ACGAAGGATG CGACCGTCGC CATCGGCAAC
CGCCTGCCCG CGGCACCGGT CTTCAGCGCC AGCGTCTGGT CGACCTACCA ATTCCAGGGC
GGCCCGCTGC ACGGCTGGGG CTTCGGCGGC GGCCTCACCT ATGTCGGCGA GCGCTTCGGC
GACATCACCA ACACCTACAA GGTCGGCGCC TATGCCCGCC TCGACGCAAC GGTGTTCTAC
GAGATCGACC CGACCTGGCG CTTCGCCGTG AACGGCCGCA ACCTGACCGA CCGGCGCTAC
ATCGAGCAGC CGTTCAACCA ATTCAACAAC CAGCCCGGCG CGCCCCTGAC CGTGCTCGCC
AGCCTGACGG CGCGCTACTG A
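
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction that could be compared with the GC content field of this record. The helper names are hypothetical, and only the first block and the last line of the sequence are embedded here; the full sequence would be pasted in the same way.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First block and last line of the gene sequence, copied from this record.
FIRST_BLOCK = "ATGAGCAACG"
LAST_LINE = "AGCCTGACGG CGCGCTACTG A"

if __name__ == "__main__":
    # Start and stop codons as they appear at the ends of the listed sequence.
    print("start codon:", clean_sequence(FIRST_BLOCK)[:3])
    print("stop codon:", clean_sequence(LAST_LINE)[-3:])
    # GC fraction of the final line only, as a small worked example.
    print("GC fraction of last line:", gc_fraction(clean_sequence(LAST_LINE)))
```

The listed sequence starts with ATG and ends with TGA, which is a stop codon under translation table 11.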
 
Protein sequence
MSNAAQRRRP VTPGSAALIT GSRALLGLVL FCPPSVQAQT VPPEGAEAVL SELSVTGTGE 
RAGGPVVGYR ATRSATATRT DTALRDTPQS IQVVPREVLV DQQNVRLTDA LTNVSNVQPG
GTIQGRSDTY ILRGFRTQTY AIDGLVLNPA NAFQPTQRDL ANVERIEVLK GPASVLYGQG
DPGGLINIVT RQPTLTPSAD LTVQGGSFGF RRVQGSVSGA IPSVEGLAAR FSFGTQNEAT
FRDFGGPENS RHFFAPAFVW TPDASTRVYL NAEFTRQHSQ YDEGLIAFGG RVPLDNISRF
YGEPWSRYYG ESNSITLLAE HDVNENLTLR QAINGQWGSF NLLATRATGV NAAGTTVTRR
LTEGDSIYHS IDSRTEALGR FIDPLGFRHT ALAGFEIVDG FRHPFTTQGT ATSVSFLNPI
RGSVPQIGTL TLQSDLRQKL SLFGLYMQDQ IEFFPGLQLV LGVRFDTADQ LYFQRTPTTR
TIPPEQNLTG VSPRVGLVWR PLEPLTLYGS YTTSFVPQTA NVLNVASPPP ETGEQVEVGA
RFDLIPDRLT VSAAAFRILR TNVAASDPVN TGFSIITGEQ RSQGFEGDIA GEILPGWKVI
GGIGYLDAEV TKDATVAIGN RLPAAPVFSA SVWSTYQFQG GPLHGWGFGG GLTYVGERFG
DITNTYKVGA YARLDATVFY EIDPTWRFAV NGRNLTDRRY IEQPFNQFNN QPGAPLTVLA
SLTARY