Gene Mext_2005 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_2005 
Symbol 
ID5832845 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp2234813 
End bp2236987 
Gene Length2175 bp 
Protein Length724 aa 
Translation table11 
GC content65% 
IMG OID641367806 
ProductTonB-dependent siderophore receptor 
Protein accessionYP_001639475 
Protein GI163851432 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4774] Outer membrane receptor for monomeric catechols 
TIGRFAM ID[TIGR01783] TonB-dependent siderophore receptor 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.22147 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.931942 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGACGTC GCGGAACGAA CGAAGTCGCG TTTGCGCTTC TGGCAGGCAC GGCCGTCGCG 
GTTGCCCTCC CTGCACGGGC ACAACCCCTT GTTCCGGTCG AGGCGTCTCC CGCCGCGGTC
GCGCTGGACG AACTGGCCGT CGAAGGCCTC GGCCGCGGCG CGCTGCGGCT TGAACCCCAG
GGCGGTGTCA CGGTCGGCTA TCTCGGCAAG GCGACGCGGA GCGCCACAAA GACCCCGACG
CCCCTGCTCG ATACGCCGCA ATCCGTCTCG GTCATCACGC GCGAGCAGAT CCTGGACCAG
GGCTTCCAGT CGATCGGCGA GGCGACGCGC TACGTGCCGG GCGTGATCCA GGCCCAGGGC
GAGGGCAATC GCGACGAGCT GATCATCCGC GGCCAGCGCT CGAACGCCGA CTTCTTCGTC
AACGGCATCC GCGATGACGT GCAGTATTAC CGCGACCTCT ACAACATCCA GCGCATCGAA
GTCCTGAAGG GGCCCAATGC GATGATCTTC GGCCGCGGCG GCGGTGGCGG CGTCATCAAC
CGCGTGCTCA AGGAAGCCGA CGGCGTCCCG ACCCGCGAGA TCGTTGCCCA GGGCGGCCAG
TTCGCGAACA AGCGCGTGGC GCTCGATGTC GGCGACCGCG TCTCCGACAG CGTGTTCTTC
CGCATGAACG GCGTGTTCGA GGATACGGCG ACCTACCGCG ACTTCGTCGA TATCCGCCGC
TACGGCGTGA ACCCGACGAT GACCTTCCTG CTCGGGCCGC AGACGACGCT GCGCCTGTCC
TACGAGTACT TCCACGACGA CCGCACCACC GATCGCGGCA TCCCCTCGCA GTTCGGCCGG
CCCTACCGGT ACCGCGACAA CCGGACGACC TATTTCGGCA ACCCGTTCCT GTCGCCGACC
TACGTCAACG CCCACATCGC CACGGCGCAG CTCGACCACG TCTTCGAGAA CGGCGTCGTG
ATGCGCAGTC AGTCACGCAT CGCCGATTAC GAAAAATTCT ATCAGAACGT CTTCCCCGGC
GGGGCGGTGA ACGCGGCCGG CACGGCCGTC AACATCTCGG CCTATAACAG CCAGACCGAC
CGGACGAACT ACTTCAACCA GACCGACTTC ACCTACCAGT TCCTCACGGG ACCGGTGAAG
CACACCCTGC TCGGCGGGTT CGAACTCGGC TACCAGGAAG GTCTGAGCGT CCGTGAAGAC
GGCTTCTTCG CGACCACCGG CACCCAGACC CTCGTCGTCA ACCCGCTCGC GCCGCTGACG
CGCGTCGGCG TCAACTTCCG CAACATCGCC AGCGGGGCCA ACAGCACCTA CGATCTCGGC
CTCGCCGCAG CCTACGTGCA GGATCAGGTC GAACTGAACG ACTACGTGCA GCTCATCGGC
GGCCTGCGCT TCGACCATTT CGACTTCGCG GCGACCGACC GGCGCACCAA CATCACCAAT
GCCCGCGTGG ACGACCTGAT CTCGCCCCGC GCTGGCCTCG TCGTGAAGCC GCTGCCGAAC
CTCGCCTTCT ACACGAGCTA CAGCATCTCC TACCTGCCCT CGTCCGGCGA TCAGTTCAGC
GCATTGACGC CGGGCCTCGT CATCGCTCAG CCAGAGAGAT TCGAGAACAC GGAAGTGGGC
GTGAAGTACG ACGTCTCGCC CGTGCTTCAG CTCACCGGCG CGCTGTTCAA CCTCGACCGC
ACCAACCAGC GCATCGCCGA TCCGAACCGG CCCGGCTTCT TCCTGACCTC GGGCCAGACC
AACACGCAGG GTGCGGAGAT CGGCGCCAAC GGCTACGTCA CCGATTGGTG GTCGATCGCG
GGCGGCTACG CCTTCACCGA TGCGCGCATC GCGAACCGGC TCTCCGATAC GATCGTGGCC
GGCAACTTCG TCGGCCTCGT GCCGCTCAAT TCCTTCACAC TGTGGAACAA GTTCGACATC
GATCCGAGCT TCTCGGTCGG CGTCGGCTTC ATCAACCAGT CGCACTCCTT CGCGACTTCG
GACAACACCG TCCGGCTTCC GAGCTACTCG CGCTTCGATC TGGGCCTGTT CTACCGGATC
AGCGAGAACG CACGCGCGCA GGTGAACATC GAGAACCTGT TCGACCGCAA CTACATCGTC
TCGGCGCACA ACAACAACAA CATCCTGCCC GGCGCACCCC GTACGGTCCG GGCACAGATC
ATCGTGCGCT GGTAG
 
Protein sequence
MRRRGTNEVA FALLAGTAVA VALPARAQPL VPVEASPAAV ALDELAVEGL GRGALRLEPQ 
GGVTVGYLGK ATRSATKTPT PLLDTPQSVS VITREQILDQ GFQSIGEATR YVPGVIQAQG
EGNRDELIIR GQRSNADFFV NGIRDDVQYY RDLYNIQRIE VLKGPNAMIF GRGGGGGVIN
RVLKEADGVP TREIVAQGGQ FANKRVALDV GDRVSDSVFF RMNGVFEDTA TYRDFVDIRR
YGVNPTMTFL LGPQTTLRLS YEYFHDDRTT DRGIPSQFGR PYRYRDNRTT YFGNPFLSPT
YVNAHIATAQ LDHVFENGVV MRSQSRIADY EKFYQNVFPG GAVNAAGTAV NISAYNSQTD
RTNYFNQTDF TYQFLTGPVK HTLLGGFELG YQEGLSVRED GFFATTGTQT LVVNPLAPLT
RVGVNFRNIA SGANSTYDLG LAAAYVQDQV ELNDYVQLIG GLRFDHFDFA ATDRRTNITN
ARVDDLISPR AGLVVKPLPN LAFYTSYSIS YLPSSGDQFS ALTPGLVIAQ PERFENTEVG
VKYDVSPVLQ LTGALFNLDR TNQRIADPNR PGFFLTSGQT NTQGAEIGAN GYVTDWWSIA
GGYAFTDARI ANRLSDTIVA GNFVGLVPLN SFTLWNKFDI DPSFSVGVGF INQSHSFATS
DNTVRLPSYS RFDLGLFYRI SENARAQVNI ENLFDRNYIV SAHNNNNILP GAPRTVRAQI
IVRW