Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_2005 |
Symbol | |
ID | 5832845 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 2234813 |
End bp | 2236987 |
Gene Length | 2175 bp |
Protein Length | 724 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641367806 |
Product | TonB-dependent siderophore receptor |
Protein accession | YP_001639475 |
Protein GI | 163851432 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4774] Outer membrane receptor for monomeric catechols |
TIGRFAM ID | [TIGR01783] TonB-dependent siderophore receptor |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.22147 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.931942 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGACGTC GCGGAACGAA CGAAGTCGCG TTTGCGCTTC TGGCAGGCAC GGCCGTCGCG GTTGCCCTCC CTGCACGGGC ACAACCCCTT GTTCCGGTCG AGGCGTCTCC CGCCGCGGTC GCGCTGGACG AACTGGCCGT CGAAGGCCTC GGCCGCGGCG CGCTGCGGCT TGAACCCCAG GGCGGTGTCA CGGTCGGCTA TCTCGGCAAG GCGACGCGGA GCGCCACAAA GACCCCGACG CCCCTGCTCG ATACGCCGCA ATCCGTCTCG GTCATCACGC GCGAGCAGAT CCTGGACCAG GGCTTCCAGT CGATCGGCGA GGCGACGCGC TACGTGCCGG GCGTGATCCA GGCCCAGGGC GAGGGCAATC GCGACGAGCT GATCATCCGC GGCCAGCGCT CGAACGCCGA CTTCTTCGTC AACGGCATCC GCGATGACGT GCAGTATTAC CGCGACCTCT ACAACATCCA GCGCATCGAA GTCCTGAAGG GGCCCAATGC GATGATCTTC GGCCGCGGCG GCGGTGGCGG CGTCATCAAC CGCGTGCTCA AGGAAGCCGA CGGCGTCCCG ACCCGCGAGA TCGTTGCCCA GGGCGGCCAG TTCGCGAACA AGCGCGTGGC GCTCGATGTC GGCGACCGCG TCTCCGACAG CGTGTTCTTC CGCATGAACG GCGTGTTCGA GGATACGGCG ACCTACCGCG ACTTCGTCGA TATCCGCCGC TACGGCGTGA ACCCGACGAT GACCTTCCTG CTCGGGCCGC AGACGACGCT GCGCCTGTCC TACGAGTACT TCCACGACGA CCGCACCACC GATCGCGGCA TCCCCTCGCA GTTCGGCCGG CCCTACCGGT ACCGCGACAA CCGGACGACC TATTTCGGCA ACCCGTTCCT GTCGCCGACC TACGTCAACG CCCACATCGC CACGGCGCAG CTCGACCACG TCTTCGAGAA CGGCGTCGTG ATGCGCAGTC AGTCACGCAT CGCCGATTAC GAAAAATTCT ATCAGAACGT CTTCCCCGGC GGGGCGGTGA ACGCGGCCGG CACGGCCGTC AACATCTCGG CCTATAACAG CCAGACCGAC CGGACGAACT ACTTCAACCA GACCGACTTC ACCTACCAGT TCCTCACGGG ACCGGTGAAG CACACCCTGC TCGGCGGGTT CGAACTCGGC TACCAGGAAG GTCTGAGCGT CCGTGAAGAC GGCTTCTTCG CGACCACCGG CACCCAGACC CTCGTCGTCA ACCCGCTCGC GCCGCTGACG CGCGTCGGCG TCAACTTCCG CAACATCGCC AGCGGGGCCA ACAGCACCTA CGATCTCGGC CTCGCCGCAG CCTACGTGCA GGATCAGGTC GAACTGAACG ACTACGTGCA GCTCATCGGC GGCCTGCGCT TCGACCATTT CGACTTCGCG GCGACCGACC GGCGCACCAA CATCACCAAT GCCCGCGTGG ACGACCTGAT CTCGCCCCGC GCTGGCCTCG TCGTGAAGCC GCTGCCGAAC CTCGCCTTCT ACACGAGCTA CAGCATCTCC TACCTGCCCT CGTCCGGCGA TCAGTTCAGC GCATTGACGC CGGGCCTCGT CATCGCTCAG CCAGAGAGAT TCGAGAACAC GGAAGTGGGC GTGAAGTACG ACGTCTCGCC CGTGCTTCAG CTCACCGGCG CGCTGTTCAA CCTCGACCGC ACCAACCAGC GCATCGCCGA TCCGAACCGG CCCGGCTTCT TCCTGACCTC GGGCCAGACC AACACGCAGG GTGCGGAGAT CGGCGCCAAC GGCTACGTCA CCGATTGGTG GTCGATCGCG GGCGGCTACG CCTTCACCGA TGCGCGCATC GCGAACCGGC TCTCCGATAC GATCGTGGCC GGCAACTTCG TCGGCCTCGT GCCGCTCAAT TCCTTCACAC TGTGGAACAA GTTCGACATC GATCCGAGCT TCTCGGTCGG CGTCGGCTTC ATCAACCAGT CGCACTCCTT CGCGACTTCG GACAACACCG TCCGGCTTCC GAGCTACTCG CGCTTCGATC TGGGCCTGTT CTACCGGATC AGCGAGAACG CACGCGCGCA GGTGAACATC GAGAACCTGT TCGACCGCAA CTACATCGTC TCGGCGCACA ACAACAACAA CATCCTGCCC GGCGCACCCC GTACGGTCCG GGCACAGATC ATCGTGCGCT GGTAG
|
Protein sequence | MRRRGTNEVA FALLAGTAVA VALPARAQPL VPVEASPAAV ALDELAVEGL GRGALRLEPQ GGVTVGYLGK ATRSATKTPT PLLDTPQSVS VITREQILDQ GFQSIGEATR YVPGVIQAQG EGNRDELIIR GQRSNADFFV NGIRDDVQYY RDLYNIQRIE VLKGPNAMIF GRGGGGGVIN RVLKEADGVP TREIVAQGGQ FANKRVALDV GDRVSDSVFF RMNGVFEDTA TYRDFVDIRR YGVNPTMTFL LGPQTTLRLS YEYFHDDRTT DRGIPSQFGR PYRYRDNRTT YFGNPFLSPT YVNAHIATAQ LDHVFENGVV MRSQSRIADY EKFYQNVFPG GAVNAAGTAV NISAYNSQTD RTNYFNQTDF TYQFLTGPVK HTLLGGFELG YQEGLSVRED GFFATTGTQT LVVNPLAPLT RVGVNFRNIA SGANSTYDLG LAAAYVQDQV ELNDYVQLIG GLRFDHFDFA ATDRRTNITN ARVDDLISPR AGLVVKPLPN LAFYTSYSIS YLPSSGDQFS ALTPGLVIAQ PERFENTEVG VKYDVSPVLQ LTGALFNLDR TNQRIADPNR PGFFLTSGQT NTQGAEIGAN GYVTDWWSIA GGYAFTDARI ANRLSDTIVA GNFVGLVPLN SFTLWNKFDI DPSFSVGVGF INQSHSFATS DNTVRLPSYS RFDLGLFYRI SENARAQVNI ENLFDRNYIV SAHNNNNILP GAPRTVRAQI IVRW
|
| |