Gene Information |
Locus tag | Mpe_A1736 |
Symbol | |
ID | 4785289 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 1861149 |
End bp | 1863362 |
Gene Length | 2214 bp |
Protein Length | 737 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640090307 |
Product | TonB-dependent receptor protein |
Protein accession | YP_001020931 |
Protein GI | 124266927 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4773] Outer membrane receptor for ferric coprogen and ferric-rhodotorulic acid |
TIGRFAM ID | [TIGR01783] TonB-dependent siderophore receptor |
|
|
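The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based and inclusive, and that the 2214 bp gene length includes one terminal stop codon; the record does not state either convention. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming its length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length_bp = gene_length(1861149, 1863362)
print(length_bp)                  # 2214
print(protein_length(length_bp))  # 737
```

Both results match the Gene Length and Protein Length rows. The simple subtraction applies here because the record lists the gene as not spliced.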
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00567556 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAACAC TCGCCCCCGT GGCGCTGGCT GCGGCTATGG CCTGTCAGCT CCCCGCCGCG TACGCGCAAT CCGAAACGCC ATCGGCCCCG GCCGACGCGA ACGCCACCGT GACGCTGCCG CCGATCACCG TGAAGGCCAG TGCCGACGCC TCGGCGCAAG GGCTGTCCCC GGCGTTTCCG GGCGGGCAGG TGGCGCGCGG CGGGCGCGCC GGCATCCTCG GCACCAAGGA CAACCTGGAC ACGCCGTTCA GCATCACCAG CTACACCAAC GAACTGATCC AGGATCGCCA GGCCAGGAGC GTCGGCGACG TGCTGCAGAA CGATCCGGGC GTGCGCGTGG CGCGCGGCTT CGGCAACTTC CAGGAGTCGT ACTTCATCCG CGGCTTCATC CTCGGCTCCG ACGACGTGGC CTACAACGGC CTGTACAACC TGCTGCCGCG CCAGTACATC GCGACGGAAT TGTTCGAGCG CGTGGAGGTG TTGCGTGGCG CGTCGGCCTT CCTGTCCGGC GCCACGCCGT CGGGGGGTGG CCTCGGCGGC AGCATCAACC TGCTGCCCAA GCGCGCCTCC AACGAAGACC TGAACCGCGT GACGGTCGGC GCGGCCAGCG GGCCGCAGGG CAGCGCGGCC GTGGACATTT CCCGTCGCTT CGGTCCGGAC CGGAGCACCG GCTTGCGCCT GAACGCGGCT TACCGTGATG GTGACACCGC GGTCGACGAG GAAAAGACCC GTCTCGGCCT CGTCTCGTTG GGCGCGGACT GGCGCAGCCG CGACGTGCGA CTGTCCGGCG ACATCGGCTT CCAGGACAAC CGGCTGAAGC GCACCCGCAG CAGCGTGACG CTGGGCGATT TCCTGACGCC GGTGACGGCG ATGCCGGCCG TGCCGGACAA CGAGACGAAC GCATCGCAAC CGTGGTCGTA CTCGAACGAG AAAGACGTGT TCGGTACCTT GCGCGGCGAA GTCGACCTCG GAGCCGACGT CACGGCATGG GCGGCCTACG GCCTGCGCCG CAGCGACGAG GCCAATTCGC TGGCGAACTT CACGCTGCTC GACCCGGTCA GCGGCGACGG CTACACCTAT CGCTTCGACA ACACCCGCGA AGACCGCGTC GACACCGGCG AGTTGGGGCT GCGTGGCAAG CTCAGGACGG GCAGCGTGGG TCACGAGTGG GTCGTCTCGG CGTCCTACTT CAGGCTGGAA ACGAAGAACG CCTATCTGTT CGACTTCGGC AACCAGCAGG CCACCAATCT CTATCGGCCG GTGCGGTCGC CCGAGCCGGC ATTCAGCGCC GGCGCCGGCG GCGGCAACGA TCTGGCGTCG CCGGCGCTGA CGGCGCGCAC CCGCCTCACC AGTCTGGCGC TCGGCGACAC CTTGTCGTTC GATGGCGATC GCTTTCTGCT GACACTGGGT GCCCGCCATC AGAACCTGCG CATCGAGAAC TACGCCTACA CCACCGGCGT CGCTGCACCG GTCTACGACG ACAGCCGCGT CAGCCCGCTG GTCGGCGCGG TCTACAAGGC GACGAAGCAG TTGTCGCTCT ACGCCAACTA CATCGAAGGC CTGAGCAAGG GTGACACCGC GCCGGCCACA GCCAACAATC CGAACGCCAT GCTGGCGCCC TATGTCTCCA AGCAGAAGGA GATCGGCGCG AAGTACGAGG CCGATCGCCT GGGCCTGGGT GCAGCGCTCT TCAGCACCGA CAAGCCGCGA GCGGTGGGTG GCACCACCGG GACCACGTTC ACCGCGGGCG GCGAGGATCG CCACCGGGGT GTCGAGCTGA CCGCCTACGG CGAGGCCGTA 
CGCGGCCTGC GTGTGCTGGG CGGCCTGACT TGGCTCGATG CCAAGCAGAA GGAAACCGGC AGTGCGGCCA CCGAGGGCAA GCGCACGCTG GGCATTCCCC GCACGCAGGC CAATCTCGGC GTCGAGTGGG ACGTGACCGG CGTGCAGGGC CTCGCGCTCG ACGCCCGCGT GGTCCGCACC GGGTCGGTCT ACGCCGACAG CGCCAACACG TTGCGCGTGC CCGGCTGGAC CCGGGTGGAT GTGGGGGCGC GCTACCTGTT GGACGTGCAG GACCGCCTGG TGACGCTGCG CGCGCGCATC GACAACCTGA CCGACCGTGA CTATTGGGCT TCGGTCGGCG GTTATCCGGA TGCCGGCTAT CTGAACATCG GCGCGCCCCG CACCTTCGTG CTGAGCGCCA GCGTCGACTT CTGA
|
Protein sequence | MRTLAPVALA AAMACQLPAA YAQSETPSAP ADANATVTLP PITVKASADA SAQGLSPAFP GGQVARGGRA GILGTKDNLD TPFSITSYTN ELIQDRQARS VGDVLQNDPG VRVARGFGNF QESYFIRGFI LGSDDVAYNG LYNLLPRQYI ATELFERVEV LRGASAFLSG ATPSGGGLGG SINLLPKRAS NEDLNRVTVG AASGPQGSAA VDISRRFGPD RSTGLRLNAA YRDGDTAVDE EKTRLGLVSL GADWRSRDVR LSGDIGFQDN RLKRTRSSVT LGDFLTPVTA MPAVPDNETN ASQPWSYSNE KDVFGTLRGE VDLGADVTAW AAYGLRRSDE ANSLANFTLL DPVSGDGYTY RFDNTREDRV DTGELGLRGK LRTGSVGHEW VVSASYFRLE TKNAYLFDFG NQQATNLYRP VRSPEPAFSA GAGGGNDLAS PALTARTRLT SLALGDTLSF DGDRFLLTLG ARHQNLRIEN YAYTTGVAAP VYDDSRVSPL VGAVYKATKQ LSLYANYIEG LSKGDTAPAT ANNPNAMLAP YVSKQKEIGA KYEADRLGLG AALFSTDKPR AVGGTTGTTF TAGGEDRHRG VELTAYGEAV RGLRVLGGLT WLDAKQKETG SAATEGKRTL GIPRTQANLG VEWDVTGVQG LALDARVVRT GSVYADSANT LRVPGWTRVD VGARYLLDVQ DRLVTLRARI DNLTDRDYWA SVGGYPDAGY LNIGAPRTFV LSASVDF
|
| |
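The gene sequence above is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. A minimal sketch of a GC content calculation follows, assuming GC content means the fraction of G and C bases over the full sequence length; the record does not say how its 69% figure was computed or rounded. `clean_sequence` and `gc_content` are hypothetical helper names.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace that groups the sequence into blocks and uppercase it."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First ten-base block of the gene sequence in this record.
first_block = "ATGAGAACAC"
print(gc_content(first_block))  # 0.4
```

Passing the complete Gene sequence field to `gc_content` and multiplying by 100 gives a percentage that can be compared with the GC content row.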