Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_3761 |
Symbol | |
ID | 5833369 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 4172413 |
End bp | 4175409 |
Gene Length | 2997 bp |
Protein Length | 998 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641369551 |
Product | IucA/IucC family protein |
Protein accession | YP_001641206 |
Protein GI | 163853163 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II [COG4264] Siderophore synthetase component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.0951466 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCTCGC GTCCTGAGGC CATGCGTCCC GAAACCGTGG AGCAGCCCGA AGCGCGTGTC CTGCGCCAGT TGGCCGAAGC CGTCCTGTTC GAAGGGCTTG CCGAGCGCGA GCCTATCCGC GACGTCGCCG GTCGGATCGC ATGGCGCCTC GGCTCCCGCC GCTTCCGCGC GACCGGCACC CTCGGCCCGT TCGGCCGCCC CCGCCTCGAT TCCGGTTCGG TCGAGATGGC AGGCGAAGAG GACGCCTGGG TGCCGGCCGA CCTCGCCACC CTCGTCGAGG CCCTTCCCGC CGCCCCGGAG CACCGGACGC GGCTGCGGGC CGAGCTTCGG CAGACGATCG AACTCTGCCG CTGGAACGCC CAAAACCTCT CGCCGCCGGA GCGCCGCGCG CTGCCCTTCG CCGCGCTCGA TGCGGCTTTG TGGGAGGGCC ATCCCTACCA TCCGAGCTTC AAGGCGCGCA CCGGCTTCAC CCTGGAGGAT CATCGCCGCT ACGGGCCCGA GACCGCCGCA CCGTTCCGCC TCGAATGGCT GGCTGTCCGC CGGGACACCA TCGCCCTCGC GCTGCCGGGC CCGGAGGACG GGTTCTGGCG GGCCGAACTC GGCGGGGAGG GGGACGTCCT CGCCAGCCGC CTCGCCGCGG CCGGGCATTC CCTCGACACG CACACGCTGC TGCCGGTCCA TCCCTGGCAG ATGCGCCGGC TTGAAGAGGA GGCTTTGCGT CCCTGGCTGA CGGAGGGCCG CGCCGTTGCG CTCGGCACCG CCGGCCCGCG CTATGTCGCG AGCCAGTCCC TGCGCACGCT GCACAATCTC GACGACCCGT CCGCCGCGAG CGTGAAGCTG GCGCTTGCCG TCGTCTCGAC CTCGAGCCTG CGCATCCTCG ACCCGCATTT CGTGCTGACC GGCCCGGCCC TGTCCGACTG GCTCGCCGGC CTCGTCGCGG GCGATCCCGC GCTGCAGGGC CGCGTGACGG TGCTGCGCGA ATACGCCGCC GCGCTCGCCG ACCGGGACGG TCCGCTCGCC GGGCAGCTCG CCGCGATCTG GCGGGAGAGC CCGCGTCTCG TTCCCGGCGA AGCGGCCGTG CCCTTCAACG CGCTGGCCGT GTGCGAGGCG GACGGGAGCC CCTTCATCGC GCCCTGGCTC GAACGCTACG GCCGCGACGC CTGGCTCGAC CGCTTGGTCA CGGTCGCGGT GCTGCCGGTC TGGCACCTGC TCGCGGGCCA CGGCGTCGCC CTGGAGGCGC ACGGCCAGAA CATGATTCTG GTCCACCGCG ACGGCTGGCC GGATCGGGTG ATCCTGCGCG ACTTCCACGA GAGCGCGGAA TACGCCCCCG ACTTCGTGAC GAGCCCGGAA CGCGTGCCGG ATTTCGGGGC GATCGACCCG GCCCATGCCG GGCCGGCGGA CGACCGCTTC CACGCCATGC GCTCGGCCGC GACGCTCGCC GAACTCGTCA CCGACAGCCT GTTCGTCTTC AACCTCGGCG AGATCACCAC CCTTCTCAAG CGCCGGCACG GCCTCGACGA GGCAGGGTTC TGGCGCCGCC TCGGCCAGCG TCTGCGGCAC CACACGGCGG AGCATGGGCT GGAGGCGCGG TTCACCCGCC TTGGGGTCGA GGCGCCGCAG CTGCGGGTCG AGGCGCTGCT GTCGCGCAAG CTCGGACTCG GCGAGGCGGG CGGCAGCCTG CTGGCCCCCA ACGCCCTGTT CCCTTCTCCT GACGCCCTTT CCGGAGCCTG CATGATCGAG ATCGACGGCC GCACCATACC GGCGGACGCG ATGGAGGCCG CCATCCAATG CGTCGCGGAT ACGGCGGCCT TGCGCGGCGG CAGCGGCGAG CGCGTCGCCG CGCGCTTCCG CGACACCGCC CAGGGTCTCG CCTTCATCCT CGCCGCCCGC CGCAGCGGCG CGAGCCTGCT GCCGATCCAT CCGGCCCTGC CGGACGAGGG CGCGCGTCGG CTCGCCCAGC GCGCCGGCTG CCACCGCCTG TTCCTCGACG GCCTGGAGGG CGAGCCTCTG GACGGCGCCG CCCCGCCGGT TCCTGGGGAG GGCGAGCTGC TCCAGATGAG TTCCGGCACC ACCGGCGAGC CGAAATGCAT CGCCCGCCCC TGGAGCGCGG TGGAGCGCGA GGTCGAGAGC TATGTCGGCG CCTTCACCGA GCCGGACGGG ATGACACCGG TCATCGCCTG CCCGATCACC CATTCCTACG GGCTGATCTG CGGCCTGTTC GTGGGTCTGC GCCGCGGCCG CGTGCCGGTG ATCGTGGACA CCACCAACCC GAAATACCTC CTGCGCCGCC TGCGCGAGAT CGAGCGGCCG GTGCTCTACA CCGCACCCGC CATGCTGCAC ACGCTGGCCC GGCTGATGCC CGAGGGCGAG ACCCTCCACG CGGCGATGGT CTCGGGCACG CTCCTGCCCG CACCCTGGTT TTCCGCCATC CGCGGGCGCG TCACCCACCT GTTCCAGCAA TACGGCTGCT CGGAAGCCGG CTGCATCGCG ATCAACCCGG ATCTGCGCCG CGCCGACGCC ATCGGCCGCC CGCTGCCGCA CCACCGCGTG CGAGCGGGGA CGGGCGCCGA GGCCCCGGCG GAGATCGTGG TCGAGGGGGA GGGCGGGGCG ATCCACACCG CCGATCTCGG CTACCTGGCG CCCGACGGCA TGCTGATCTT CGTCGCGCGC AAGGACGACA CGATCAACGT CTCGGGCCTC AACGTCTATC CCGGCGAGGT CGAGGACGTG GTGATGGCGA TGCCCGGCAT CACCGACGCG GTGGCCTTCG CCCGGCCCGA TCCGTTTGCG GGCGAGCGGG TGACGCTGCT GTTCAGCGCC GACGGTCCCG TGCCGCCCCG GGCGCTGCAG GACTGGTGCC GCCGCTGGCT CGCCGGCCAT CAGGTGCCGG TCGAGGCGGT GCAGGTCGGC GCGATCCCGC GCGAGGCCAA CGGCAAGATT TCTCGTCGCG CGGTCGCTGC GCAGTACCGG GACGGCGCGT TGGAGGCGGT GGCGTGA
|
Protein sequence | MPSRPEAMRP ETVEQPEARV LRQLAEAVLF EGLAEREPIR DVAGRIAWRL GSRRFRATGT LGPFGRPRLD SGSVEMAGEE DAWVPADLAT LVEALPAAPE HRTRLRAELR QTIELCRWNA QNLSPPERRA LPFAALDAAL WEGHPYHPSF KARTGFTLED HRRYGPETAA PFRLEWLAVR RDTIALALPG PEDGFWRAEL GGEGDVLASR LAAAGHSLDT HTLLPVHPWQ MRRLEEEALR PWLTEGRAVA LGTAGPRYVA SQSLRTLHNL DDPSAASVKL ALAVVSTSSL RILDPHFVLT GPALSDWLAG LVAGDPALQG RVTVLREYAA ALADRDGPLA GQLAAIWRES PRLVPGEAAV PFNALAVCEA DGSPFIAPWL ERYGRDAWLD RLVTVAVLPV WHLLAGHGVA LEAHGQNMIL VHRDGWPDRV ILRDFHESAE YAPDFVTSPE RVPDFGAIDP AHAGPADDRF HAMRSAATLA ELVTDSLFVF NLGEITTLLK RRHGLDEAGF WRRLGQRLRH HTAEHGLEAR FTRLGVEAPQ LRVEALLSRK LGLGEAGGSL LAPNALFPSP DALSGACMIE IDGRTIPADA MEAAIQCVAD TAALRGGSGE RVAARFRDTA QGLAFILAAR RSGASLLPIH PALPDEGARR LAQRAGCHRL FLDGLEGEPL DGAAPPVPGE GELLQMSSGT TGEPKCIARP WSAVEREVES YVGAFTEPDG MTPVIACPIT HSYGLICGLF VGLRRGRVPV IVDTTNPKYL LRRLREIERP VLYTAPAMLH TLARLMPEGE TLHAAMVSGT LLPAPWFSAI RGRVTHLFQQ YGCSEAGCIA INPDLRRADA IGRPLPHHRV RAGTGAEAPA EIVVEGEGGA IHTADLGYLA PDGMLIFVAR KDDTINVSGL NVYPGEVEDV VMAMPGITDA VAFARPDPFA GERVTLLFSA DGPVPPRALQ DWCRRWLAGH QVPVEAVQVG AIPREANGKI SRRAVAAQYR DGALEAVA
|
| |