Gene Mext_3761 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_3761 
Symbol 
ID5833369 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4172413 
End bp4175409 
Gene Length2997 bp 
Protein Length998 aa 
Translation table11 
GC content73% 
IMG OID641369551 
ProductIucA/IucC family protein 
Protein accessionYP_001641206 
Protein GI163853163 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
[COG4264] Siderophore synthetase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.0951466 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCTCGC GTCCTGAGGC CATGCGTCCC GAAACCGTGG AGCAGCCCGA AGCGCGTGTC 
CTGCGCCAGT TGGCCGAAGC CGTCCTGTTC GAAGGGCTTG CCGAGCGCGA GCCTATCCGC
GACGTCGCCG GTCGGATCGC ATGGCGCCTC GGCTCCCGCC GCTTCCGCGC GACCGGCACC
CTCGGCCCGT TCGGCCGCCC CCGCCTCGAT TCCGGTTCGG TCGAGATGGC AGGCGAAGAG
GACGCCTGGG TGCCGGCCGA CCTCGCCACC CTCGTCGAGG CCCTTCCCGC CGCCCCGGAG
CACCGGACGC GGCTGCGGGC CGAGCTTCGG CAGACGATCG AACTCTGCCG CTGGAACGCC
CAAAACCTCT CGCCGCCGGA GCGCCGCGCG CTGCCCTTCG CCGCGCTCGA TGCGGCTTTG
TGGGAGGGCC ATCCCTACCA TCCGAGCTTC AAGGCGCGCA CCGGCTTCAC CCTGGAGGAT
CATCGCCGCT ACGGGCCCGA GACCGCCGCA CCGTTCCGCC TCGAATGGCT GGCTGTCCGC
CGGGACACCA TCGCCCTCGC GCTGCCGGGC CCGGAGGACG GGTTCTGGCG GGCCGAACTC
GGCGGGGAGG GGGACGTCCT CGCCAGCCGC CTCGCCGCGG CCGGGCATTC CCTCGACACG
CACACGCTGC TGCCGGTCCA TCCCTGGCAG ATGCGCCGGC TTGAAGAGGA GGCTTTGCGT
CCCTGGCTGA CGGAGGGCCG CGCCGTTGCG CTCGGCACCG CCGGCCCGCG CTATGTCGCG
AGCCAGTCCC TGCGCACGCT GCACAATCTC GACGACCCGT CCGCCGCGAG CGTGAAGCTG
GCGCTTGCCG TCGTCTCGAC CTCGAGCCTG CGCATCCTCG ACCCGCATTT CGTGCTGACC
GGCCCGGCCC TGTCCGACTG GCTCGCCGGC CTCGTCGCGG GCGATCCCGC GCTGCAGGGC
CGCGTGACGG TGCTGCGCGA ATACGCCGCC GCGCTCGCCG ACCGGGACGG TCCGCTCGCC
GGGCAGCTCG CCGCGATCTG GCGGGAGAGC CCGCGTCTCG TTCCCGGCGA AGCGGCCGTG
CCCTTCAACG CGCTGGCCGT GTGCGAGGCG GACGGGAGCC CCTTCATCGC GCCCTGGCTC
GAACGCTACG GCCGCGACGC CTGGCTCGAC CGCTTGGTCA CGGTCGCGGT GCTGCCGGTC
TGGCACCTGC TCGCGGGCCA CGGCGTCGCC CTGGAGGCGC ACGGCCAGAA CATGATTCTG
GTCCACCGCG ACGGCTGGCC GGATCGGGTG ATCCTGCGCG ACTTCCACGA GAGCGCGGAA
TACGCCCCCG ACTTCGTGAC GAGCCCGGAA CGCGTGCCGG ATTTCGGGGC GATCGACCCG
GCCCATGCCG GGCCGGCGGA CGACCGCTTC CACGCCATGC GCTCGGCCGC GACGCTCGCC
GAACTCGTCA CCGACAGCCT GTTCGTCTTC AACCTCGGCG AGATCACCAC CCTTCTCAAG
CGCCGGCACG GCCTCGACGA GGCAGGGTTC TGGCGCCGCC TCGGCCAGCG TCTGCGGCAC
CACACGGCGG AGCATGGGCT GGAGGCGCGG TTCACCCGCC TTGGGGTCGA GGCGCCGCAG
CTGCGGGTCG AGGCGCTGCT GTCGCGCAAG CTCGGACTCG GCGAGGCGGG CGGCAGCCTG
CTGGCCCCCA ACGCCCTGTT CCCTTCTCCT GACGCCCTTT CCGGAGCCTG CATGATCGAG
ATCGACGGCC GCACCATACC GGCGGACGCG ATGGAGGCCG CCATCCAATG CGTCGCGGAT
ACGGCGGCCT TGCGCGGCGG CAGCGGCGAG CGCGTCGCCG CGCGCTTCCG CGACACCGCC
CAGGGTCTCG CCTTCATCCT CGCCGCCCGC CGCAGCGGCG CGAGCCTGCT GCCGATCCAT
CCGGCCCTGC CGGACGAGGG CGCGCGTCGG CTCGCCCAGC GCGCCGGCTG CCACCGCCTG
TTCCTCGACG GCCTGGAGGG CGAGCCTCTG GACGGCGCCG CCCCGCCGGT TCCTGGGGAG
GGCGAGCTGC TCCAGATGAG TTCCGGCACC ACCGGCGAGC CGAAATGCAT CGCCCGCCCC
TGGAGCGCGG TGGAGCGCGA GGTCGAGAGC TATGTCGGCG CCTTCACCGA GCCGGACGGG
ATGACACCGG TCATCGCCTG CCCGATCACC CATTCCTACG GGCTGATCTG CGGCCTGTTC
GTGGGTCTGC GCCGCGGCCG CGTGCCGGTG ATCGTGGACA CCACCAACCC GAAATACCTC
CTGCGCCGCC TGCGCGAGAT CGAGCGGCCG GTGCTCTACA CCGCACCCGC CATGCTGCAC
ACGCTGGCCC GGCTGATGCC CGAGGGCGAG ACCCTCCACG CGGCGATGGT CTCGGGCACG
CTCCTGCCCG CACCCTGGTT TTCCGCCATC CGCGGGCGCG TCACCCACCT GTTCCAGCAA
TACGGCTGCT CGGAAGCCGG CTGCATCGCG ATCAACCCGG ATCTGCGCCG CGCCGACGCC
ATCGGCCGCC CGCTGCCGCA CCACCGCGTG CGAGCGGGGA CGGGCGCCGA GGCCCCGGCG
GAGATCGTGG TCGAGGGGGA GGGCGGGGCG ATCCACACCG CCGATCTCGG CTACCTGGCG
CCCGACGGCA TGCTGATCTT CGTCGCGCGC AAGGACGACA CGATCAACGT CTCGGGCCTC
AACGTCTATC CCGGCGAGGT CGAGGACGTG GTGATGGCGA TGCCCGGCAT CACCGACGCG
GTGGCCTTCG CCCGGCCCGA TCCGTTTGCG GGCGAGCGGG TGACGCTGCT GTTCAGCGCC
GACGGTCCCG TGCCGCCCCG GGCGCTGCAG GACTGGTGCC GCCGCTGGCT CGCCGGCCAT
CAGGTGCCGG TCGAGGCGGT GCAGGTCGGC GCGATCCCGC GCGAGGCCAA CGGCAAGATT
TCTCGTCGCG CGGTCGCTGC GCAGTACCGG GACGGCGCGT TGGAGGCGGT GGCGTGA
 
Protein sequence
MPSRPEAMRP ETVEQPEARV LRQLAEAVLF EGLAEREPIR DVAGRIAWRL GSRRFRATGT 
LGPFGRPRLD SGSVEMAGEE DAWVPADLAT LVEALPAAPE HRTRLRAELR QTIELCRWNA
QNLSPPERRA LPFAALDAAL WEGHPYHPSF KARTGFTLED HRRYGPETAA PFRLEWLAVR
RDTIALALPG PEDGFWRAEL GGEGDVLASR LAAAGHSLDT HTLLPVHPWQ MRRLEEEALR
PWLTEGRAVA LGTAGPRYVA SQSLRTLHNL DDPSAASVKL ALAVVSTSSL RILDPHFVLT
GPALSDWLAG LVAGDPALQG RVTVLREYAA ALADRDGPLA GQLAAIWRES PRLVPGEAAV
PFNALAVCEA DGSPFIAPWL ERYGRDAWLD RLVTVAVLPV WHLLAGHGVA LEAHGQNMIL
VHRDGWPDRV ILRDFHESAE YAPDFVTSPE RVPDFGAIDP AHAGPADDRF HAMRSAATLA
ELVTDSLFVF NLGEITTLLK RRHGLDEAGF WRRLGQRLRH HTAEHGLEAR FTRLGVEAPQ
LRVEALLSRK LGLGEAGGSL LAPNALFPSP DALSGACMIE IDGRTIPADA MEAAIQCVAD
TAALRGGSGE RVAARFRDTA QGLAFILAAR RSGASLLPIH PALPDEGARR LAQRAGCHRL
FLDGLEGEPL DGAAPPVPGE GELLQMSSGT TGEPKCIARP WSAVEREVES YVGAFTEPDG
MTPVIACPIT HSYGLICGLF VGLRRGRVPV IVDTTNPKYL LRRLREIERP VLYTAPAMLH
TLARLMPEGE TLHAAMVSGT LLPAPWFSAI RGRVTHLFQQ YGCSEAGCIA INPDLRRADA
IGRPLPHHRV RAGTGAEAPA EIVVEGEGGA IHTADLGYLA PDGMLIFVAR KDDTINVSGL
NVYPGEVEDV VMAMPGITDA VAFARPDPFA GERVTLLFSA DGPVPPRALQ DWCRRWLAGH
QVPVEAVQVG AIPREANGKI SRRAVAAQYR DGALEAVA