Gene Mpe_A3035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A3035 
Symbol 
ID4784957 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3225602 
End bp3226942 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content68% 
IMG OID640091606 
Productargininosuccinate synthase 
Protein accessionYP_001022223 
Protein GI124268219 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0137] Argininosuccinate synthase 
TIGRFAM ID[TIGR00032] argininosuccinate synthase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACCA TCCTCCAGCA CCTTCCCACC GGCCAGAAGG TCGGCATCGC ATTCTCCGGC 
GGACTGGACA CCAGCGCCGC GCTGCACTGG ATGAAGCTCA AAGGCGCCCT GCCCTACGCC
TACACGGCCC ACCTCGGCCA GCCCGACGAG CCCGACTACG ACGAGATCCC GCGCAAGGCG
ATGCAGTACG GCGCCGAGAA GGCACGCCTG ATCGACTGCC GGGCGCAGCT GGTCGCCGAA
GGCCTGGCCG CGCTGCAGGC CGGTGCCTTC CACATCTCGA CCGCCGGGGT GACCTACTTC
AACACCACGC CGATCGGGCG CGCGGTCACC GGCACGATGC TGGTCGCGGC CATGAAGGAG
GACGACGTCC ACATCTGGGG CGACGGCAGC ACCTTCAAGG GCAACGACAT CGAGCGCTTC
TACCGCTACG GCCTGCTCAC CAACCCGGCG CTGAAGATCT ACAAGCCCTG GCTCGACCAG
ACCTTCATCG ACGAGCTCGG CGGCCGTGCC GAGATGTCGG CCTTCATGAC GCAGGCCGGC
TTCGGCTACA AGATGAGCGC CGAGAAGGCC TACTCGACCG ACTCCAACCT GCTCGGCGCC
ACGCACGAGG CCAAGGATCT GGAGCACCTG AGCAGCGGCA TCCGCATCGT CAACCCGATC
ATGGGCGTGG CGTTCTGGCG CGACGAGGTC GAGGTGAAGC GCGAGGAGGT GACCGTGCGC
TTCGAAGAGG GCCGGCCGGT CGCGCTGAAC GGCATCGAGT TCGCCGACCC GGTGGCGCTG
CTGCTGGAGG CCAACCGCAT CGGGGGCCGC CACGGGCTGG GCATGAGCGA CCAGATCGAG
AACCGCATCA TCGAGGCCAA GAGCCGCGGC ATCTACGAGG CCCCGGGTCT GGCGCTGCTG
CACATCGCCT ACGAGCGCCT CGTGACGGGC ATCCACAATG AAGACACGAT CGAGCAGTAC
CGTGACAACG GCCGCAAGCT CGGCCGCCTG CTCTACCAGG GCCGCTGGTT CGACCCGCAG
GCCATCATGC TGCGCGAGAC CGCGCAGCGC TGGGTGGCGC GCGCCGTGAC TGGCGAGGTC
GCGCTCGAGC TGCGCCGCGG CAACGACTAC TCGATCCTCG ACACGCGCTC CCCCAACCTC
ACCTACCAGC CCGAGCGGCT GTCGATGGAG AAGGTGGAGG ATGCCCCGTT CTCGCCGGCC
GACCGCATCG GCCAGCTGAC GATGCGCAAC CTCGACATCG TCGACACCCG CGCCAAGCTC
GGCATCTACG CGAAGAGCGG GCTGCTGTCG CTGGGCAGCG GCGCGGCGCT GGCGCGCCTG
CAGAACGACG ACCCGTCCTG A
 
Protein sequence
MATILQHLPT GQKVGIAFSG GLDTSAALHW MKLKGALPYA YTAHLGQPDE PDYDEIPRKA 
MQYGAEKARL IDCRAQLVAE GLAALQAGAF HISTAGVTYF NTTPIGRAVT GTMLVAAMKE
DDVHIWGDGS TFKGNDIERF YRYGLLTNPA LKIYKPWLDQ TFIDELGGRA EMSAFMTQAG
FGYKMSAEKA YSTDSNLLGA THEAKDLEHL SSGIRIVNPI MGVAFWRDEV EVKREEVTVR
FEEGRPVALN GIEFADPVAL LLEANRIGGR HGLGMSDQIE NRIIEAKSRG IYEAPGLALL
HIAYERLVTG IHNEDTIEQY RDNGRKLGRL LYQGRWFDPQ AIMLRETAQR WVARAVTGEV
ALELRRGNDY SILDTRSPNL TYQPERLSME KVEDAPFSPA DRIGQLTMRN LDIVDTRAKL
GIYAKSGLLS LGSGAALARL QNDDPS