Gene Mpe_A1111 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1111 
Symbol 
ID4784593 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1187903 
End bp1189306 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content67% 
IMG OID640089674 
Productargininosuccinate lyase 
Protein accessionYP_001020307 
Protein GI124266303 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0165] Argininosuccinate lyase 
TIGRFAM ID[TIGR00838] argininosuccinate lyase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.552146 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAACG ACAACCAACT CGACAAGAAA TCCCAGGCCT GGTCGGCGCT GTTCTCCGAA 
CCGATGAGCG AGCTGGTGAA GCGCTACACC GCCAGCGTCG ATTTCGACCA GCGCCTGTGG
CGGGCTGACA TCGACGGCTC GCTCGCCCAC GCCGAGATGC TGGCGGCGCA GGGCATCCTG
ACGGCTGAAG ATCACGCGGC CATCGTCCGC GGCATGGCGC AGGTCGTGGC CGAGATCGAA
TCGGGAGCGT TCGAGTGGAA GCTCGACCTC GAGGACGTGC ACCTCAACAT CGAGGCGCGC
CTGACCCAGT TGGTCGGCGA CGCCGGCAAG CGGCTGCACA CCGGGCGCTC GCGCAACGAC
CAGGTCGCGA CCGACGTGCG CCTGTGGCTG CGCGGCGAGA TTGACGCGAT CGGCGCCTTG
CTCTCGGCGC TGCAGCGTGC GCTGGTCGAC GTGGCCGAGC CGAATGCCGA AGTCATCCTG
CCGGGCTTCA CGCACCTGCA GGTGGCCCAG CCGGTGAGCT TCGGGCACCA CCTGCTGGCC
TATGTGGAGA TGTTTGCCCG CGATGCCGAG CGCCTGCTCG ACGTGCGCCG GCGCGTCAAC
CGGCTGCCGC TCGGCGCCGC CGCGCTCGCC GGCACCAGCT ACCCGCTCGA CCGCGAGCGC
GTCGCCCGCA CGCTGGGTTT CGACGGCGTG TGCCAGAACT CGCTCGACGC GGTGAGCGAC
CGCGACTTCG CGATCGAGTT CACCGCTGCC GCCTCGCTGT GCATGGTGCA CGTGTCGCGT
CTGAGTGAAG AGCTGATCCT GTGGATGAGC CAGAGCTTCG GCTTCATCGA CTTGGCCGAT
CGCTTCTGCA CCGGTTCGTC GATCATGCCG CAGAAGAAGA ACCCCGACGT GCCCGAACTG
GCACGCGGCA AGACCGGCCG CGTCGTTGGT CACCTGATGG CGCTGATCAC GCTGATGAAG
GGCCAGCCGC TGGCCTACAA CAAGGACAAC CAGGAAGACA AGGAACCACT GTTCGACACG
GTGGACACCC TGAAGGACAC GCTGCGCATC TTCGCCGAAC TGGTGGGCGG CATCAGCGTC
AAGCCCGAGG CGATGGAACG CGCCGCGCTG AAGGGCTATG CGACGGCGAC CGACCTGGCC
GACTATCTGG TGAAGAAGGG CTTGCCGTTC CGGGATGCCC ACGAAGTGGT CGCCCATGCG
GTCAAGACCG CGATCGCCCA GGGCCGCGAC CTGAGCGAGC TGCCGTTGCC GGCCCTGCAG
GCCTTCCATC CGGCGATCAC TGACGATGTC CATGCCGCGC TGACGCTGCG CGGCTCGCTC
GATGCGCGCC AGGTGCTGGG CGGCACCGCG CCGGCGCAGG TGCGATTCCA GATCGCACGG
CATCGCACAC GGCTCGGTAG CTGA
 
Protein sequence
MSNDNQLDKK SQAWSALFSE PMSELVKRYT ASVDFDQRLW RADIDGSLAH AEMLAAQGIL 
TAEDHAAIVR GMAQVVAEIE SGAFEWKLDL EDVHLNIEAR LTQLVGDAGK RLHTGRSRND
QVATDVRLWL RGEIDAIGAL LSALQRALVD VAEPNAEVIL PGFTHLQVAQ PVSFGHHLLA
YVEMFARDAE RLLDVRRRVN RLPLGAAALA GTSYPLDRER VARTLGFDGV CQNSLDAVSD
RDFAIEFTAA ASLCMVHVSR LSEELILWMS QSFGFIDLAD RFCTGSSIMP QKKNPDVPEL
ARGKTGRVVG HLMALITLMK GQPLAYNKDN QEDKEPLFDT VDTLKDTLRI FAELVGGISV
KPEAMERAAL KGYATATDLA DYLVKKGLPF RDAHEVVAHA VKTAIAQGRD LSELPLPALQ
AFHPAITDDV HAALTLRGSL DARQVLGGTA PAQVRFQIAR HRTRLGS