Gene Mpe_A3401 details

Gene Information

Locus tag: Mpe_A3401
Symbol:
ID: 4786331
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 3615574
End bp: 3617127
Gene Length: 1554 bp
Protein Length: 517 aa
Translation table: 11
GC content: 69%
IMG OID: 640091977
Product: AMP nucleosidase
Protein accession: YP_001022589
Protein GI: 124268585
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0775] Nucleoside phosphorylase
TIGRFAM ID: [TIGR01717] AMP nucleosidase
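
The start, end, and length fields listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (which matches the listed gene length) and that the last codon of the CDS is a stop codon. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 3615574
END_BP = 3617127


def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1554, matches "Gene Length"
print(protein_length(length_bp))  # 517, matches "Protein Length"
```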


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0546432
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGACGC CCTCTCTTCC AGACGATGGC CCCCGGCGCT ACACCGACGC GCAGGCCGCC 
CTCGACGCCG CGCGCAGCCT GTACGACGCC AGTCTGGTGC GACTGCGCGA CCACCTGCAG
CGCTTCCTGG CCGGCGAGGA CTTCCCTCAG CGCGTGCGCG CCTGCTACCC CCGCGTGGCC
GTCCACATCG ACACCGTGGC GCGGGCCGAC ACGCCGCTGG CCTACGGCTT CGTCGCCGGC
CCGGGCCGCT ACGAGACCAC GCTGACGCGG CCCGATCTGT TCGGGGACTA CTACCTCGAG
CAGTTCCGCC TGCTGCTGCG CAACCACGGT GTCGCGCTGG AGATCGGCAG CAGCACGCAG
CCGATCCCGG TGCACTTCTC GTTCGCCGAG CACGACCACG TGGAAGGCAC GCTGACGCCC
GAGCGGCGCA CGCTGATGCG CGACCTGTTC GATCTGCCCG ATCTCGGCGC CATGGACGAC
GGCATCGCCA ACGGCACGCA CGAACCGACG CCAGACGCCT CCGGCGCGGC GACCCACCCA
CTGGCGCTGT TCACCGCAGC GCGCGTCGAC TACTCGCTGC ATCGGCTGCG CCACTACACC
GGCACCACGC CAGAGCACTT CCAGAACTTC GTGCTGTTCA CGAACTACCA GTTCTACATC
GACGAGTTCA TCAAGCTCGG TCACGAGCTG ATGCACCTGC CGCGCGGCCA GGCCTCACTG
TTCGAGGGCG GCAGCGGCGG TGGCCAGGAC GACGGCTATG TGGCCTTCGT CGAGCCCGGC
AACGTGGTGA TGCGGCGCAC CGGCTGCACG CTGGAGCCGG GCGACTTCCT CGGCGCGCCG
CCGCCGCGGC TGCCGCAGAT GCCTGCCTAC CACCTGGTGC GCCACGATCG TGCCGGCATC
ACCATGGTGA ACATCGGCGT CGGCCCCAGC AACGCCAAGA CCATCACCGA CCACATCGCC
GTGCTGCGCC CGCATGCCTG GATCATGCTG GGCCATTGCG CCGGCCTGCG CACCACGCAG
CAGCTCGGCG ACTACGTGCT CGCGCACGGC TACGTGCGCG AGGACCATGT GCTCGACGAG
GAGCTGCCGC TGTGGGTGCC GATCCCGCCG CTGGCGGAGA TCCAGGTGGC GCTGGAAGCC
GCGGTCGCCG ATGTGACCCA GCTCGAGCGC AGCGAACTCA AGCGCGTGAT GCGGACCGGC
ACCGTCGCCA GCACCGACAA CCGCAACTGG GAGCTGCTGC CCTTCCACCA CAGCCACAGC
ACGCCGGAAC GCCGCTTCAG CCAGAGCCGC GCGATCGCGC TCGACATGGA GAGCGCCACC
ATCGCCGCCA ACGGCTTTCG TTTCCGCGTG CCCTACGGCA CGCTGCTGTG CGTGAGCGAC
AAGCCGCTGC ACGGCGAGAT CAAGCTGCCC GGCATGGCCA ATCACTTCTA CCGCGAGCGC
GTGAACCAGC ACCTGCGCAT CGGGCTGCGG GCGATCGAAC TGCTGCGGCG CAACGGCATC
GACCAGCTGC ACAGCCGCAA GCTGAGGAGC TTTGCGGAGG TGGCGTTCCA GTAG
 
Protein sequence
MTTPSLPDDG PRRYTDAQAA LDAARSLYDA SLVRLRDHLQ RFLAGEDFPQ RVRACYPRVA 
VHIDTVARAD TPLAYGFVAG PGRYETTLTR PDLFGDYYLE QFRLLLRNHG VALEIGSSTQ
PIPVHFSFAE HDHVEGTLTP ERRTLMRDLF DLPDLGAMDD GIANGTHEPT PDASGAATHP
LALFTAARVD YSLHRLRHYT GTTPEHFQNF VLFTNYQFYI DEFIKLGHEL MHLPRGQASL
FEGGSGGGQD DGYVAFVEPG NVVMRRTGCT LEPGDFLGAP PPRLPQMPAY HLVRHDRAGI
TMVNIGVGPS NAKTITDHIA VLRPHAWIML GHCAGLRTTQ QLGDYVLAHG YVREDHVLDE
ELPLWVPIPP LAEIQVALEA AVADVTQLER SELKRVMRTG TVASTDNRNW ELLPFHHSHS
TPERRFSQSR AIALDMESAT IAANGFRFRV PYGTLLCVSD KPLHGEIKLP GMANHFYRER
VNQHLRIGLR AIELLRRNGI DQLHSRKLRS FAEVAFQ
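
The gene and protein sequences above can be checked against each other by translating the DNA codon by codon. The following is a minimal sketch, assuming the standard codon assignments, which translation table 11 shares with table 1 for elongation (the two tables differ in which start codons are permitted; this gene starts with ATG, so that difference does not matter here). The names `CODON_TABLE` and `translate` are illustrative.

```python
# Codon table built from the conventional TCAG ordering.
# Assumption: standard amino acid assignments, as used by table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    Characters other than A, C, G, T raise a KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGACGACGC CCTCTCTTCC AGACGATGGC"))  # MTTPSLPDDG
# Last 9 bases of the gene sequence above, ending in the TAG stop codon.
print(translate("TTCCA GTAG"))  # FQ
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing spaces and line breaks from both) extends the same check to the whole record.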