Gene Mpe_A3801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A3801 
Symbol 
ID4785970 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp4019223 
End bp4020731 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content74% 
IMG OID640092384 
Productcysteine proteinase 
Protein accessionYP_001022989 
Protein GI124268985 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0184346 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCGCC ACCGGGCTCG CGCAGAATCG CGGCGCATGA GAGCAAGCCG TTTCCTCCGT 
CGCGCGGCCA CGCTGTCGGC TCTGTGGCTG GCCGGCGCGG CCGCTGCGCA GGACGCGGCG
CCGACGGTCT GTCGCGTCAA AGGCCTGAAG CACGAGGTGT TGTGCGGCCA TGTGACGCGG
GCCCTGGATC CGGCGCAGCC GGGGGGCACG ACGGTCACGG TGCACTACGT GGTCGTGCCG
GCGGCGGCGC GTCACAAGCG AGCCGACCCG ATCTTCTTCT TTGCCGGCGG CCCTGGACAG
AGCGCCATCG CGCTGGCCGG TTCGGTGCTG CCGCTGTTCC AGCGCCTGAA CAACCGGCGT
GACCTGGTCT TCATCGATCA GCGCGGCACC GGGCGCTCGG CGCCCCTGGC CTGCGACGCC
GAGGACGAGC TGCCGCTGGC GCAGCGCTTC GATGCCGAGC GTGGTCGCCA GCGTCTCGCC
GCCTGCCTGG CTTCGCTGCG AAAGCTGCCG CATGGCGATC TGCGCCAGTA CACGACGAGC
ATCGCGATGG CCGATGCCGA TGCGGTGCGG GCCGCACTGG GTGCGTCGCA GATCAACCTG
GTGGGCGGCT CCTACGGCAC GCGGGCGGCG CTGGACTATC TGCGGCAGTT CCCGTCGCAT
GTGCGCCGTA TCGTGCTCGA CGGCGTCGCG CCGCCCGACA TGGTGCTGCC CGCCAGCATG
GGCCAGGATG TGGAAGCCGC GCTGGCGCGG CTTTTCACCG ACTGCGAGCA GGAGCCAAGC
TGCCAGGCGC GCCACCCGCG GCTGCGGGCG CACTGGCAGG GCCTGCTGAG CGCCGCGCCT
CGGCCAGCGA GCGTGGTCGA TCCGCTGGAT GGCCGGCCGG CCACGGTGAG GATCGATGTC
GATCTGCTGG CCAACGCGGT GCGCGGGCCG CTGTACGCGC CGGGCCTGGC CGCGGCCCTG
CCCTTCGCGA TCGACGAGGC GGCCGCCGGG CGCTACGCGG CCCTGGTCGG ATTGGCCGGG
GTGCTGGGCG GCGGGCCGCG GACGACGCGG CTGTTCGAGG GCCTGCATTT CTCGGTGGTG
TGCGCAGAGG ATGCGCCGGA CGCCGCGGCT CCGCCGCCGT CCGGGCTGGG TGCCGTGTAC
CTGCGTCCCT ATGCGGCGCT GTGCCGCGAC TGGCCGCGCG GCAGTGTGCC GCCGACTTTC
CGCGACCTGC CGACCAGCCA GGTTCCGGTG CTGGCCCTCA GCGGCACGCT CGACCCGGTG
ACGCCGCCGC GCCACGGCGA GCGGGTGGTG AAGGCGCTCG GGCCGCGGGC ACGTCATGTG
GTGGTACCGA ATGCCGGTCA CGGCGTGATG GCGATCGGCT GCACGCGCGA GCTGCTGTAC
CGCTTCATCG ACGCGGACGA CGAGGCCCAG GCCCTGGCGG TCGACGCCGG GTGCCTGGCG
CACCTGCCGC GCCCGCCGGC GTTCGAGCCG CCGCGGCCCG GACCGTCGCT GGCGGGAGCG
GCGCGATGA
 
Protein sequence
MPRHRARAES RRMRASRFLR RAATLSALWL AGAAAAQDAA PTVCRVKGLK HEVLCGHVTR 
ALDPAQPGGT TVTVHYVVVP AAARHKRADP IFFFAGGPGQ SAIALAGSVL PLFQRLNNRR
DLVFIDQRGT GRSAPLACDA EDELPLAQRF DAERGRQRLA ACLASLRKLP HGDLRQYTTS
IAMADADAVR AALGASQINL VGGSYGTRAA LDYLRQFPSH VRRIVLDGVA PPDMVLPASM
GQDVEAALAR LFTDCEQEPS CQARHPRLRA HWQGLLSAAP RPASVVDPLD GRPATVRIDV
DLLANAVRGP LYAPGLAAAL PFAIDEAAAG RYAALVGLAG VLGGGPRTTR LFEGLHFSVV
CAEDAPDAAA PPPSGLGAVY LRPYAALCRD WPRGSVPPTF RDLPTSQVPV LALSGTLDPV
TPPRHGERVV KALGPRARHV VVPNAGHGVM AIGCTRELLY RFIDADDEAQ ALAVDAGCLA
HLPRPPAFEP PRPGPSLAGA AR