Gene Mpe_A3475 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A3475 
Symbol 
ID4786293 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3686344 
End bp3687372 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content73% 
IMG OID640092055 
Producthypothetical protein 
Protein accessionYP_001022663 
Protein GI124268659 
COG category[R] General function prediction only 
COG ID[COG2144] Selenophosphate synthetase-related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.451422 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGGATG CAATGCTGGG ACAGGGGGGG CTCGACGGGC TCGCCTCGGC GTTGCTGCGC 
GGGCGCGGCT TCGCACACAA GCGCGACATC AGCGACGTGG TGTCGGCGCT GTCGGCCGCG
CTGCCGGGCG GCACCGCCGC GCTGGGCCAG GCGGTGGGCG TCGGCGACGA CTGTGCGGCG
ATCCCCGACG GCGACGGCGG CTACCTGCTG TTCGCGATCG AGGGTTTCGT CGACGACTTC
GTGCAGCGCA TGCCCTGGTT CGCCGGCTAT TGCGGCGTGA TGGTCAACGT GAGCGACATC
TGCGCGATGG GCGGGCGGCC GATCGCCGTG GTCGACGCGC TGTGGAGCCG CGGCATGGCG
CCGGGCCAGC AGGTGCTCGA GGGCCTGGCG GCCGCCTCGC AGCGCTACGG CGTGCCGATC
GTCGGCGGCC ACAGCAACAA CCAGGCGGTC GGCGGCCAGC TCGCGGTGGC GATCCTCGGC
CGAGCGAAGA CGCTGCTGAC GAGCTTCAAC GCCCGCCCCG GCGACACGCT GGTGATGGCG
ATCGACCTGC GCGGCGCCTA CCAGGAGCCC AACCCCTACT GGGACGCGTC GACCCGCGCG
CCGGCCGAGC GCCTGCGCGC CGACCTCGAG CTGCTGCCGG CGCTGGCCGA GAGCGGCCTG
TGCGATGCGG CCAAGGACAT CAGCATGGCC GGCGCCGTGG GCACGGCGCT GATGCTGCTG
GAGTGCTCGC AGGTCGGCGG CGTGATCGAC GTGCAGGCGA TCCCGCGCCC GCCCGGCGTG
CCGCTGCTGC GCTGGCTGCA GTCCTTTCCG AGCTACGGCT ACGTGTTCAG CGTGCGGCCG
GCGCAGGCAG CCGCGGTGGC GCGGCATTTC GAGTCGCAGG GCATCGCCTG TGCCGCGGTC
GGCGAGGTCA CGGCGACCCC GCAGCTGCAT CTGCGCGACG GCGAGACGAG CGCGCTGCTG
TGGGACCTGG TGGCGCAACC CTTCATCGGC GCGCGAGCCG TCGTGCCGCG GGAGCCGGCC
CATGTCTAG
 
Protein sequence
MEDAMLGQGG LDGLASALLR GRGFAHKRDI SDVVSALSAA LPGGTAALGQ AVGVGDDCAA 
IPDGDGGYLL FAIEGFVDDF VQRMPWFAGY CGVMVNVSDI CAMGGRPIAV VDALWSRGMA
PGQQVLEGLA AASQRYGVPI VGGHSNNQAV GGQLAVAILG RAKTLLTSFN ARPGDTLVMA
IDLRGAYQEP NPYWDASTRA PAERLRADLE LLPALAESGL CDAAKDISMA GAVGTALMLL
ECSQVGGVID VQAIPRPPGV PLLRWLQSFP SYGYVFSVRP AQAAAVARHF ESQGIACAAV
GEVTATPQLH LRDGETSALL WDLVAQPFIG ARAVVPREPA HV