Gene Mpe_A1085 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1085 
Symbol 
ID4783688 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1158651 
End bp1159820 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content71% 
IMG OID640089647 
Productputative transport system ATP-binding protein 
Protein accessionYP_001020281 
Protein GI124266277 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3842] ABC-type spermidine/putrescine transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.224743 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.500866 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTCAAGG CGCGGCAGCG GGGCGAATAC GGGAATGCGA ACGAAATTCA TTTGAAGTCA 
GCTCGCTATC ATAGAGGGAT GCTGCTGCGT CTCGCCGATG TCTCGATCCG CTACCCGGCG
GCGCACCCGG GCGGCGGCGA ACGGCCGGCG GTCGACGGCG TGTCCTTCGG TCTGGCCGAG
GGCGAGATCG GTGTGCTGAT CGGCCCGTCG GGCTGCGGCA AGACTTCGCT GCTGCGTGCC
GTCGCCGGCC TGGAGCCGCT GGCCGGCGGC AGCATCTCGC TGGGCGACGA ACGGCTCGGC
GACGCGGCCA CCGGCCGCCA CCTCGCACCG GAGCAGCGCC GCATCGGCAT GGTGTTTCAG
GACTATGCGC TGTTCCCCCA TCTCAGCGTG GCGCAGAACG TGGCCTTCGG CCTGCACGAC
CTGCCGCGGG CGCAGCGCGA GCAACGCGTG GCCGAATTGC TCGACCTGGT GGGCCTGGGC
AGTGCCGCGA AGCGCGCGCC GCACCAGCTG TCGGGCGGGC AGCAGCAGCG CATCGCCCTG
GCCCGGGCCC TGGCGCCGGC GCCGCGGCTG CTGCTGCTGG ACGAACCCTT CTCCAGCCTC
GACGTCGATC TGCGCGAACG CCTGGCGCAG GAGCTGCGCG CGATTCTCAA GCGGACCGGC
ACGACGGCGC TCTTCGTCAC CCACGATCAG CTCGAGGCCT TCGCGCTGGG TGACGTGATC
GGCGTGATGC ACCGGGGCCG CCTCGACCAG TGGGACAGCG CCTACCAGCT CTACCACCGT
CCGGCGACAC GCTTCGTGGC CGACTTCATC GGGCACGGCG TGTTCGCGCC GGCCCGCATC
GTGGACGGCG CCGACGGTCC GCGTGTGCAC ACCCCCGTGG GCGACCTGAG CGATCTCGAG
GAATGCCCGT TGTCCGCCGC CTACCCGGGT GGCGAGTGCG AGGTGCTGCT GCGGGCGGAC
GACATCGTCC ACGACGACAG CTCGCCGGTG AAGGCCCGCA TCGAACGCAA GGCCTTCCGC
GGCTCCGAGT TCCTGTACAC GCTGCAGCTC GCCAGCGGCG AACGCGTGCT CGCCCACGTG
CCCTCGCACC ACGATCACCA GCTCGGCGAA TGGATCGGCA TCCGCGCCGA GGTCGACCAC
GTGGTCACCT TCCCACGCCA GCCGAGCTGA
 
Protein sequence
MLKARQRGEY GNANEIHLKS ARYHRGMLLR LADVSIRYPA AHPGGGERPA VDGVSFGLAE 
GEIGVLIGPS GCGKTSLLRA VAGLEPLAGG SISLGDERLG DAATGRHLAP EQRRIGMVFQ
DYALFPHLSV AQNVAFGLHD LPRAQREQRV AELLDLVGLG SAAKRAPHQL SGGQQQRIAL
ARALAPAPRL LLLDEPFSSL DVDLRERLAQ ELRAILKRTG TTALFVTHDQ LEAFALGDVI
GVMHRGRLDQ WDSAYQLYHR PATRFVADFI GHGVFAPARI VDGADGPRVH TPVGDLSDLE
ECPLSAAYPG GECEVLLRAD DIVHDDSSPV KARIERKAFR GSEFLYTLQL ASGERVLAHV
PSHHDHQLGE WIGIRAEVDH VVTFPRQPS