Gene Mpe_A3122 details


Gene Information

Locus tag: Mpe_A3122
Symbol: nikA
ID: 4786635
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 3322997
End bp: 3324238
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 62%
IMG OID: 640091693
Product: periplasmic-binding protein
Protein accession: YP_001022310
Protein GI: 124268306
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3203] Outer membrane protein (porin)
TIGRFAM ID:
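
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3322997
end_bp = 3324238
gene_length_bp = 1242
protein_length_aa = 413


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions, the 1242 bp span corresponds to 414 codons, one of which is the stop codon, which agrees with the listed 413 aa.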


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00833115
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0608186
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACCAAT TAAGTGCATC AAAGTTGTCG TCCCGCAACA GAATAAGCAT AGAACAAGCT 
CAGTGCTGGC GGGGGGCGCA ACGGTCTGGT GGAATCGCTC CTCAGTTGGG ACGGCGAGCA
GCTTCGGGTG TCGCACACAG CATCCGAGGT TCCGGCTTCA CCGAAACCAC ATGTTCAACT
CTGGAGATAG TTGCAATGAA AAAATCAGTA CTCGCTCTGG CCGCTCTGGG TGCTTTCGCC
GGTGCCGCTT CGGCGCAATC GTCGGTCACC CTGTACGGCC GTCTGGACAC TGCTGTCACC
TGGACTGACA GCACGATCGC CGAAGACCGC TTCACGCTGA ACAACCACCA GCCGATCGGT
GGTTCGCGCT GGGGCCTGAA GGGCAGCGAA GACCTGGGTG GTGGCCTGAA GGCCAACTTC
ACGCTGGAGT CGGGCTTCAA CTCCGACGAC GGCACTGGTA ATGCCAAGCT ATTCGACCGC
GCCGCCTGGG TGGGCCTGAG CTCGGCCAGC CTCGGCGAAA TCCGCCTCGG TCGTCACGAC
ACGCTGACCC GTCAGTTGAA CCTCGGCTAC GGCTCGGACC TGACCGCTGA AGGCGAAATC
ACGGTCGTGG ACGGTAATTT TGCAGCCGGC ACGGCTCTTG CCCCGACCGG TCGCGTTCTG
TTCCAGAACT TCGGCACCCG CGTCGACAAC TCGGTCGTCT ACCTGTCGCC GAGCTTCGGT
GGCTTCCAGG TGCGCGCGCT GGTCGCCGCT GGCGAAGGCG CCACGGCTCG CCAGCAAGGT
CTGTTGCTGG GTTATGCGGC TGGCCCGATC AAGGCAGGTC TGTCGTACGA AGAGTACGAC
GACGCCCCGG GCGGCGGTGG CAGCGCCTAC AACAAGGTGT TCACCGCGGG CGGCAGCTAC
AACTTCGGCG TCGCGACGCT GGGCCTGGGC TATCAGAAGA CCAGCGACTT CGGCTCGAAC
GCTGGCGAGT CGGTTGTGAT CAATGATGTC GATGCCTACA ACGTCGGCGT GCTCGTGCCG
TTCGGCAGCT TCGAGTTCCG TGCCCAGTAC ACGCACTCGA AGGCTGATCT GGATGCGGGT
GGCAGCAACA AGAACGACAA GTACGGCGCT TCGCTCCGTT ACGCGCTGTC GAAGCGGACC
ACGATCTACA GTGCCTACCT GCACCGCGAG TCGGACAACG ACGACACGTT CAACCTGACT
GGCAAGGACC AGTTCCTGGT CGGTATCGGC CACAACTTCT GA
 
Protein sequence
MHQLSASKLS SRNRISIEQA QCWRGAQRSG GIAPQLGRRA ASGVAHSIRG SGFTETTCST 
LEIVAMKKSV LALAALGAFA GAASAQSSVT LYGRLDTAVT WTDSTIAEDR FTLNNHQPIG
GSRWGLKGSE DLGGGLKANF TLESGFNSDD GTGNAKLFDR AAWVGLSSAS LGEIRLGRHD
TLTRQLNLGY GSDLTAEGEI TVVDGNFAAG TALAPTGRVL FQNFGTRVDN SVVYLSPSFG
GFQVRALVAA GEGATARQQG LLLGYAAGPI KAGLSYEEYD DAPGGGGSAY NKVFTAGGSY
NFGVATLGLG YQKTSDFGSN AGESVVINDV DAYNVGVLVP FGSFEFRAQY THSKADLDAG
GSNKNDKYGA SLRYALSKRT TIYSAYLHRE SDNDDTFNLT GKDQFLVGIG HNF
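
The relationship between the two sequence blocks above, and the GC content field, can be explored with a short script. This is a minimal sketch using only the Python standard library; `clean`, `gc_content`, and `translate` are hypothetical helper names. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in their permitted start codons), and it translates only the first 30 bases of the gene sequence as a demonstration.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGCACCAAT TAAGTGCATC AAAGTTGTCG"
print(translate(first_blocks))  # MHQLSASKLS
```

The printed peptide matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_content` gives a value to compare against the listed GC content.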