Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A3122 |
Symbol | nikA |
ID | 4786635 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 3322997 |
End bp | 3324238 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640091693 |
Product | periplasmic-binding protein |
Protein accession | YP_001022310 |
Protein GI | 124268306 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3203] Outer membrane protein (porin) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00833115 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0608186 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACCAAT TAAGTGCATC AAAGTTGTCG TCCCGCAACA GAATAAGCAT AGAACAAGCT CAGTGCTGGC GGGGGGCGCA ACGGTCTGGT GGAATCGCTC CTCAGTTGGG ACGGCGAGCA GCTTCGGGTG TCGCACACAG CATCCGAGGT TCCGGCTTCA CCGAAACCAC ATGTTCAACT CTGGAGATAG TTGCAATGAA AAAATCAGTA CTCGCTCTGG CCGCTCTGGG TGCTTTCGCC GGTGCCGCTT CGGCGCAATC GTCGGTCACC CTGTACGGCC GTCTGGACAC TGCTGTCACC TGGACTGACA GCACGATCGC CGAAGACCGC TTCACGCTGA ACAACCACCA GCCGATCGGT GGTTCGCGCT GGGGCCTGAA GGGCAGCGAA GACCTGGGTG GTGGCCTGAA GGCCAACTTC ACGCTGGAGT CGGGCTTCAA CTCCGACGAC GGCACTGGTA ATGCCAAGCT ATTCGACCGC GCCGCCTGGG TGGGCCTGAG CTCGGCCAGC CTCGGCGAAA TCCGCCTCGG TCGTCACGAC ACGCTGACCC GTCAGTTGAA CCTCGGCTAC GGCTCGGACC TGACCGCTGA AGGCGAAATC ACGGTCGTGG ACGGTAATTT TGCAGCCGGC ACGGCTCTTG CCCCGACCGG TCGCGTTCTG TTCCAGAACT TCGGCACCCG CGTCGACAAC TCGGTCGTCT ACCTGTCGCC GAGCTTCGGT GGCTTCCAGG TGCGCGCGCT GGTCGCCGCT GGCGAAGGCG CCACGGCTCG CCAGCAAGGT CTGTTGCTGG GTTATGCGGC TGGCCCGATC AAGGCAGGTC TGTCGTACGA AGAGTACGAC GACGCCCCGG GCGGCGGTGG CAGCGCCTAC AACAAGGTGT TCACCGCGGG CGGCAGCTAC AACTTCGGCG TCGCGACGCT GGGCCTGGGC TATCAGAAGA CCAGCGACTT CGGCTCGAAC GCTGGCGAGT CGGTTGTGAT CAATGATGTC GATGCCTACA ACGTCGGCGT GCTCGTGCCG TTCGGCAGCT TCGAGTTCCG TGCCCAGTAC ACGCACTCGA AGGCTGATCT GGATGCGGGT GGCAGCAACA AGAACGACAA GTACGGCGCT TCGCTCCGTT ACGCGCTGTC GAAGCGGACC ACGATCTACA GTGCCTACCT GCACCGCGAG TCGGACAACG ACGACACGTT CAACCTGACT GGCAAGGACC AGTTCCTGGT CGGTATCGGC CACAACTTCT GA
|
Protein sequence | MHQLSASKLS SRNRISIEQA QCWRGAQRSG GIAPQLGRRA ASGVAHSIRG SGFTETTCST LEIVAMKKSV LALAALGAFA GAASAQSSVT LYGRLDTAVT WTDSTIAEDR FTLNNHQPIG GSRWGLKGSE DLGGGLKANF TLESGFNSDD GTGNAKLFDR AAWVGLSSAS LGEIRLGRHD TLTRQLNLGY GSDLTAEGEI TVVDGNFAAG TALAPTGRVL FQNFGTRVDN SVVYLSPSFG GFQVRALVAA GEGATARQQG LLLGYAAGPI KAGLSYEEYD DAPGGGGSAY NKVFTAGGSY NFGVATLGLG YQKTSDFGSN AGESVVINDV DAYNVGVLVP FGSFEFRAQY THSKADLDAG GSNKNDKYGA SLRYALSKRT TIYSAYLHRE SDNDDTFNLT GKDQFLVGIG HNF
|
| |