Gene Information |
Locus tag | Mpe_A0923 |
Symbol | |
ID | 4787305 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 979043 |
End bp | 980188 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640089484 |
Product | ABC-type nitrate/sulfonate/bicarbonate transport systems periplasmic component-like protein |
Protein accession | YP_001020120 |
Protein GI | 124266116 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
|
|
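The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for this illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for locus tag Mpe_A0923.
length_bp = gene_length(979043, 980188)
print(length_bp)                  # 1146
print(protein_length(length_bp))  # 381
```

Both results agree with the Gene Length (1146 bp) and Protein Length (381 aa) fields of the record.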
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0258844 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGTTCT GGATGAATCA GCGTGTACGC GCGGCCGCCG CAGCGATGGC GGTGGCTTGC GGCCTCGGCA TCGGGGCTCC TTCGACGGCG CTCGCCGCCG ATCTCGACTA CGGCAAGCCG GGTGATCCGG TCCAACTGGT GATCGGCTAC CAGCCGTACT ACACCCAGTC GTGGTCCGGC GTGGTCATGC GGGGCAAGAA GTTCTACGAG AAGTACCTGC CCAAGGGCTC CACGGTCGAC TTCTCGATCG GCCTGCAGGG CGCGGTGATC GTCAACGCCA TGCTGGCCGG CAAGCAGCAC ATCGGCTACA TGGGCGACAT GCCGGCGATC GTCTCGACCA CCAAGGAATC GGTGGCGGAC ATCCGCATCG TCGCGACGCT CGGCGTCGGC TTCGACCAGT GCAACATCCT GCTGGCTCGC AACGACGCGC CCAAGTTCGG CAACGGCAAG GAAGGCGTCA AGTGGCTCGA GGGCAAGCGC ATCGGCATCC CGCTGGGCAG CTGCGCCGAC CGTTTCGCGA AGGAGGCCTT CCGCAAGGAG GGGGTCGCGC CGGCGGCCAT AATGAACCAG AACATCGAGG TCATCACCAG CGGCTTCCGC GCCGGCAAGC TCGATGCCGC GGCGATCTGG GAGCCCACCG CCTCGCGGCT GGTGGAGGAG GGGCTGGCGC GGCGCATCGC CAGCGGCGCG ACGGTCAACG AGAAGGACGC CGGCTTCCTG GCCATGCGGG CCGACCTGAT CAAGCAGCGT CCCGACGTCG CCAAGGCCTG GCTGAACGCC GAGCTCGACG CCCAGCTGTT CCTCGCCGAC CCGAAGAACG CGATGGAGGT GGCGGCGATG GCCGCGCAGC AGGCGACCGG CTTCACCGAG AAGATGCTGT GGCACTCCCT CTACGGCCAG TACCCGGCCG AGATCGGCGG GATCCCGGTG CGCATGCAGA TGCCCTTCAC GCTGACGCCC GACGTGGTGG CGCAGATCAA CCAGTCCGCG GCCTTCCTGT TCTCCATCAA GAGCATCAAC GTCGAGAAGC TGCGCGCCGA CGCGCTGATG AACGACATGG CGGCGCAGGT GCTCAAGGAG CGCAACCTGA GTTCGCCGAT CGGTGAGGTC AAGGCCATGC CCGACAGCGA GTACGGCAAG AAGTAG
|
Protein sequence | MTFWMNQRVR AAAAAMAVAC GLGIGAPSTA LAADLDYGKP GDPVQLVIGY QPYYTQSWSG VVMRGKKFYE KYLPKGSTVD FSIGLQGAVI VNAMLAGKQH IGYMGDMPAI VSTTKESVAD IRIVATLGVG FDQCNILLAR NDAPKFGNGK EGVKWLEGKR IGIPLGSCAD RFAKEAFRKE GVAPAAIMNQ NIEVITSGFR AGKLDAAAIW EPTASRLVEE GLARRIASGA TVNEKDAGFL AMRADLIKQR PDVAKAWLNA ELDAQLFLAD PKNAMEVAAM AAQQATGFTE KMLWHSLYGQ YPAEIGGIPV RMQMPFTLTP DVVAQINQSA AFLFSIKSIN VEKLRADALM NDMAAQVLKE RNLSSPIGEV KAMPDSEYGK K