Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A2319 |
Symbol | |
ID | 4783836 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 2486606 |
End bp | 2489395 |
Gene Length | 2790 bp |
Protein Length | 929 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640090888 |
Product | assimilatory nitrate reductase (NADH) alpha subunit apoprotein |
Protein accession | YP_001021510 |
Protein GI | 124267506 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00502256 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGAGA CCCGATCGAC CTGTCCTTAC TGCGGCGTCG GCTGCGGCGT GATCATCGAG TCGCAGGGCG CGCAGATCAC CGGCGTGCGC GGCGATCCGG AGCACCCGGC GAACTTCGGC CGCCTGTGCA CCAAGGGCAG CACGCTGCAT CTCACCGCCT CGGCCGCCAT CACGCAGCAG ACGCGCCTGC TGCAGCCGCT GCGGCGCGCC GCGCGCGGCC AGGCTGCTCA GCCGGTGGGC TGGGACGCCG CGCTCGACGA GGCCGCCGAG CGCTTCGCGG CGGTGATCGC GCAGCACGGG CCGGACGCAG TGGGCTTCTA CCTGTCGGGC CAGTTGCTGA CCGAGGATTA CTACGTCTTC AACAAGATCG CCAAGGGCCT GATCGGCACC AACAACGTCG ACACCAACTC ACGCCTGTGC ATGAGCAGCG CGGTGGCCGG CTACAAGGCC ACGCTCGGCG CCGACGCGCC GCCGGCCTGC TACGACGACC TGAAGCACGC CAGCACGATC TTCATCGCCG GCAGCAACAC CGCCTGGGCC CACCCCATCC TGTTCCGCCG GCTCGAGGAC GCGCGCGCCG CCAACCCGGC GATGAAGCTG ATCGTCGCCG ACCCGCGCCG CACCGAGACC GCCGAGGTCG CCGACCTGCA CCTGCCGCTG CTGCCCGGCA CCGACGTGGC ACTGTTCCAC GGCTTGCTGC ATCTGCTGAT GTGGGAGGGC CTGACCGACA CCGCCTACAT CGCCACCCAC ACCACCGGCT TCGAGGCGCT GCGCGACCGG GTGCGCGAGT TCACGCCCAA GCACGTGGCC CAGGTCTGCG GGCTCGACGA GTCCGCCATC GTGCAGGCGG CGCGCTGGTT CGGCCAGAAC ACGCCGACGC TGAGCCTGTA CTGCCAGGGC CTGAACCAGA GCTCGAGCGG CACCGACAAG AACGCCGCGC TGATCAACCT GCACCTCGCC ACCGGGCAGA TCGGCAAACC CGGCGCCGGC CCGTTCTCGC TGACCGGCCA GCCCAATGCG ATGGGCGGGC GCGAGGTCGG CGGCCTGGCC AACCTGCTGA GCGCACACCG CGACCTCGCC AACCCGCAGC ACCGCGCCGA GGTCGCCGCG CTGTGGGGCG TGCCGGACGT GCCCGCCGCA CCAGGCAAGA GTGCGGTGGA GATGTTCCAG GCCGCCGCCG ACGGCGAGAT CAAGGCGCTG TGGATCAGCT GCACCAACCC CGCGCAGTCG ATGCCCGACC AGACGACGGT GCGCCGCGCC CTCGAACGCT GCGAGTTCGT CGTGTTGCAG GAAGCCTTCG CCACCACCGC CACCGCGGCC TATGCCGACC TGCTGCTGCC GGCCACGACC TGGGGCGAGA AGGACGGCAC CGTCACCAAC AGCGAACGCC GCATCAGCCG CGTGCGCCCG GCCGTTGCGG CGCCCGGCGA GGCCCGCCAC GATTGGGCCG CCGCGGTCGA CTTCGCACGC CGGCTCGAAG CGAAGCTGCC GCGCCGCGCC GCGAACGGCG GCTCGCTGTT CCCGTATGCC GGCCCCGAGT CGGTGTGGAA CGAACACCGC GAGAGCACGC GCGGACGCGA CCTCGACATC ACCGGCCTCA GCTACGCCCG CCTCGACCAG GGCCCTCAGC TGTGGCCCTG CCCGCAAGGT CAGGCCGAAG GCCGCGCCCG GCTCTACGAG GACGGCGTGT TCGCCACCCC CGACGGCCGA GCCCGCTTCG CGGCGATGGC CTATCGGCCG GTGGCGGAGC CGCGCGACGC GCGCTACCCG GTCTCGCTGA CCACCGGCCG GCTGCGCGAT CACTGGCACG GCATGAGCCG CACCGGCACG CTGGGCCGGC TGTTCGGCCA CGTCGCGGAG CCCGGCGTCG AGATCCATCC GCGGGAGCTG GTGCGTCGCG GCTGGGAGGA AGGCGCGCTG GTGCACGTGA CCTCGCGCCG CGGCTCGGTG ATGCTGCCGG TGCGGGCCAG CGATTCGGTC GCGCCGGCGC AGGCCTTCGT GGCGATGCAC TGGGGCGAGG AATTCGTCTC CGGCCTGTCG CCGTCGGGCG TGCGGCTGGC CGGCATCAAC ACGCTGATGC CCAGCGCCTA CTGTCCGCAG TCGAAGCAGC CCGAGCTCAA GCACGCGGCG GTGAAGCTGC TGAAGGCCGA GATGCCCTGG CAACTGCTCG CCGCTGCCTG GCTGCCCGAG GACCAGGCCC TGCTGGCGCA GCAGCGGCTG CGCGACCTGA TGCCCTCGTT CTCCTACGCG CACTGCGTGC CCTTCGGGCG CGAGCCGCAT GCGCGCGGCG TGGTCGGCGT GCTGTTCCGG GCCGCCGCCG AGGAGCCGGC GGCCGACGAA CTGATGGCGC AGATCGAGAG CCTGCTCGGC GTGGCGGGCC CGCAGGCGCT GCGCTATGCC GACCCCAAGC GCGGCCAGCG TCGTGCGATG CGGCTGGTGG CCGGCGACGG CGGCAACGCG AGGCTCGACG CCTTCGTGCT CGGCGGCGAC ATCCGCGCAG AGGCCTGGAT CAAGGCGCTG CTGACCGACG AACTGCCGGC GCAGGCCTAC GGTCGCGCGC TGCTGCGGCC GGGCGCCGAG CCGCCGGGGG CGGTGGTGGC GCGCGGCAGG CAGGTGTGCT CGTGCTTCAA CGTCACCGAG CCCGAGATCC TGGACGCCCT CACGCGCTGC AGCGGAACCC CGGAGGCTCG GCTCGATCAA CTGCAAGGCC TGCTGAAGTG CGGCACGAAC TGCGGCTCCT GCATTCCGGC CCTGCGCACC CTCGTTCGCG GTTCGGTGCA GGCCGCCTGA
|
Protein sequence | MRETRSTCPY CGVGCGVIIE SQGAQITGVR GDPEHPANFG RLCTKGSTLH LTASAAITQQ TRLLQPLRRA ARGQAAQPVG WDAALDEAAE RFAAVIAQHG PDAVGFYLSG QLLTEDYYVF NKIAKGLIGT NNVDTNSRLC MSSAVAGYKA TLGADAPPAC YDDLKHASTI FIAGSNTAWA HPILFRRLED ARAANPAMKL IVADPRRTET AEVADLHLPL LPGTDVALFH GLLHLLMWEG LTDTAYIATH TTGFEALRDR VREFTPKHVA QVCGLDESAI VQAARWFGQN TPTLSLYCQG LNQSSSGTDK NAALINLHLA TGQIGKPGAG PFSLTGQPNA MGGREVGGLA NLLSAHRDLA NPQHRAEVAA LWGVPDVPAA PGKSAVEMFQ AAADGEIKAL WISCTNPAQS MPDQTTVRRA LERCEFVVLQ EAFATTATAA YADLLLPATT WGEKDGTVTN SERRISRVRP AVAAPGEARH DWAAAVDFAR RLEAKLPRRA ANGGSLFPYA GPESVWNEHR ESTRGRDLDI TGLSYARLDQ GPQLWPCPQG QAEGRARLYE DGVFATPDGR ARFAAMAYRP VAEPRDARYP VSLTTGRLRD HWHGMSRTGT LGRLFGHVAE PGVEIHPREL VRRGWEEGAL VHVTSRRGSV MLPVRASDSV APAQAFVAMH WGEEFVSGLS PSGVRLAGIN TLMPSAYCPQ SKQPELKHAA VKLLKAEMPW QLLAAAWLPE DQALLAQQRL RDLMPSFSYA HCVPFGREPH ARGVVGVLFR AAAEEPAADE LMAQIESLLG VAGPQALRYA DPKRGQRRAM RLVAGDGGNA RLDAFVLGGD IRAEAWIKAL LTDELPAQAY GRALLRPGAE PPGAVVARGR QVCSCFNVTE PEILDALTRC SGTPEARLDQ LQGLLKCGTN CGSCIPALRT LVRGSVQAA
|
| |