Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_2052 |
Symbol | |
ID | 3909867 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 2331334 |
End bp | 2333652 |
Gene Length | 2319 bp |
Protein Length | 772 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637883945 |
Product | large subunit of N,N-dimethylformamidase |
Protein accession | YP_485670 |
Protein GI | 86749174 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGAAA AGAAAGTATT TGGATACGCG GACAAGATTT CGGTCAAGCC CGGCGACGAC ATCTCGTTCT TCGTGCACGC CGATGGCACC GACGTCGTCG ACGCGCAGCT GGTGCGACTG ATCCATGGCG ACGCGCATCC GGCCGGACCG GGCTATCGGG AGGAGGAGAT CGCCTGCGAA GCCAACGGGG TGTGGCGGGT TCGCAAGCAG TTCACCCAGG TCGGCTCGTT TCTCACCGTG GCCGATCCGG AACAGCGCCT CGCGCCGAAC GGCAGCTTCA GTCTCTGTAT CTTCGTGCAT CCGAACAGCC CGGGCGGCGG CCGACGGCAA TGCCTGCTCG GCAAGTGGGA CGCGTTCGGC AACCGTGGCT ACGGTTTGTG GCTCAACCCG GACGGCTTTC TCGAATTCGG CTTCGGCGAC GGCCACGAGG TCGACTATCT CGACGCCGAA GTGCCGGTTC TGAAGAACAA CTGGTACTTC GTCGCGGCGA CGTTCGACGC AACGACCGGC GTGGCGACGC TGTATCAGGA GGGGGTAGCG ACCCGCTATA ACTCGCTGCT GAGCAAGGTC GCCAATGTCG ACTTCCGCTC GCATGTGCGC GAGACGTTGC GATTCCGCCC GGTCAATCCG CCGGACGTGC CGTTCCTGCT CGCGGGCGCG CGGGATCATC ACACGCTGCG CGGCGATTTC GTCACCCAAT GCCTGAACGG CAAACTGGAC CGGCCCGCGG TGTTCGATCG GGTGCTGACG CATGACGAAC TCGACCGCTA TCGCGACACC GGGCTGGCGC CGCAGAATGG CCTGCTGGCC TATTGGGACA CGGCGCAGGG CTACACCGCG CAGGGCATCG GCGATCGGGT CATCGATGTC GGGCCCTATG GATTGCACGC CGAGGGCTAC AATCGCCCGG TGCGCGCGCA GACCGGGTTC AACTGGCAGG GCCGCGACGA CTGCTTCCGG CTCGCGCCCG AGCAATATGG TGGCATCGAG TTTCACGACG ATGCCATCAT CGATTGCAAG TGGGAGCTGA CGCGGTCGAT CAAGCTGCCG GATCTGCGCA GCGGCGCCTA TGCGTTCCGG CTGCGCACCG GCGACGCCAA GGGCATGCGC GAGGAATACA TCGTGTTCTT CGTCCGCTCC GTCGCGCCGA AAGCGCCGAT TCTGTTTCTG GTGCCGACGG GCAGCTACCT GGCCTATGCC AACGAGCATC TCAGTTTCGA CGCTGAGATC ATGCAGCCGC TCGCCGGACA ATCACCGATC CTCTCCGAGG TCGACATCGA ATTGTACCAG ACCGCCGAGT TCGGTCTGTC GCTGTACGAC CATCACGCCG ATGGCGCAGG CGTCTGCTAC AGCACCTATC GCCGGCCGAT CCTGAATATG CGTCCGAAGG CGCGGATGTC GTCGATGGGC GTGACCTGGC AGTTCCCGGC GGATCTGTCG ATCATCGCCT GGCTCGAGCA CATGGGCTAC GACTACGATT TGACGACCGA CGAGGATCTG CATCGGGAGG GGGCCGATGC GCTGAAGCCG TACAATGTGG TGCTGAGCGG GACGCATCCG GAATACACCT CGGAAGCGAT GCTCGATGCC ACCGAAGACT ACATCGCCGC CGGCGGCCGG TTCATCTATC TCGGCGGCAA CGGCTTCTAT TGGAACGTCG GCTACCACAG CGATGATCCG TGGTGCATGG AAGTGCGGAA GCTGAATTCC GGCATGCGGG CATGGCAGGC GCGCCCCGGC GAGTACTATC TGGCGACGAC CGGTCAGAAG AGCGGCCTGT GGAAGGATCT CGGGCGGCCG CCGCAGAAGA TCTTCGGCGT CGGCTTCATC TCGGAAGGTT TCGATTCGGC GCGGCCGTTC CGGCGGATGC CGGATAGCTG GCATCGCCGC GTGTCATGGA TTATGGACGG CATCGAGGGC GAGATCATCG GCGATTTCGG TCTGGCGCAG GGCGCCGCCG GCGGCATCGA GATTGATCGC TACGATCTGA CGCTCGGCAC CCCGCCGCAT TCGCTGATCG TCGCCTCTTC CGGCGGACAC AGCGACAACT ATCAGACAGT GGTGGAGGAG GTGCTCTATC CCTATCCAGG GCTGTCCGGA TCGCACGACT ATCGTGTGCG GGCCGACATG GTCTATTTCA CGGCTCCGAA TGACGGCGCG GTGTTTTCGA CCGGATCGAT CGCCTTCAGT CAATCTTTGC CGTACCAGAA TTTCGACAAC AACGTATCGC GCCTGCTGGC GAACGTCGTC ACGGCATTCA GCAAGCCGGG GAAACTGCCG GGTTGGGCGT GGTCGGCCGA GGAAAAGCAA TGGCGATGA
|
Protein sequence | MAEKKVFGYA DKISVKPGDD ISFFVHADGT DVVDAQLVRL IHGDAHPAGP GYREEEIACE ANGVWRVRKQ FTQVGSFLTV ADPEQRLAPN GSFSLCIFVH PNSPGGGRRQ CLLGKWDAFG NRGYGLWLNP DGFLEFGFGD GHEVDYLDAE VPVLKNNWYF VAATFDATTG VATLYQEGVA TRYNSLLSKV ANVDFRSHVR ETLRFRPVNP PDVPFLLAGA RDHHTLRGDF VTQCLNGKLD RPAVFDRVLT HDELDRYRDT GLAPQNGLLA YWDTAQGYTA QGIGDRVIDV GPYGLHAEGY NRPVRAQTGF NWQGRDDCFR LAPEQYGGIE FHDDAIIDCK WELTRSIKLP DLRSGAYAFR LRTGDAKGMR EEYIVFFVRS VAPKAPILFL VPTGSYLAYA NEHLSFDAEI MQPLAGQSPI LSEVDIELYQ TAEFGLSLYD HHADGAGVCY STYRRPILNM RPKARMSSMG VTWQFPADLS IIAWLEHMGY DYDLTTDEDL HREGADALKP YNVVLSGTHP EYTSEAMLDA TEDYIAAGGR FIYLGGNGFY WNVGYHSDDP WCMEVRKLNS GMRAWQARPG EYYLATTGQK SGLWKDLGRP PQKIFGVGFI SEGFDSARPF RRMPDSWHRR VSWIMDGIEG EIIGDFGLAQ GAAGGIEIDR YDLTLGTPPH SLIVASSGGH SDNYQTVVEE VLYPYPGLSG SHDYRVRADM VYFTAPNDGA VFSTGSIAFS QSLPYQNFDN NVSRLLANVV TAFSKPGKLP GWAWSAEEKQ WR
|
| |