Gene RPB_2052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2052 
Symbol 
ID3909867 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2331334 
End bp2333652 
Gene Length2319 bp 
Protein Length772 aa 
Translation table11 
GC content63% 
IMG OID637883945 
Productlarge subunit of N,N-dimethylformamidase 
Protein accessionYP_485670 
Protein GI86749174 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGAAA AGAAAGTATT TGGATACGCG GACAAGATTT CGGTCAAGCC CGGCGACGAC 
ATCTCGTTCT TCGTGCACGC CGATGGCACC GACGTCGTCG ACGCGCAGCT GGTGCGACTG
ATCCATGGCG ACGCGCATCC GGCCGGACCG GGCTATCGGG AGGAGGAGAT CGCCTGCGAA
GCCAACGGGG TGTGGCGGGT TCGCAAGCAG TTCACCCAGG TCGGCTCGTT TCTCACCGTG
GCCGATCCGG AACAGCGCCT CGCGCCGAAC GGCAGCTTCA GTCTCTGTAT CTTCGTGCAT
CCGAACAGCC CGGGCGGCGG CCGACGGCAA TGCCTGCTCG GCAAGTGGGA CGCGTTCGGC
AACCGTGGCT ACGGTTTGTG GCTCAACCCG GACGGCTTTC TCGAATTCGG CTTCGGCGAC
GGCCACGAGG TCGACTATCT CGACGCCGAA GTGCCGGTTC TGAAGAACAA CTGGTACTTC
GTCGCGGCGA CGTTCGACGC AACGACCGGC GTGGCGACGC TGTATCAGGA GGGGGTAGCG
ACCCGCTATA ACTCGCTGCT GAGCAAGGTC GCCAATGTCG ACTTCCGCTC GCATGTGCGC
GAGACGTTGC GATTCCGCCC GGTCAATCCG CCGGACGTGC CGTTCCTGCT CGCGGGCGCG
CGGGATCATC ACACGCTGCG CGGCGATTTC GTCACCCAAT GCCTGAACGG CAAACTGGAC
CGGCCCGCGG TGTTCGATCG GGTGCTGACG CATGACGAAC TCGACCGCTA TCGCGACACC
GGGCTGGCGC CGCAGAATGG CCTGCTGGCC TATTGGGACA CGGCGCAGGG CTACACCGCG
CAGGGCATCG GCGATCGGGT CATCGATGTC GGGCCCTATG GATTGCACGC CGAGGGCTAC
AATCGCCCGG TGCGCGCGCA GACCGGGTTC AACTGGCAGG GCCGCGACGA CTGCTTCCGG
CTCGCGCCCG AGCAATATGG TGGCATCGAG TTTCACGACG ATGCCATCAT CGATTGCAAG
TGGGAGCTGA CGCGGTCGAT CAAGCTGCCG GATCTGCGCA GCGGCGCCTA TGCGTTCCGG
CTGCGCACCG GCGACGCCAA GGGCATGCGC GAGGAATACA TCGTGTTCTT CGTCCGCTCC
GTCGCGCCGA AAGCGCCGAT TCTGTTTCTG GTGCCGACGG GCAGCTACCT GGCCTATGCC
AACGAGCATC TCAGTTTCGA CGCTGAGATC ATGCAGCCGC TCGCCGGACA ATCACCGATC
CTCTCCGAGG TCGACATCGA ATTGTACCAG ACCGCCGAGT TCGGTCTGTC GCTGTACGAC
CATCACGCCG ATGGCGCAGG CGTCTGCTAC AGCACCTATC GCCGGCCGAT CCTGAATATG
CGTCCGAAGG CGCGGATGTC GTCGATGGGC GTGACCTGGC AGTTCCCGGC GGATCTGTCG
ATCATCGCCT GGCTCGAGCA CATGGGCTAC GACTACGATT TGACGACCGA CGAGGATCTG
CATCGGGAGG GGGCCGATGC GCTGAAGCCG TACAATGTGG TGCTGAGCGG GACGCATCCG
GAATACACCT CGGAAGCGAT GCTCGATGCC ACCGAAGACT ACATCGCCGC CGGCGGCCGG
TTCATCTATC TCGGCGGCAA CGGCTTCTAT TGGAACGTCG GCTACCACAG CGATGATCCG
TGGTGCATGG AAGTGCGGAA GCTGAATTCC GGCATGCGGG CATGGCAGGC GCGCCCCGGC
GAGTACTATC TGGCGACGAC CGGTCAGAAG AGCGGCCTGT GGAAGGATCT CGGGCGGCCG
CCGCAGAAGA TCTTCGGCGT CGGCTTCATC TCGGAAGGTT TCGATTCGGC GCGGCCGTTC
CGGCGGATGC CGGATAGCTG GCATCGCCGC GTGTCATGGA TTATGGACGG CATCGAGGGC
GAGATCATCG GCGATTTCGG TCTGGCGCAG GGCGCCGCCG GCGGCATCGA GATTGATCGC
TACGATCTGA CGCTCGGCAC CCCGCCGCAT TCGCTGATCG TCGCCTCTTC CGGCGGACAC
AGCGACAACT ATCAGACAGT GGTGGAGGAG GTGCTCTATC CCTATCCAGG GCTGTCCGGA
TCGCACGACT ATCGTGTGCG GGCCGACATG GTCTATTTCA CGGCTCCGAA TGACGGCGCG
GTGTTTTCGA CCGGATCGAT CGCCTTCAGT CAATCTTTGC CGTACCAGAA TTTCGACAAC
AACGTATCGC GCCTGCTGGC GAACGTCGTC ACGGCATTCA GCAAGCCGGG GAAACTGCCG
GGTTGGGCGT GGTCGGCCGA GGAAAAGCAA TGGCGATGA
 
Protein sequence
MAEKKVFGYA DKISVKPGDD ISFFVHADGT DVVDAQLVRL IHGDAHPAGP GYREEEIACE 
ANGVWRVRKQ FTQVGSFLTV ADPEQRLAPN GSFSLCIFVH PNSPGGGRRQ CLLGKWDAFG
NRGYGLWLNP DGFLEFGFGD GHEVDYLDAE VPVLKNNWYF VAATFDATTG VATLYQEGVA
TRYNSLLSKV ANVDFRSHVR ETLRFRPVNP PDVPFLLAGA RDHHTLRGDF VTQCLNGKLD
RPAVFDRVLT HDELDRYRDT GLAPQNGLLA YWDTAQGYTA QGIGDRVIDV GPYGLHAEGY
NRPVRAQTGF NWQGRDDCFR LAPEQYGGIE FHDDAIIDCK WELTRSIKLP DLRSGAYAFR
LRTGDAKGMR EEYIVFFVRS VAPKAPILFL VPTGSYLAYA NEHLSFDAEI MQPLAGQSPI
LSEVDIELYQ TAEFGLSLYD HHADGAGVCY STYRRPILNM RPKARMSSMG VTWQFPADLS
IIAWLEHMGY DYDLTTDEDL HREGADALKP YNVVLSGTHP EYTSEAMLDA TEDYIAAGGR
FIYLGGNGFY WNVGYHSDDP WCMEVRKLNS GMRAWQARPG EYYLATTGQK SGLWKDLGRP
PQKIFGVGFI SEGFDSARPF RRMPDSWHRR VSWIMDGIEG EIIGDFGLAQ GAAGGIEIDR
YDLTLGTPPH SLIVASSGGH SDNYQTVVEE VLYPYPGLSG SHDYRVRADM VYFTAPNDGA
VFSTGSIAFS QSLPYQNFDN NVSRLLANVV TAFSKPGKLP GWAWSAEEKQ WR