Gene RPD_0042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_0042 
Symbol 
ID4020496 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp53163 
End bp55085 
Gene Length1923 bp 
Protein Length640 aa 
Translation table11 
GC content63% 
IMG OID637960218 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_567183 
Protein GI91974524 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAGCCT GGGAGGTCAA AGGTCCTGCG AAGGGAACTT GCAGGTCGTG GCGCGACGCT 
TATCCGGAAC TTGCCGGAAT GGTCGCTCGA ATCACTGCTC CGCGCGGGCG CGGGATTGCG
TGGGAAGGCA CCGAGATCTG GGCCTATCGC GACCGGCCGA GCAGCATATC GGAGTGCCTT
GCCGCAAATG TCGCGCGCTG GCCCGATCGC GAGGCTTATG TATTCCATCC CGGCGGCGAG
AGGCTGACAT GGGGCGAAGT CGGCGCCCAA GTCGATCGCG TCGCTGCGGC GCTTCGACAG
GAATTCGGTT TCAGGAAGCG CGACCGGCTA TGTCTGCTGA CGGCGGGCTG CCCGGAATAC
GTCATCGCCT ATCTCGCCAT TGTGCAGCTC GGCGGCGTCG CCGTGCCGGT CAATCTCGGG
CTGACCGATG AAGGACTGGC GGCGCAGATC AACAAGGTTG GCGCCAAGGG GCTGGTGGTC
TCTTCCGAGG TCTGGAGCGG CAAGCTCGAT GCGGTGCGCG GCGGCCTCGA CAGCGTCGAG
GCGGTGTTCG TCATCGGCGG CGCGGCGCCG CAAGGGACGC TGGCTTTCTC CGAACTAAGC
TCCCTGAGAA CGACGCCGGT CGATCACGAG GCGGTCGATG AATGGGATCT GTGTGCGATC
TCCTTCACGT CGGGCACGAC CGGCGTGCCG AAGGGGACGA TGGCGATGCA CATCAATGCG
CTCGGCTGCG CGCAGAACGT GGTGATCGCA GCCAAGGGGC TGGGCCCCGA CGACGTCAAC
CTGTGTATGC CGCCTTTGTA TCACAACACG GCGGTCTATG CGGATTTCCT GCCTGCGCTG
CTGAGTGGCG GAAAATGCGT GATCATGTCG GCGTTCACGC CGCTGGAGGC CATCAAGCTG
ATCGAGGCCG AGCGGGCAAC GTGGGCCGTC GCGGCCCCGA TCATGCTCTG GATGATGATG
AACCATCCGG AATTCAGGAA CCACGACTGC TCGACGCTCA AGAAGATCCT CTTCGGCGGC
CATGCGTCAT CCGAAACCTT CATCAACCAG CTCAATCGCG AATTCGCGCC GATCGCGATG
GTGAACGCCG GCTCGGTGTC GGAGAGCACG GCGGTCGGTT TCGCGCTGCC CACCGAAGAC
GCGATCCGCA AGATCACCAG TTGCGGCCTC GCGACTCCGA ATACCGACAT CGCCATCTTC
GACGATGCCG GCAACGAAGT TCTCGAGCCC AACGTGATCG GTGAAGTTGC CTATCGTGGC
CAGCAGACCA ATGCGGGTTA CTGGGAAGAA CCAGGCAAGA CCGCCGAAGT CTTTCGCCGC
GACGGCTTTG TGCTGTCCGG CGACTGGGCC AAGATCGACG AAGACGGCTA TCTCTGGCTG
CTGGACCGCA AGAAAGACAT GGTCGTGCGG GGCGGCCAGA ACGTCTACTG CATCGAGGTC
GAGAACAAGT TGTACCTGCA CCCCAAGGTC CTGCGGGCAG CGGTGGTCGG CGTGCCGGAC
CACGTGTTCT CGGAGCGATT GAAGGCGATC GTGGTGCTCA AGCCCGGTGA GAGTGCAACA
GCCGACGAGA TCCGCGAGCA TTGCGCCAAG CATCTCGCCA AATACGAGAC GCCGGAATAC
GTGGTGTTCG GAGCCAGTCT GCCAGCCAAT GCCGCAGGAA AGACCCTGAA GCGGCCGCTG
GTGGACTTCT GGGGCGACTC GCCGGGCACA CCGCTTGCGC GATTCTCCGC ATTCTGCGCC
AGCCTGCCGC CGGCGTTGTT CGATACGCCG CATCTCAAGC TTGATGGACG GCCGATGACG
CCCCGCGAGG CACTGGGCGA ACTCCAACAA GGCTCGGAGC GCGGGCAACA TCTTGCGCGA
ATGATCGAAC AGCAGGGCGT CTGTGGGCTG ACGACGCCCG ACGAAGCCAG ATTTCGCAAA
TGA
 
Protein sequence
MGAWEVKGPA KGTCRSWRDA YPELAGMVAR ITAPRGRGIA WEGTEIWAYR DRPSSISECL 
AANVARWPDR EAYVFHPGGE RLTWGEVGAQ VDRVAAALRQ EFGFRKRDRL CLLTAGCPEY
VIAYLAIVQL GGVAVPVNLG LTDEGLAAQI NKVGAKGLVV SSEVWSGKLD AVRGGLDSVE
AVFVIGGAAP QGTLAFSELS SLRTTPVDHE AVDEWDLCAI SFTSGTTGVP KGTMAMHINA
LGCAQNVVIA AKGLGPDDVN LCMPPLYHNT AVYADFLPAL LSGGKCVIMS AFTPLEAIKL
IEAERATWAV AAPIMLWMMM NHPEFRNHDC STLKKILFGG HASSETFINQ LNREFAPIAM
VNAGSVSEST AVGFALPTED AIRKITSCGL ATPNTDIAIF DDAGNEVLEP NVIGEVAYRG
QQTNAGYWEE PGKTAEVFRR DGFVLSGDWA KIDEDGYLWL LDRKKDMVVR GGQNVYCIEV
ENKLYLHPKV LRAAVVGVPD HVFSERLKAI VVLKPGESAT ADEIREHCAK HLAKYETPEY
VVFGASLPAN AAGKTLKRPL VDFWGDSPGT PLARFSAFCA SLPPALFDTP HLKLDGRPMT
PREALGELQQ GSERGQHLAR MIEQQGVCGL TTPDEARFRK