Gene RPD_2203 details

Gene Information

Locus tag: RPD_2203
Symbol:
ID: 4022688
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 2462581
End bp: 2464185
Gene Length: 1605 bp
Protein Length: 534 aa
Translation table: 11
GC content: 64%
IMG OID: 637962398
Product: AMP-dependent synthetase and ligase
Protein accession: YP_569339
Protein GI: 91976680
COG category: [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID:
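
The coordinates and lengths in this table can be cross-checked against each other. The following is a minimal sketch, not part of the original record: the function name `check_record` is hypothetical, and it assumes inclusive 1-based coordinates and a single terminal stop codon that is counted in the gene length but not in the protein length.

```python
def check_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check the coordinate and length fields of a CDS record.

    Assumes inclusive 1-based coordinates and one stop codon at the end
    of the gene that is not counted in the protein length.
    """
    span_bp = end_bp - start_bp + 1        # inclusive span on the replicon
    codons = gene_length_bp // 3           # total codons, stop included
    return {
        "span_matches_length": span_bp == gene_length_bp,
        "length_is_multiple_of_3": gene_length_bp % 3 == 0,
        "protein_matches_codons": codons - 1 == protein_length_aa,
    }


# Values copied from the Gene Information table for RPD_2203.
result = check_record(2462581, 2464185, 1605, 534)
print(result)
```

For this record all three checks hold: 2464185 - 2462581 + 1 = 1605 bp, and 1605 / 3 = 535 codons, one of which is the stop codon, leaving 534 amino acids.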


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.334168
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.340753
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAGTC TTGAAGCAAC CGGCGGCGTG CCGGGACCGG GCCGGATCGG CCGGGTCGCG 
ATCGGCGATA TCCTGCGGAA ATCGGCGCTG CGGTTTCGCG ACCGCATCGC GCTCACCGAT
GGCGCGCGGC AGGTGAGTTA TGACGAGCTT GAACGCGACG CCAATCGCTT CGCGAACTAC
CTCGTCTCGC GCGGCCTGAA GCCGGGAGAC AAGATTTCGA CGATCTGCAA CAACTCGATC
GAGTTCGTCA AAGCGCTGTT CGGCATTCAC CGCGCCGGCC TCGTCTGGGT GCCGATCAAC
ACCATGCTCG GCCCCGACGA CATGGGCTAC ATCCTCGACC ATGCCGGCGT GAAATTCGCG
CTGATCGACG ACAATCTGCA CGGCCAGCCC GATCGCCGCT CGGCGCTCGA GCAGCGCGGC
ATCGCGCTGG TCGCGGTCGA TCTCACCGGC AAGGCCGGCG AGGCCGGGCT GAAGAGCTTC
AACGAACTGA TCGCCGGGCA GTCCGAGATC GAGCCGGAGG TCGCGTTCGA CGACCGCGAT
CTGGCGATGA TCATCTACAC GTCCGGAACC ACGTCGCGAC CGAAGGGCGC GATGCATTGT
CACCTCGCGG TAACGATGGC GGTGATGAGC AATGCGATCG AGCTGCAGCT GTCGCGCAAG
GACGGCATCA CCGGGCAGTT CCCGCTGTTT CACTGCGCCG CGCACGTGCT GCTGCTGAGC
TATCTGGTGG TCGGCGGAAA GATGGCGCTG ATGCGCGGCT TCGATCCCGT CGCCTGCATG
GAGGCGATCC AGCGCGACAA GCTCAGCGTC TTCATCGGAC TGCCGCTGAT GTATCAGGTG
ATCCTCGACC ATCCGCGGCG CAAGGAGTTC GATCTCTCGA CGCTGCGATG CTGCATCTAC
ACCATGGCGC CGATGCCGCG GCCGTTGCTC GAGCGCGCGA TCGCCGAACT GTGCCCGAAT
TTCGTCCAGC CCAGCGGCCA GACCGAAATG TATCCGGCGA CCACGATGTC GCAGCCCGAT
CGCCAGCTCG CGCGCTTCGG CAACTACTGG GGCGAATCGA CCATGGTCAA CGAGACTGCG
ATCATGGACG ATGCCGGAAA CCTGCTGCCG CCCGGCGAGA TCGGCGAACT GGTGCATCGC
GGTCCCAACG TGATGCTCGG CTACTACAAG GATCCCGAGG CCACCGAAGC GGCCCGAAAG
TTCGGCTGGC ATCACACCGG CGATCTGGCG ATGATCGATG CGCATGGCGA GGTGTTGTTC
GTCGACCGCA AGAAGGACAT GATCAAGTCC GGCGGCGAGA ACGTCGCTTC GGTCAAGATC
GAGGAGACCC TGCTCGCGCA TCCGGCGGTG ATGAACGCCG CGGTTGTCGG TCTGCCGCAT
CCGCAATGGG GCGAGGCGGT GTCGGGCTTC GTTAAGCTCA AGCCCGGCGC CTCGGCGACC
GAGACGGAGA TCATCGAGCA CTGCCGCAAA TCGCTCGGCG GTTTCCAGAT TCCGAAAATG
GTGCGAATCG TCGAGGAGAT GCCGATGACC GCGACCGGCA AGCTTCGCAA GATCGAGCTG
CGCAACCAGT TCACGGATTA CTTCGCGATG GCGCAGACGG GGTGA
 
Protein sequence
MTSLEATGGV PGPGRIGRVA IGDILRKSAL RFRDRIALTD GARQVSYDEL ERDANRFANY 
LVSRGLKPGD KISTICNNSI EFVKALFGIH RAGLVWVPIN TMLGPDDMGY ILDHAGVKFA
LIDDNLHGQP DRRSALEQRG IALVAVDLTG KAGEAGLKSF NELIAGQSEI EPEVAFDDRD
LAMIIYTSGT TSRPKGAMHC HLAVTMAVMS NAIELQLSRK DGITGQFPLF HCAAHVLLLS
YLVVGGKMAL MRGFDPVACM EAIQRDKLSV FIGLPLMYQV ILDHPRRKEF DLSTLRCCIY
TMAPMPRPLL ERAIAELCPN FVQPSGQTEM YPATTMSQPD RQLARFGNYW GESTMVNETA
IMDDAGNLLP PGEIGELVHR GPNVMLGYYK DPEATEAARK FGWHHTGDLA MIDAHGEVLF
VDRKKDMIKS GGENVASVKI EETLLAHPAV MNAAVVGLPH PQWGEAVSGF VKLKPGASAT
ETEIIEHCRK SLGGFQIPKM VRIVEEMPMT ATGKLRKIEL RNQFTDYFAM AQTG
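
The correspondence between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is an illustration only and not part of the original record: the `translate` helper is hypothetical, and it uses the standard amino acid assignments, which translation table 11 shares with table 1 (the two differ in their permitted start codons). It translates the first three blocks and the last line of the gene sequence, on the assumption that the last line begins on a codon boundary, which holds if each of the 26 preceding lines carries 60 bases.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the spaces between 10-base blocks) is ignored,
    and a trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final line of the gene sequence above.
first_blocks = "ATGACCAGTC TTGAAGCAAC CGGCGGCGTG"
last_line = "CGCAACCAGT TCACGGATTA CTTCGCGATG GCGCAGACGG GGTGA"

print(translate(first_blocks))  # MTSLEATGGV
print(translate(last_line))     # RNQFTDYFAMAQTG
```

Both results match the start (`MTSLEATGGV`) and the end (`RNQFTDYFAM AQTG`) of the protein sequence listed above, and the final codon `TGA` is a stop codon.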