Gene RPC_3202 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_3202 
Symbol 
ID3972000 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp3545333 
End bp3546934 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content65% 
IMG OID637926312 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_533063 
Protein GI90424693 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0351635 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.132345 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGATC TTGCAGCCAC TGGCGGCGTT TCCGGGCCCG GGCGTATCGG CCGGGTGGCG 
ATCGGCGATC TGCTGAAGAA GGCCGCGGTG CGCTTCGCCG ATCGCGTCGC ACTGAGCGAT
GGCGAGCGCC GCGTCAGCTT CGCCGAGTTG GAACGCGACG CCAACCGCTT CGCCAACTAT
CTGGTGTCGC GCGGGTTGCG GCCCGGCAGC AAGATCTCGA CGATTTGCAA CAACTCGGTG
GAATTCGTCA AAGCGCTGTT CGGGATTCAC CGCGCTGGCC TGGTTTGGGT GCCGATCAAC
ACCATGCTGG GGCCGGCGGA TATGGACTAC ATCCTGGACC ACGCCGAGGT GAACCTCGCG
CTGATCGACG ACAATCTGCA CGCCCAGCCG GAGCGCCGCG CCGCACTCGA AAAGCGCGGC
ATCGATCTGG TCACGATCGA TCTCACCGGC AAGGCGAAGG CCGCCGGACT CGCCAGCTTC
GGCGAGTTGA TCGAGGGGCA ATCCGACGCC GAGCCGGAGA TCGATTTCGA CGACCGCGAT
CTGGCGATGA TCATCTACAC CTCCGGCACC ACTTCGCGGC CGAAGGGTGC GATGCATTGC
CACCTCGCGG TGGTGATGGC GGTGATGAGC AACTGCATCG AAATGAAACT CGGCCGCGAC
GACGGCATCA CCGGCCAGTT TCCGATCTTT CACTGCGCCG GCCATGTGCT GCTGTTGAGC
TATCTTGCGG TCGGCGGCCG GATGGCGCTG ATGCGCGGTT TCGATCCCAT GGTCTGCATG
GAGGCGATCC AGCGCGACAA GCTCACGGTG TTTGTCGGCC TGTCGCTGAT GTATCAGGCG
ATTCTCGATC ATCCCCGCCG CAACGACTAC GACCTGTCGA GCCTGCGGAT GTGCATCTAC
ACCATGGCGC CGATGGGGCG GCCGCTGCTG GAGCGCGGCA TCAGGGAGCT GTGTCCGAAC
TTCGCGCTGA CGTCGGGACA GACCGAGATG TATCCGGCGA CCACGATGTC GCAGCCGGAC
CGTCAGCTCG AGCGGTTCGG CAATTATTGG GGCGAATCGC TGATCGTCAA CGAGACCGCG
ATCATGGACG AGGAGGGGCG TCTGCTGCCG CGCGGCGAGA TCGGCGAATT GGTGCATCGC
GGCCCCAACG TGATGATGGG CTACTACAAG GATCCCGAGG CCACCGCCGC CGCGCGCAAG
TTCGGCTGGC ATCACACCGG CGATCTGGCG CTGATCGACG CGAACGGCGA GGTGCTGTTC
CTCGACCGCA AGAAAGACAT GATCAAGTCC GGCGGCGAGA ACGTCGCCTC GGTGAGGATC
GAGGAGACGC TGCTGGCGCA TCCCGCGGTG CAGAACGCCG CTGTGGTCGG ATTGCCGCAT
CCGCAATGGG GCGAGGCGGT GTCGGCCTTC GTGAAGTTGA AGCCGGGCGC CGCGGCCAGC
GAGGCCGAGA TCGCCGAGCA TTGCAAGGCC CATCTCGGCG GCTTCCAGGT GCCGAAGCTG
ATCCGGATCG TCGAGGACAT GCCGATGACC GCCACCGGCA AGCTGCGCAA GGTCGAACTG
CGCAACCGGT TTGCCGACTT CTTTGCGACG GAGCGGGCGT GA
 
Protein sequence
MSDLAATGGV SGPGRIGRVA IGDLLKKAAV RFADRVALSD GERRVSFAEL ERDANRFANY 
LVSRGLRPGS KISTICNNSV EFVKALFGIH RAGLVWVPIN TMLGPADMDY ILDHAEVNLA
LIDDNLHAQP ERRAALEKRG IDLVTIDLTG KAKAAGLASF GELIEGQSDA EPEIDFDDRD
LAMIIYTSGT TSRPKGAMHC HLAVVMAVMS NCIEMKLGRD DGITGQFPIF HCAGHVLLLS
YLAVGGRMAL MRGFDPMVCM EAIQRDKLTV FVGLSLMYQA ILDHPRRNDY DLSSLRMCIY
TMAPMGRPLL ERGIRELCPN FALTSGQTEM YPATTMSQPD RQLERFGNYW GESLIVNETA
IMDEEGRLLP RGEIGELVHR GPNVMMGYYK DPEATAAARK FGWHHTGDLA LIDANGEVLF
LDRKKDMIKS GGENVASVRI EETLLAHPAV QNAAVVGLPH PQWGEAVSAF VKLKPGAAAS
EAEIAEHCKA HLGGFQVPKL IRIVEDMPMT ATGKLRKVEL RNRFADFFAT ERA