Gene RPD_3103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3103 
Symbol 
ID4023608 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3447856 
End bp3449544 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content65% 
IMG OID637963304 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_570230 
Protein GI91977571 
COG category[I] Lipid transport and metabolism 
COG ID[COG0365] Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0304891 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAACA CTCCTCGTCC GGCCGCCGCC CCCGAAGATT ATGCCGACGT CCTTGCGCGG 
TTTCGCCGCG ACGACCTCGA GGCCCTATTC GACGGCAATT TCGCAACCGG CCTCAACGTC
TGCGTCGAAT GCTGCGATCG CCACGTCGAA GGCGGCGGCA TTGCGCTCGA TTGGGAAAGC
CAGGACGGCC GCACCGCCTC CTTCACTTTC GCCGAGATGA AGGACTATGC CGCCCGCGTC
GCCAACCTGC TGGTGGCGCA GGGCGTCAAA CCCGGCGACG TGGTCGCCGG CATGTTGCCG
CGGACGCCCG AATTACTTGC CTTGATCCTC GGCACCTGGC GCGCCGGCGC CGTCTATCAG
CCGCTGTTCA CCGCCTTCGG ACCCAAGGCG ATCGAGCACC GCCTCAAAAT GAGCGCCGCC
AAGCTGGTGG TCACCGACCT CGCCAACCGG GCCAAGCTGG CCGACGTCGT CGACTGCCCG
ACCGTCGCGA TCGTCACGCG CCCAGGTGAA AGCGCCCCGC TCGGCGACCT CGATTTCCAC
GCCGAGATCG CTTCGAAATC CACACAGTTC GAGCCGGTGC TGCGCAAAGG CAGTGACCTG
TTCCTGATGA TGTCGACGTC GGGCACCACC GGCCTTCCCA AGGGCGTGCC GGTGCCGATC
GACGCGCTGC CGGCGTTCTA TTCCTACATG CGCGACGCGG TCGACCTGCG CCCGGACGAC
ATCTTCTGGA ACATTGCCGA TCCCGGCTGG GCCTACGGGC TGTACTACGC CGTCACCGGT
CCCTTGCTGC TCGGCCACGC GACCACCTTT TTCGACGGGC CGTTCACGGC CGAGAGCACC
TACGGCCTGA TCAAGCGCCG CGGCATCACC AATCTCGCCG GCGCCCCCAC CGCCTACCGA
CTGCTGATTG CGGCGGGTCC GGCGATGGCA GCTCCGGTGA AGGGCCAGCT TCGCGTCGTC
AGCAGCGCCG GCGAGCCGCT GAACCCGGAA GTGATTCGCT GGTTCGCCGA ACATCTCGGC
GCGCCGATCC ACGATCATTA CGGCCAGACC GAGCTCGGCA TGGTGGTCAA CAATCACCAT
CGCCTGCGCC ATACCGTGCA TCCAGGCTCG GCGGGTCTTG CCATGCCCGG CTTCCGCGTC
GCGGTGCTCG ACGACGACAG TCGCGAACTG CCGGCTAACG TGCCGGGCAT TCTCAGCGTC
GATCTCGCCC GCTCGCCGCT GATGTGGTTC TCTGGCTATT GGCAGCAGGA CACTCCCGCC
ATCGCCAACG GCTATTACCG CACCGGCGAC ACCGTGGAGA TGGAACCCGA TGGCTCGATC
AGTTTCGTCG GCCGCTCCGA CGACGTCATC ACTTCGTCGG GCTACCGGAT CGGTCCATTC
GACGTCGAAA GCGCGCTGAT CGAACACCCG GCGGTGATCG AGGCGGCGGT GATCGGTAAG
CCTGACGCCG AGCGCACCGA AATCGTCAAA GCCTATGTGG TACTGGCCAA TGACGTCGAA
CCCACCGACC AACTTGCCGA GGAGCTGCGC CAATACGTCA AGAAGCGGCT CTCCGCCCAC
GCCTATCCGC GCGAGATCGA ATTCCTGGAG CAGTTGCCGA AGACACCGAG CGGCAAACTG
CAGCGGTTCA TTTTGCGCAA GCGCGATGTC GACCAGAGCA AGGATGCGCC GGCCCCGTCG
GCGCACTGA
 
Protein sequence
MSNTPRPAAA PEDYADVLAR FRRDDLEALF DGNFATGLNV CVECCDRHVE GGGIALDWES 
QDGRTASFTF AEMKDYAARV ANLLVAQGVK PGDVVAGMLP RTPELLALIL GTWRAGAVYQ
PLFTAFGPKA IEHRLKMSAA KLVVTDLANR AKLADVVDCP TVAIVTRPGE SAPLGDLDFH
AEIASKSTQF EPVLRKGSDL FLMMSTSGTT GLPKGVPVPI DALPAFYSYM RDAVDLRPDD
IFWNIADPGW AYGLYYAVTG PLLLGHATTF FDGPFTAEST YGLIKRRGIT NLAGAPTAYR
LLIAAGPAMA APVKGQLRVV SSAGEPLNPE VIRWFAEHLG APIHDHYGQT ELGMVVNNHH
RLRHTVHPGS AGLAMPGFRV AVLDDDSREL PANVPGILSV DLARSPLMWF SGYWQQDTPA
IANGYYRTGD TVEMEPDGSI SFVGRSDDVI TSSGYRIGPF DVESALIEHP AVIEAAVIGK
PDAERTEIVK AYVVLANDVE PTDQLAEELR QYVKKRLSAH AYPREIEFLE QLPKTPSGKL
QRFILRKRDV DQSKDAPAPS AH