Gene RPD_4081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_4081 
Symbol 
ID4024598 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4536848 
End bp4538386 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content66% 
IMG OID637964284 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_571201 
Protein GI91978542 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGACT TCATCACCCT CGATGCGCTG GTGCGCAACA CCGCGCAAGC ACGTCCGGAC 
CGCATTGCGG TGATCGATGG CGAGCGGAAA TTGCGTTACG CGGAATTCGA CGCGCTGATT
GACCGTATCG CTGCGGCCCT GCAGCGCGAC GGCGTGAAGC CGACCGATGC GATTTCGATC
TGCGCCTTGT CGTCGATCGA ATATGCCGCG ACATTCCTCG GCGCGTTGCG GGTCGGCGTC
GCCGTGGCGC CGCTGGCGCC GTCCTCGACC GCGCAGGACT TTGCCGCGAT GGTGAAGGAT
TCCAGCGCCA AGATTCTTTT CACCGACGAC TTCGCCGCCG AGGCGATGAA GGACGCCGCT
ATCGACGCCT CCGTGCGACG CGTCGCACTC GACGGCGGTG CGAGCGGCGC GGCGTTCTCG
GGCTGGCTCG CAGCCGAAGG CGCGAAGGCG GCGCCGGTCT CAGTCGATCC GGAATGGGTG
TTCAACATCA TCTATTCGTC GGGCACCACC GGCACGCCGA AGGGCATCGT GCACACCCAC
AGTCTGCGCT GGCGGCAATA CGGCCAGCTC GATCCGCTCG GTTACGGCCC CGAGGCCGTG
ACGCTGCTGT CGACGCCGCT TTATTCCAAC ACCACGCTGG TCTGTTTCAA TCCGACGCTG
GCCGGTGGCG GCACGCTGGT GCTGATGAAG AAGTTCGACG CCCGCGGCTT TCTCGACCTC
GCCCAACAGC ACCGCGTCAC CCACGCGATG CTGGTGCCGG TGCAGTATCG GCGGATCATG
GCGCTGCCGG AATTCGGTTC CTACGAGCTG TCGTCGTTCG TGATGAAGTT CTGCACCTCG
GCGCCGTTCG CGGCCGAGCT GAAGCGCGAC ATCCTTGCGC GCTGGCCGGG AGGCCTCACC
GAGTTTTATG GCATGACCGA GGGCGGCGGT TCCTGCGCGC TGCTCGCGCA CGAACATCCC
GACAAGCTCG GAACCGTCGG CCAACCGATG CCCGACCACA TCATCCGGCT GATCGACGAG
GACGGCAATT TCTTGCCGCA GGGCAGCATC GGCGAGATCG TCGGCCGCTC GGCGGTGGTG
ATGACGGGCT ATCTCAACCA ACCACAGAAA ACCGCCGAGA CGTTCTGGAC CGACAAGGAC
GGCCAGCGCT GGGTGCGCAC CGGCGACGTC GGACGTTTCG ATCAGGACGG CTTCCTGACG
CTGATGGACC GCAAGAAGGA CATGATCATC TCCGGCGGCT TCAACATCTA TCCGAGCGAC
ATCGAGGCGA TCGCGAGCCA GCATCCCGCG GTGCTCGAAG TCGCCGTCGT TGGTATGCCG
TCCGAAGATT GGGGCGAGAC GCCGGTGGCG TTCGTTGTGG CGCGGCCGGG CGCGATGCTC
GATCCGGCGG AGCTGAAGGC GTGGACCAAT GCGAAGGTCG GCAAGACCCA GCGGCTGTCC
GAGGTCGTCC TCTCCGAAGC GCTGCCGCGC AGCGCGATCG GCAAGGTGCT GAAACGCGAG
CTCCGCGATC AGCGGCTGGC GGCGGGCGCC GTGTCGTGA
 
Protein sequence
MPDFITLDAL VRNTAQARPD RIAVIDGERK LRYAEFDALI DRIAAALQRD GVKPTDAISI 
CALSSIEYAA TFLGALRVGV AVAPLAPSST AQDFAAMVKD SSAKILFTDD FAAEAMKDAA
IDASVRRVAL DGGASGAAFS GWLAAEGAKA APVSVDPEWV FNIIYSSGTT GTPKGIVHTH
SLRWRQYGQL DPLGYGPEAV TLLSTPLYSN TTLVCFNPTL AGGGTLVLMK KFDARGFLDL
AQQHRVTHAM LVPVQYRRIM ALPEFGSYEL SSFVMKFCTS APFAAELKRD ILARWPGGLT
EFYGMTEGGG SCALLAHEHP DKLGTVGQPM PDHIIRLIDE DGNFLPQGSI GEIVGRSAVV
MTGYLNQPQK TAETFWTDKD GQRWVRTGDV GRFDQDGFLT LMDRKKDMII SGGFNIYPSD
IEAIASQHPA VLEVAVVGMP SEDWGETPVA FVVARPGAML DPAELKAWTN AKVGKTQRLS
EVVLSEALPR SAIGKVLKRE LRDQRLAAGA VS