Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_4572 |
Symbol | |
ID | 6412256 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 4924516 |
End bp | 4926447 |
Gene Length | 1932 bp |
Protein Length | 643 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 642714452 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_001993541 |
Protein GI | 192292936 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1022] Long-chain acyl-CoA synthetases (AMP-forming) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGGACT ATGCCGGCCG CGTCGCGGCA GCCGACACCT TCCCCAAGCT GCTCCGGCTC AATGCAAAGG AATTCGGGAC CGAGATCGCG CTGCGCGAGA AAGATCTCGG GCTGTGGCGG GTGTTCACCT GGGCCGACTA TCAGGCCCGC GTGCACGACT TCGCCCTCGG CATGGTCGAA CTCGGGCTCA AGCGCGGCGA CGTGATCGGC ATCATCGGCG ACAACCGGCC TGACTGGGTT TCGGCCGAGA TCGCCACCCA TGCGATTGGC GGCCTGAGCC TCGGCCTGTA CCGCGACGTG CTCGATGAGG AGGCCGACTA TCTGCTCAAT TACGGCGAGG CCAAGCTGGT GTTCGCCGAG GACGAGGAGC AGGTCGACAA GTTACTTGGC CTCGCCGACC GCGCGCCGCA TCTGAAGCAC ATCGTCTATT CCGATCCACG CGGGATGCGG AAATACGACG ATCCGCGCCT CTTGCCGGCC GATCAACTCG CCAAGCTGGG CCGCGATCGC GCCAGCCGCG AACCGGGCCT GTACGACCGG CTGGTCGACG CGACGCAGGG CGAAGACGTC GCCATCCTGT GCACCACCTC GGGCACCACG GCGCACCCCA AGCTGGCGAT GCTGGCGTCC GGGCGCGTGC TGAAACACTG CGCGACCTAT CTCAGCTTCG ATCCGAAGGG GCCCGACGAC GAATACGTCT CGGTACTGCC GCTCCCCTGG ATCATGGAGC AGGTTTACGC ACTCGGCAAA GGGCTGCTGT GCCGGATGAA GGTCAACTTC GTCGAAGAGC CCGACACGAT GATGAACGAC TTCCGCGAGA TCGCGCCGAC CTTCGTGCTG TTCGCGCCAA GGGTCTGGGA AGGTATCGCC GCGGACGTTC GCGCCCGGGT GATGGACGCC TCGCCGCTGA AGCAGCGACT TTACAACGCC GGCATGAAGG CCGGGCTGGC TGCGCTCGAC CAGGGCCAGC ATTCGGCGTT CGCCGACGCG GTGCTGTTCC GCGCGCTGCG CGACCGGCTC GGCTTCACCC GGCTGCGCTC GGCCGCAACC GGCGGCGCCG CGCTCGGCCC GGACACCTTC CGATTCTTCC GCGCCATGGG AGTGCCGCTG CGCACGCTGT ACGGGCAAAC CGAACTGCTC GGCGCCTACA CGCTGCACAA GCCGGACGCG GTCGATCCCG ACACCACCGG CGTGCCGATG GGCGCCGAGA TCGAGATCAA GGTCCTGAAC CCCGATGTCC AGGGCATCGG CGAAGTGGTG GTGCGCCACC CCAACATGTT CCTCGGCTAC TACAAGAACC CGGAAGCCTC GACCGCCGAC ATCAAGGACG GCTGGATGCA TTCCGGCGAC GCCGGCTATT TCAACGGCGC CGGCCAGCTC GTCATCATCG ACCGCATCAA GGACCTTGCC GAGCTGTCGC ACGGCGAACG GTTCTCGCCG CAATACATCG AGAACAAGCT GAAGTTCTCG CCCTACGTCG CCGAAGCGGT GATCCTCGGC GCCGGCCGCG ACATGCTGGC GGCGATGATC TGCATCCGCT ACTCGATCAT CTCGAAATGG GCGGAGAAGA AGCGGATCGC CTTCACCACC TATTCGGACC TCGCCTCCCG CCCTGAAGTC TACGAGCTGC TGCGCCGCGA GGTCGAGACC GTCAACGCCA CGCTGCCACC GGCCCAGCGC ATCAGCCGCT TCCTGCTGCT CTACAAGGAG CTCGACGCCG ACGACGGCGA GCTGACGCGC ACCCGCAAGG TCCGCCGCTC GGTGATCAAC GAGAAGTACG GCGACATCAT CGACGGCATC TATAGTGGAC GTAGCGACAT CCCGGTCGAT ACCACCATCA AGTTCCAGGA CGGCACCACC CAGCGGATCC GCACCACGCT GAGGGTGGTC GACCTCGGCG CCGGCCACGC GCGTGCGGAG GCAGCGGAAT GA
|
Protein sequence | MMDYAGRVAA ADTFPKLLRL NAKEFGTEIA LREKDLGLWR VFTWADYQAR VHDFALGMVE LGLKRGDVIG IIGDNRPDWV SAEIATHAIG GLSLGLYRDV LDEEADYLLN YGEAKLVFAE DEEQVDKLLG LADRAPHLKH IVYSDPRGMR KYDDPRLLPA DQLAKLGRDR ASREPGLYDR LVDATQGEDV AILCTTSGTT AHPKLAMLAS GRVLKHCATY LSFDPKGPDD EYVSVLPLPW IMEQVYALGK GLLCRMKVNF VEEPDTMMND FREIAPTFVL FAPRVWEGIA ADVRARVMDA SPLKQRLYNA GMKAGLAALD QGQHSAFADA VLFRALRDRL GFTRLRSAAT GGAALGPDTF RFFRAMGVPL RTLYGQTELL GAYTLHKPDA VDPDTTGVPM GAEIEIKVLN PDVQGIGEVV VRHPNMFLGY YKNPEASTAD IKDGWMHSGD AGYFNGAGQL VIIDRIKDLA ELSHGERFSP QYIENKLKFS PYVAEAVILG AGRDMLAAMI CIRYSIISKW AEKKRIAFTT YSDLASRPEV YELLRREVET VNATLPPAQR ISRFLLLYKE LDADDGELTR TRKVRRSVIN EKYGDIIDGI YSGRSDIPVD TTIKFQDGTT QRIRTTLRVV DLGAGHARAE AAE
|
| |