Gene Information |
Locus tag | RPB_1560 |
Symbol | |
ID | 3908759 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 1758035 |
End bp | 1759966 |
Gene Length | 1932 bp |
Protein Length | 643 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637883456 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_485181 |
Protein GI | 86748685 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1022] Long-chain acyl-CoA synthetases (AMP-forming) |
TIGRFAM ID | |
|
|
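The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in Protein Length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 1758035
end_bp = 1759966
gene_length_bp = 1932
protein_length_aa = 643

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon is a stop,
# which does not contribute an amino acid to the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length consistent:", expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks pass for this record: 1759966 - 1758035 + 1 = 1932 bp, and 1932 / 3 - 1 = 643 aa.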
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.610668 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGGACT ATGCCGGCCG CGTCGCGGCA GCCGATACGT TTCCGAAGCT GCTGCGGCTG AATGCGCGAG AGTTCGGCGA ACAGATCGCC TTGCGCGAAA AATCGCTCGG ACTGTGGCGT GTCTTCACCT GGGCCGACTA CCAGAGCCGG GTGCACGATT TCGCGCTCGG CATGATCGAG CTCGGGCTCG GTCGCGGCGA CGTCATCGGC ATCATCGGCG ACAACCGGCC GGACTGGGTG TCCGCCGAGA TCGCCACGCA CGCCATCGGC GCGATGAGCC TCGGGCTGTA TCGCGACGTG CTCGACGAGG AAGCCGCCTA TCTCCTGACC TATGGCGAGG CCAAGCTGGT TTTCGCCGAG GACGAGGAGC AGGTCGACAA GCTGCTGCTG CTCGCCGAGC GGGTGCCGAA CCTCAAGCAC ATCGTCTATT CGGATCCGCG CGGGATGCGC AAATACGACG ACCCGCGGCT GCTGCCGGCC GACACGCTGG CGGCGATGGG CCGCGCCCGC GCGGCGCGCG AGCCGGGGAT CTACGATCGC CTGGTCGATG CGACGGGCGG CGAGGACGTC GCGATCCTCT GCACCACTTC GGGCACCACC GCGCATCCCA AGCTGGCAAT GCTGGCCGCG GGGCGCGTGC TGAAACATTG CGCGACGTAT CTCAGCTTCG ACCCGAAGGG GCCGGACGAC GAATACGTCT CGGTGCTGCC GCTGCCGTGG ATCATGGAGC AGGTCTACGC GCTCGGCAAA GGGCTGTTGT GCCGGATGAA AGTCAATTTC GTCGAAGAAC CCGACACCAT GATGAACGAC TTCCGCGAGA TCGCGCCGAC CTTCGTGCTG TTCGCGCCGC GCGTCTGGGA AGGCATCGCC GCGGATGTGC GCGCGCGGGT GATGGACGCC TCGCCACTGA AACAGCGGCT GTACGAGACC GGCATGAAAG CGGGGCTCGC GGCGCTCGCC GAGGGCAAGC ATTCGGCCTT CGCCGACGCC GTGCTGTTCC GCGCGCTGCG CGACCGGCTC GGCTTCACGC GGCTGCGCTC GGCGGCGACC GGCGGCGCGG CGCTCGGCCC GGATACTTTC AAGTTCTTCC GCGCGATGGG CGTGCCGCTG CGCACGCTGT ACGGCCAGAC CGAACTGCTC GGCGCCTACA CGCTGCACAG AGCGGACGCG GTCGATCCCG ACACCACCGG CGTGGCGATG GGCGCCGAGA TCGAGATCAA GGTCGAGAAC CCCGATGTCC AGGGCATCGG CGAGATCGTG GTGCGCCACC CCAACATGTA TCTCGGCTAC TACAAGAACG AGGAAGCATC CAAAGCCGAC ATGCAGGACG GCTGGATGCA ATCCGGCGAC GCCGGCTATT TCAACGCGGC CGGCCAGCTC GTGGTGATCG ACCGCATCAA GGATCTCGCC GAGACCTCGC ACGGCGAGAA ATTCTCGCCG CAATATATCG AGAACAAGCT GAAATTCTCG CCCTATGTGG CCGAAGCCGT GGTGCTCGGC GCCGGCCGCG ACCGGCTCGC GGCGATGATC TGCATCCGCT ATTCGATCAT CTCGAAATGG GCCGAGAAGA AGCGGATCGG CTTCACCACC TATTCGGACC TCGCGTCGCG GCCCGAGGTC TACGAGATGC TGCGCAGCGA GGTCGAATCG GTCAATGCGA CGCTGCCGCC GGCGCAGCGG ATCGCCCGCT TCCTGCTGCT CTACAAGGAG CTCGACGCCG ACGACGGCGA ACTCACCCGC ACCCGCAAGG TCCGCCGCTC GGTGATCAAT GAGAAATACG CCGACATCAT CGACGGCATC 
TATGGCGGCC GGAGCGAGAT CCCGGTCGAC ACCCAGATCC AGTTTCAGGA CGGTACCACC CAACGCATCC GAACGACGCT GAAGGTGGTC GACCTCGCCA GCGGCCACGC GCATGCGGAG GCGGCGGAAT GA
|
Protein sequence | MMDYAGRVAA ADTFPKLLRL NAREFGEQIA LREKSLGLWR VFTWADYQSR VHDFALGMIE LGLGRGDVIG IIGDNRPDWV SAEIATHAIG AMSLGLYRDV LDEEAAYLLT YGEAKLVFAE DEEQVDKLLL LAERVPNLKH IVYSDPRGMR KYDDPRLLPA DTLAAMGRAR AAREPGIYDR LVDATGGEDV AILCTTSGTT AHPKLAMLAA GRVLKHCATY LSFDPKGPDD EYVSVLPLPW IMEQVYALGK GLLCRMKVNF VEEPDTMMND FREIAPTFVL FAPRVWEGIA ADVRARVMDA SPLKQRLYET GMKAGLAALA EGKHSAFADA VLFRALRDRL GFTRLRSAAT GGAALGPDTF KFFRAMGVPL RTLYGQTELL GAYTLHRADA VDPDTTGVAM GAEIEIKVEN PDVQGIGEIV VRHPNMYLGY YKNEEASKAD MQDGWMQSGD AGYFNAAGQL VVIDRIKDLA ETSHGEKFSP QYIENKLKFS PYVAEAVVLG AGRDRLAAMI CIRYSIISKW AEKKRIGFTT YSDLASRPEV YEMLRSEVES VNATLPPAQR IARFLLLYKE LDADDGELTR TRKVRRSVIN EKYADIIDGI YGGRSEIPVD TQIQFQDGTT QRIRTTLKVV DLASGHAHAE AAE
|
| |
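The sequences above are printed in space-separated blocks of 10 residues, so they need light cleaning before any computation. The following is a minimal sketch, not the database's own method. The helper names (`clean_sequence`, `gc_percent`, `ends_with_stop`) are hypothetical, and the stop codon set is the one used by translation table 11 (TAA, TAG, TGA). Only short excerpts of the gene sequence are used here, so the printed GC value applies to the excerpt, not to the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Drop the spaces and line breaks used to group residues into blocks."""
    return "".join(raw.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (block spacing is ignored)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def ends_with_stop(dna: str, stop_codons=("TAA", "TAG", "TGA")) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    seq = clean_sequence(dna)
    return len(seq) >= 3 and len(seq) % 3 == 0 and seq[-3:] in stop_codons


# Excerpts copied from the Gene sequence field: the first two blocks
# and the final two blocks.
head = "ATGATGGACT ATGCCGGCCG"
tail = "GCGGCGGAAT GA"

print(clean_sequence(head))   # ATGATGGACTATGCCGGCCG
print(gc_percent(head))       # 60.0
print(ends_with_stop(tail))   # True
```

Passing the full Gene sequence field to `gc_percent` would give a value to compare against the GC content field, and `ends_with_stop` offers a quick sanity check that a pasted sequence was not truncated.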