Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_1871 |
Symbol | |
ID | 4022353 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 2094920 |
End bp | 2096494 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637962064 |
Product | long-chain-fatty-acid--CoA ligase |
Protein accession | YP_569007 |
Protein GI | 91976348 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.394568 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.852293 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCAAG GCGAGCTCAC CCAAAGCCAT ATCACCCAAG GCCATCTCTC TCACGGCATC GTCAGTGGCG ACCGCCATCG GTCGTTCGAG GAGGTCAATG CGCGCGCCGC GCAGATCGCC GGCGGGCTGC AGGGCCTTGG CGTCAAACCC GGCGATTGCG TTTGCGTCTT GATGCGCAAC GACATCGCCT TTCTTGAATC CGCCTATGCG GTCATGATGC TCGGCGCCTA CATGGTCCCG GTGAACTGGC ACTTCAAACC AGAAGAGGTT CTCTACGTGC TGGGCGATTC CGGCACGCGC GTTTTGATCG GCCATGCCGA TCTGCTGCAC CACGCGGCTG GAATCGTGCC CGCAGCCGTC ACGATGCTGA GCGTGCCGAC GCCGCCGGAA ATCCTCGCGA GCTACAGCAT CGATCCCGAC CATCGCTCGG CTCCCGCCGG CGCGATCGAT CTCGACGGCT GGCTCGCGCA GCAGAGGCCA TATGACGGCC CGGCGCTGCC GCAGCCGCAG AACATGATCT ACACCTCGGG CACCACCGGC CATCCGAAGG GCGTCAAACG GTTCGCGCCG ACGCCGCAAC AGTCGGCGAA CGCCGAAGCC ATGCGCGCAG CGATCTACGG CCTGAAGCCG GGCGTCCGCG CGCTGTGCCC GGGGCCGTTG TATCACTCCG CGCCGAATTC GTTCGGCATT CGCGCCGGCC GGCTCGGCGG AGTGCTGGCG CTGATGCCGC GGTTCGAACC CGAAGCGCTG CTGCAACTGA TCGAGCAGCA CCGGATCGAC ACCGTGTTCA TGGTGCCGAC GATGTTCATC CGGCTGATGA AGCTGCCGGA AGCGGTGCGC AACAAGTACG ACGTGTCGTC GCTGCGCCAT GTCATCCATG CCGCGGCGCC GTGTCCCCCG GACGTCAAGC GCGCGATGAT CGACTGGTGG GGGCCGGTGA TCTACGAATT CTACGGCTCG ACCGAGAGTG GCGCGGTGAC GTTCGCGAGC TCCGAGGATG CGCTGAAGAA GCCCGGCACC GTCGGCAAGA TCGCTGCCGG CGCCGAGCTT GTGTTCGTCG ACGACGACAA TAACGAGGTG CCGCAGAGCG AGGTCGGCGA GATCTTCTCG CGGATCCCCG GCAATCCGGA TTTCACCTAT CACAACAAGC CGGAGAAACG CGCGGAGATC GACCGGGGCG GCTTCATCAC CTCGGGCGAC ATGGGCTATC TCGACGAGGA CGGCTACGTC TTCATCTGCG ACCGCAAGCG CGACATGGTG ATCTCCGGCG GCGTCAACAT CTATCCAGCC GAGATCGAGG CGGCGCTGCA CGCGATCCCG GGTGTGCACG ACTGTGCGGT GTTCGGGATT CCCGACGCCG AATTCGGCGA GGCGCTGATG GCGATGCTCG AGCCGCAGCC TGGCGTCACA CTGGAGCAGA GCAATATCCG CGAGCAGCTC AGGCTGTCGC TCGCCGGCTA TAAGGTTCCG AAGCACATCG AGATCATGGC GCAGCTGCCG CGCGAGGACT CCGGCAAGAT CTTCAAGCGC CGGCTACGCG ATCCGTATTG GGCCAAAGCT GGTCGGGTGA TTTAG
|
Protein sequence | MSQGELTQSH ITQGHLSHGI VSGDRHRSFE EVNARAAQIA GGLQGLGVKP GDCVCVLMRN DIAFLESAYA VMMLGAYMVP VNWHFKPEEV LYVLGDSGTR VLIGHADLLH HAAGIVPAAV TMLSVPTPPE ILASYSIDPD HRSAPAGAID LDGWLAQQRP YDGPALPQPQ NMIYTSGTTG HPKGVKRFAP TPQQSANAEA MRAAIYGLKP GVRALCPGPL YHSAPNSFGI RAGRLGGVLA LMPRFEPEAL LQLIEQHRID TVFMVPTMFI RLMKLPEAVR NKYDVSSLRH VIHAAAPCPP DVKRAMIDWW GPVIYEFYGS TESGAVTFAS SEDALKKPGT VGKIAAGAEL VFVDDDNNEV PQSEVGEIFS RIPGNPDFTY HNKPEKRAEI DRGGFITSGD MGYLDEDGYV FICDRKRDMV ISGGVNIYPA EIEAALHAIP GVHDCAVFGI PDAEFGEALM AMLEPQPGVT LEQSNIREQL RLSLAGYKVP KHIEIMAQLP REDSGKIFKR RLRDPYWAKA GRVI
|
| |