Gene Information |
Locus tag | Rpal_3760 |
Symbol | |
ID | 6411438 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 4026388 |
End bp | 4028697 |
Gene Length | 2310 bp |
Protein Length | 769 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 642713641 |
Product | condensation domain protein |
Protein accession | YP_001992734 |
Protein GI | 192292129 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | |
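The coordinate and length fields above agree with one another, and a short check makes the arithmetic explicit. This is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the protein length excludes a single terminal stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one uncounted terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Rpal_3760.
length_bp = gene_length_bp(4026388, 4028697)
length_aa = expected_protein_length_aa(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values match the Gene Length (2310 bp) and Protein Length (769 aa) fields in the record.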
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.62014 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAAGACGC AGGTCCCTTC CGCCGTCGGG CAAGAGCCGG AGATTGAACC GGTCCGCTGC ACCAGCATTC AAGAGCGGTT TCTCGAACTG CAGAGACACA ATCCAACCTC TTCGCTGGCC AATGTCGCGA TGCGCTGGCA GATGAAGGGA GCGGTGCGCG ACCAGACGAT CCAGAAGGCG CTGGACTTTC TCGCCGGGCG ACACGAGGCG CTCCGGACGA GATTTGAACG CTCGGCCGAC GGCTACTCAC AGATTGTCGT TGAAACTCCG CCGAGACTGT CGCTGATCGA TCTGTCGCAG CTATCGAGCG AGCGCTACCA GGCGGAGGCG GAGCGGCTTG CGTCATTGGA GGCGAGGGCG GCGTTCGATC CTTTGGTCGC GCCGCCCTGG CGAACGACCG TGCTTCGGCG CTCGCCGAAC GATGCGATCC TGCTGCTAAC GATCCATCAC GCCATCGCAG ATCTCTGGTC GGTCGGACTG ATCGCGCACG AATTTGCCCA AGTGGTCGAT GCACTGGAGC GCGGTCAAAG GCCAGATTGC CGCGAGTTGA GCCTTCAGTA CGTGGACTAT GCGCAGTGGC AGTCGGCGTT GCTGGCGAGT GACGGACTGG CCGATGAGCG ACAATACTGG CGTCGCAAGT TGTCGGGGGT GTGTCCACTC GATATCAAAC CCAGCAGGCA GCGCGTGCTC GATGCCGGCC GCAACGGCGA GATTAGGAGC CGAGTTCTTC CGAGGCTTTT GACGGATGCG CTGCAATCGC TGGCGCAGCA GGAAGGCTTC ACGATGTACG GGCTGACGTG TGCGGCCTTG ACGCGGGCGC TGGGCCGCAA GCTGGAGGCG GACGAGATCG TCATCGGAAC GCAGGTCGCC AACCGCGACG ATCCGTTGCT GTTCGATTTG GTCGGCCCCG TGGTCAACAC TGTGGTCCTG AGGCTCGGGA TCGAGGCGGG GAGCGATCCG CTCACGCACG CGCGCCGCAT CAGGGACGAG GTCTCGCAGG CATTGGCGAA CAAGGCGCTG CCTTTCGCCG AGATCGCCGC CGGCATCGAA CTGTCGGATC GGAAAGATCT GCCGCTCGGC TACGCCATCA ACTTTGCCGC CGGCAATGTC GAGACCGGCC AGGCCGGAGT GATCCGGCAC GGCGACTTCC AACTGGTGTC GCTGCCGTCG GCGACGACGG GCTGCCTCTA TGAATTCAGT TTCTTCTTGG TGGAGCGGCA GGAGGGGTGG CGGATCTCCT GCGAGTACAA CACCGATCAT CATGTTGCCG CCGCTGCCGA TGCGCTGCTG AAAGCGTGGT CGGAGGAGTT GGAGGCTCAG GTCGCTCCGA TCGAGAGCGA CCTGGGGAAC GCGAATCCGG CCTCGAAGGC GGATCAGCCG ACCGGTGATG TCGTTGCTCC TCTTGCCCTC TCGCTCGAAC GCAGGCTGCC GAACCGGCGT CCCACGCGCC CGTATTTTCA GCCGTGCGTC CTGCAGTCCG AAGGCAAGCT GCCGCCGATC TTCGCGTTGA GCAACAGGTC CTTGTACTAC CCGTTGGTGC AGCGGGTGGC GGCGGGGCGT CCCTTCATCG ACTTGCAATG GGCCGACGAC GCTGAGTTGC CGCCGCTCGA TCAGGACAGC ATTTGCAGGA TCGCGGCGGA TGCGGTTCGG CAGATCCGGG CGCACGATCC CGTCGGTCCG TACTATTTGA TCAGCCTTTG CCTGATGGGC AACGTCGCGC TCGAAGTTGC GCAGCAACTG AAGAGCCTCA ATGGCGAAGC GTCGGTGATC TTTCTGCTCG ACACGGTCGC GCCCGGCTAC 
GTGGAATCGA TGAGCCGGTT CGACCGTTTC CTGAGGCGAG TCCAACTGAC CGATCGGATC ATCCCCGATC TGGTCGCACG GATCCGGAAA GTCAGGTCCG GGGACATGAG CATTGCGGCC GCGATGTCGC AGTACTCGAT CATCAGGAAC AATCCACTGG TTCGATTGAT CGGCAGGGAC GGTGATGCGC CACGTCCACG AGCCACCAAC GAGGACTTCC TCAATCACGG GCTGATGGAC TATTTGCTGG ACGCCAGGGC TCTGTATCCC TGGAAGCCGT ATGACGGCGA AGTCGTTCTG TTCCGGAGCG CGAAATCACG TGTCGGACGG CTGTTCGTAC GCGGCCTGGG ATGGGACAAC GTCGTGAATG GGCAGTTGCG GATCTTCGAC GTTCCTGGTG GTCACGACGA CATGACCCGC GAGCCCGCGG TCGCCACCAT CGCCGAATAC ATGAGCTGGA TGATCGATCG GCGGGAGGGC CGCTGGTCCG CGCAGCGCGC CGCAGGTTGA
|
Protein sequence | MKTQVPSAVG QEPEIEPVRC TSIQERFLEL QRHNPTSSLA NVAMRWQMKG AVRDQTIQKA LDFLAGRHEA LRTRFERSAD GYSQIVVETP PRLSLIDLSQ LSSERYQAEA ERLASLEARA AFDPLVAPPW RTTVLRRSPN DAILLLTIHH AIADLWSVGL IAHEFAQVVD ALERGQRPDC RELSLQYVDY AQWQSALLAS DGLADERQYW RRKLSGVCPL DIKPSRQRVL DAGRNGEIRS RVLPRLLTDA LQSLAQQEGF TMYGLTCAAL TRALGRKLEA DEIVIGTQVA NRDDPLLFDL VGPVVNTVVL RLGIEAGSDP LTHARRIRDE VSQALANKAL PFAEIAAGIE LSDRKDLPLG YAINFAAGNV ETGQAGVIRH GDFQLVSLPS ATTGCLYEFS FFLVERQEGW RISCEYNTDH HVAAAADALL KAWSEELEAQ VAPIESDLGN ANPASKADQP TGDVVAPLAL SLERRLPNRR PTRPYFQPCV LQSEGKLPPI FALSNRSLYY PLVQRVAAGR PFIDLQWADD AELPPLDQDS ICRIAADAVR QIRAHDPVGP YYLISLCLMG NVALEVAQQL KSLNGEASVI FLLDTVAPGY VESMSRFDRF LRRVQLTDRI IPDLVARIRK VRSGDMSIAA AMSQYSIIRN NPLVRLIGRD GDAPRPRATN EDFLNHGLMD YLLDARALYP WKPYDGEVVL FRSAKSRVGR LFVRGLGWDN VVNGQLRIFD VPGGHDDMTR EPAVATIAEY MSWMIDRREG RWSAQRAAG
|
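The sequences in this record are displayed in space-separated blocks of 10 characters, so the spacing has to be removed before any computation such as GC content. The following is a minimal sketch of that cleanup and of a GC fraction calculation. The helper names are hypothetical, and only the first two display blocks of the gene sequence are used as example input. The result for the full gene is not asserted here.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace that separates the 10-character display blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two display blocks of the gene sequence, used as example input.
prefix = clean_sequence("ATGAAGACGC AGGTCCCTTC")
print(len(prefix), gc_fraction(prefix))
```

Passing the complete Gene sequence field through `clean_sequence` and `gc_fraction` would give a value that can be compared against the GC content field reported in the record.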