Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_4229 |
Symbol | |
ID | 3912037 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 4805360 |
End bp | 4806898 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637886132 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_487831 |
Protein GI | 86751335 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGACT TTATCACCCT CGATACGCTG GTGCGCAACA CGGCCCAGGC GCATCCGGAC CGGATCGCGG TGATCGACGG CGAGCGGCAA TTGCGCTACG CGGAATTCGA TGCGCTGATC GACCGGGTCG CAGCCGCCCT GCAACGCGAC GGCGTGCAAC CGACCGACGC GATCTCGATC TGCGCACTGT CCTCGCTCGA ATATGTCGCG ACCTTCCTCG GCGCGTTGCG CGCGGGCGTC GCGGTGGCGC CGCTGGCGCC GTCATCGACC GCGCAGGACT TCGCGGCGAT GGTGAAGGAC GCCTCGGCGA AGGTTCTGTT CACCGACGAT TTCGCCGCCG ACGCGATGAA GGCGGCCGCG ATCGATCCCG GGATCCGCCG CGTGGCGCTC GACGGTGGGG CGAGCGGCGC CGCCTTTGCG GACTGGATCG CTGCGGCGGA CCAAAAGCCG ACGCCGCTCG CGATCGATCC GGAATGGGTG TTCAACATCA TCTATTCGTC GGGCACCACC GGCACGCCGA AGGGCATCGT CCACACCCAC TATCTGCGCT GGCGGCAATA TGGCCAGCTC GATCCGCTCG GCTACGGCCC AGACGCGGTG ACGCTGCTGT CGACGCCGCT GTATTCCAAC ACGACGCTGG TGTGCTTCAA CCCGACGCTG GCCGGCGGCG GCACGCTCGT GCTGATGAAG AAGTTCGACG CCAAGGGCTT CCTCGACCTC GCGCAAAAAC ATCGCGTCAC CCACGCGATG CTGGTCCCCG TGCAGTATCG CCGCATCATG GCGCTGCCGG AATTCGGCGA TTACGACCTG TCGTCCTTCG TCGGCAAGTT CTGCACCTCG GCGCCGTTCG CGGCCGAGCT GAAGCGCGAC ATTCTCGCGC GCTGGCCCGG CGGCCTCACC GAGTATTACG GCATGACCGA GGGCGGCGGC TCCTGCGCGC TGCTCGCGCA CGAACATCCC GACAAGCTCG CTACCGTCGG CCAGCCGATG CCTGACCATG AGATTCGGCT GATCGACGAG GCCGGCAATT TCGTCGCCCA GGGCGAGATC GGCGAGATCG TCGGCCGCTC CGCGGTGATG ATGCAGGGCT ATCTCAACCA GCCGCAGAAG ACCGCCGAGA CGTTCTGGAC CGATAAGGAC GGCAACCGCT GGGTGCGCAC CGGCGACATC GGACGGTTCG ACGAGGACGG CTTTCTGACC TTGATGGACC GCAAGAAGGA CATGATCATC TCCGGCGGCT TCAACATCTA TCCGAGCGAC ATCGAGGCGA TCGCGAGCCA GCATCCCGAC GTGCTCGAAG TCGCGGTGGT CGGCATGCCT TCGGAAGACT GGGGCGAGAC GCCCGTCGCC TTCGCGGTGC CGCGCCCGGG CGCAGCGCTC GATCCGGCCG ATCTGAGGGC CTGGACCAAT GCCAAGGTCG GCAAGACCCA GCGGCTGTCC GATGTGACGT TGGTGGAGAC CCTGCCGCGC AGCGCGATCG GCAAGGTGCT GAAACGCGAA CTGCGCGATC AGCGGCTGGC GGCAGCGGGG AGGGACTAA
|
Protein sequence | MPDFITLDTL VRNTAQAHPD RIAVIDGERQ LRYAEFDALI DRVAAALQRD GVQPTDAISI CALSSLEYVA TFLGALRAGV AVAPLAPSST AQDFAAMVKD ASAKVLFTDD FAADAMKAAA IDPGIRRVAL DGGASGAAFA DWIAAADQKP TPLAIDPEWV FNIIYSSGTT GTPKGIVHTH YLRWRQYGQL DPLGYGPDAV TLLSTPLYSN TTLVCFNPTL AGGGTLVLMK KFDAKGFLDL AQKHRVTHAM LVPVQYRRIM ALPEFGDYDL SSFVGKFCTS APFAAELKRD ILARWPGGLT EYYGMTEGGG SCALLAHEHP DKLATVGQPM PDHEIRLIDE AGNFVAQGEI GEIVGRSAVM MQGYLNQPQK TAETFWTDKD GNRWVRTGDI GRFDEDGFLT LMDRKKDMII SGGFNIYPSD IEAIASQHPD VLEVAVVGMP SEDWGETPVA FAVPRPGAAL DPADLRAWTN AKVGKTQRLS DVTLVETLPR SAIGKVLKRE LRDQRLAAAG RD
|
| |