Gene Information |
Locus tag | Rpal_4821 |
Symbol | |
ID | 6412507 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 5187337 |
End bp | 5189322 |
Gene Length | 1986 bp |
Protein Length | 661 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 642714698 |
Product | beta-lactamase domain protein |
Protein accession | YP_001993785 |
Protein GI | 192293180 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2015] Alkyl sulfatase and related hydrolases |
TIGRFAM ID | |
|
|
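The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon (the listed gene sequence ends in TAG); the function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information table above.
start_bp = 5187337
end_bp = 5189322
gene_length_bp = 1986
protein_length_aa = 661


def span_length(start, end):
    """Length of an inclusive, 1-based coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length == gene_length_bp)        # True
print(computed_protein_length == protein_length_aa)  # True
```

Under these assumptions the table is internally consistent: the coordinates span 1986 bp, which corresponds to 662 codons, or 661 residues plus one stop codon.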
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.415697 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCACGAA TTATACCGTC GATGATACTT GCTACCACCG CGGCGCTGTC GCTGCTGGCT GCACCGCTGG CTGCCCAGCC CAACGACGCC GAGCCGGCGA CGCGGGCGGC GAACGAGGTG GTGAACAAGT CCCTGCCTCT TGCCGATCGG GCCGATTTCG AGGACGCGCA GCGCGGCCTG ATCGCCTCGC TGCCCGATGG CGTGGTCCCC GGGCCGGCGG GGGCGCCGGC CGCGTGGGAC CTCAAGCAGT ACGACTTCCT CAAGGGCGAT CAGCCCTCCG CGACGGTCAA TCCCAGCCTG TGGCGGCAGG CGCAGCTCAA CCTTGCCAGC GGCCTGTTCC AGGTGGCCGA GCGGGTCTAT CAGGTCCGCG GGCTCGACAT CGCCAATGTC ACGATCGTCG AGGGCGACAC CGGCCTGATC ATCACCGACA CCACGTTGAC GGTGCAGACC GCCAAGGCGG CACTCGATCT GTACTACCAG CACCGGCCCA AGAAGCCGGT GCTGGCGCTG ATGTACACCC ACAGCCACAT CGACCATTTC GGCGGCGCCC GCGGGTTGAT CGACGAGGCG GATGCGGCGA GCGGAAAGGT CAAGGTGATC GCGCCGACCG GCTTCTTGGA ACATGCGGTC GCCGAGAACG TCATCGCCGG CAACGCGATG AGCCGCCGCG CGCAATTCCA GTTCGGCACG CAGCTCCCGG TCGGTGAGCG CGGTCAGGTC GATGCCGGCC TCGGCAAGGC GCTGGCCAAG GGCACGGTGT CGCTGATCGC GCCGAACGAC CTGATCAAAC AGCCCTATGA GACGCGCAGC ATCGACGGCG TCGAGATCGA ATTCCACCTG GTGCCGGAGT CGGAGGCGCC TTCGGAGATG ATCTCGTACT TTCCCCAGTT CAAGGTGCTG AACATGGCGG AGGACACCAC CCACACGCTG CACAATCTCT ATACCCTGCG CGGCGCCGCG ATCCGCGACG GCCGGCTGTG GTCGAAATAC ATCGGCGAGG CGATCGAGCG CTATGGCGAC AAGACCGACG TAGTGATCGC GCAGCACAAC TGGCCGGTGT GGGGCCGTGA CCGCGTCGTC GGCTATCTGA AGAAGCAGCG CGACGTTTAC AAGTTCATCC ACGACCAGAG CGTGCGGCTG CTCAATCACG GCCTGACGCC GACCGAGATC GCCGAGCGGT TGACGCTGCC GCCGTCGTTG ACGAGCGAAT TCGCCGCGCG CGGCTATTAC GGCTCGGTCA GCCACAACGC CAAGGCGGTG TATCAGTTCT ATCTCGGCTG GTACGACGCC AACCCGGCCG ATCTCAATCC GCTGCCGCGC GCCGAGCAGG CCAAGAAGGA GATCGACTAT ATGGGTGGCG CCGCTGCGGT GCTGGCGCGC GCCCGCGACG ACTACAAGGC TGGGCAATAT CGCTGGGTGG CGACGGTGGC CAGCAAACTG GTGTTCGCCG ATCCCGCCAA CACCGAAGCC CGCGCGCTCG GTGCCGACGC GCTGGAGCAG CTCGGCTATC AGGCCGAAGC TTCGACCTGG CGCAACGCCT ATCTGCTCGG CGCGCAGGAA CTGCGCAACG GTTTGATCAA GACCGATTCG GTCACCTCCA ATCCCGATCT GCTCAAGGGC GTGTCGATCG ATCTGGCGTT CGACTTCCTC GCGGTGCGGC TGAACGCGGC GAAGGCCGAG GGCAAGCACA TCGTGGTGAA CTGGACCTTC ACCGATCTGA AGGAAACCTA CACCATGAAC CTGGAGAACT CGGCGCTGAC CCACATCTCC GGCAAGCTGT CCGACAACGC CGACGTCAGC 
GTCACCCTGA ACCGCGCCAC CTTCGACGCG ATCTCGCTGA AGCAGCGCGG CTTCCTCGGC GCGGTGCTGA GCGGCGACCT CTGGGTCAGC GGCAATCCGC TGAAGCTGCG CGAACTGTTC GGCCTGTTCG AAGACTTCTC ACCGAACTTC GAAGTGATCG AGCCGGTCAA GGCGAAGGTG GAGTAG
|
Protein sequence | MARIIPSMIL ATTAALSLLA APLAAQPNDA EPATRAANEV VNKSLPLADR ADFEDAQRGL IASLPDGVVP GPAGAPAAWD LKQYDFLKGD QPSATVNPSL WRQAQLNLAS GLFQVAERVY QVRGLDIANV TIVEGDTGLI ITDTTLTVQT AKAALDLYYQ HRPKKPVLAL MYTHSHIDHF GGARGLIDEA DAASGKVKVI APTGFLEHAV AENVIAGNAM SRRAQFQFGT QLPVGERGQV DAGLGKALAK GTVSLIAPND LIKQPYETRS IDGVEIEFHL VPESEAPSEM ISYFPQFKVL NMAEDTTHTL HNLYTLRGAA IRDGRLWSKY IGEAIERYGD KTDVVIAQHN WPVWGRDRVV GYLKKQRDVY KFIHDQSVRL LNHGLTPTEI AERLTLPPSL TSEFAARGYY GSVSHNAKAV YQFYLGWYDA NPADLNPLPR AEQAKKEIDY MGGAAAVLAR ARDDYKAGQY RWVATVASKL VFADPANTEA RALGADALEQ LGYQAEASTW RNAYLLGAQE LRNGLIKTDS VTSNPDLLKG VSIDLAFDFL AVRLNAAKAE GKHIVVNWTF TDLKETYTMN LENSALTHIS GKLSDNADVS VTLNRATFDA ISLKQRGFLG AVLSGDLWVS GNPLKLRELF GLFEDFSPNF EVIEPVKAKV E
|
| |
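As a spot check that the gene and protein sequences agree, the ends of the gene sequence can be translated and compared with the ends of the protein sequence. This is a minimal sketch: it builds the standard codon-to-amino-acid mapping, on the assumption that translation table 11 assigns the same amino acids to codons as the standard code, and it translates only the first 30 and last 15 bases copied from the record. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codon table laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate an in-frame DNA string; whitespace is ignored, '*' marks a stop."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First three 10-base blocks and last 15 bases of the gene sequence above.
first_bases = "ATGGCACGAA TTATACCGTC GATGATACTT"
last_bases = "GCGAAGGTGGAGTAG"

n_terminus = translate(first_bases)
c_terminus = translate(last_bases)

print(n_terminus)  # MARIIPSMIL
print(c_terminus)  # AKVE*
```

The translated ends match the first block of the listed protein sequence (MARIIPSMIL) and its final residues (AKVE), followed by the stop codon. Since the record gives the strand as "-", this also suggests the gene sequence is shown in coding orientation rather than as the forward strand of the replicon.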