Gene Rpal_4821 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4821 
Symbol 
ID6412507 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp5187337 
End bp5189322 
Gene Length1986 bp 
Protein Length661 aa 
Translation table11 
GC content65% 
IMG OID642714698 
Productbeta-lactamase domain protein 
Protein accessionYP_001993785 
Protein GI192293180 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2015] Alkyl sulfatase and related hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.415697 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCACGAA TTATACCGTC GATGATACTT GCTACCACCG CGGCGCTGTC GCTGCTGGCT 
GCACCGCTGG CTGCCCAGCC CAACGACGCC GAGCCGGCGA CGCGGGCGGC GAACGAGGTG
GTGAACAAGT CCCTGCCTCT TGCCGATCGG GCCGATTTCG AGGACGCGCA GCGCGGCCTG
ATCGCCTCGC TGCCCGATGG CGTGGTCCCC GGGCCGGCGG GGGCGCCGGC CGCGTGGGAC
CTCAAGCAGT ACGACTTCCT CAAGGGCGAT CAGCCCTCCG CGACGGTCAA TCCCAGCCTG
TGGCGGCAGG CGCAGCTCAA CCTTGCCAGC GGCCTGTTCC AGGTGGCCGA GCGGGTCTAT
CAGGTCCGCG GGCTCGACAT CGCCAATGTC ACGATCGTCG AGGGCGACAC CGGCCTGATC
ATCACCGACA CCACGTTGAC GGTGCAGACC GCCAAGGCGG CACTCGATCT GTACTACCAG
CACCGGCCCA AGAAGCCGGT GCTGGCGCTG ATGTACACCC ACAGCCACAT CGACCATTTC
GGCGGCGCCC GCGGGTTGAT CGACGAGGCG GATGCGGCGA GCGGAAAGGT CAAGGTGATC
GCGCCGACCG GCTTCTTGGA ACATGCGGTC GCCGAGAACG TCATCGCCGG CAACGCGATG
AGCCGCCGCG CGCAATTCCA GTTCGGCACG CAGCTCCCGG TCGGTGAGCG CGGTCAGGTC
GATGCCGGCC TCGGCAAGGC GCTGGCCAAG GGCACGGTGT CGCTGATCGC GCCGAACGAC
CTGATCAAAC AGCCCTATGA GACGCGCAGC ATCGACGGCG TCGAGATCGA ATTCCACCTG
GTGCCGGAGT CGGAGGCGCC TTCGGAGATG ATCTCGTACT TTCCCCAGTT CAAGGTGCTG
AACATGGCGG AGGACACCAC CCACACGCTG CACAATCTCT ATACCCTGCG CGGCGCCGCG
ATCCGCGACG GCCGGCTGTG GTCGAAATAC ATCGGCGAGG CGATCGAGCG CTATGGCGAC
AAGACCGACG TAGTGATCGC GCAGCACAAC TGGCCGGTGT GGGGCCGTGA CCGCGTCGTC
GGCTATCTGA AGAAGCAGCG CGACGTTTAC AAGTTCATCC ACGACCAGAG CGTGCGGCTG
CTCAATCACG GCCTGACGCC GACCGAGATC GCCGAGCGGT TGACGCTGCC GCCGTCGTTG
ACGAGCGAAT TCGCCGCGCG CGGCTATTAC GGCTCGGTCA GCCACAACGC CAAGGCGGTG
TATCAGTTCT ATCTCGGCTG GTACGACGCC AACCCGGCCG ATCTCAATCC GCTGCCGCGC
GCCGAGCAGG CCAAGAAGGA GATCGACTAT ATGGGTGGCG CCGCTGCGGT GCTGGCGCGC
GCCCGCGACG ACTACAAGGC TGGGCAATAT CGCTGGGTGG CGACGGTGGC CAGCAAACTG
GTGTTCGCCG ATCCCGCCAA CACCGAAGCC CGCGCGCTCG GTGCCGACGC GCTGGAGCAG
CTCGGCTATC AGGCCGAAGC TTCGACCTGG CGCAACGCCT ATCTGCTCGG CGCGCAGGAA
CTGCGCAACG GTTTGATCAA GACCGATTCG GTCACCTCCA ATCCCGATCT GCTCAAGGGC
GTGTCGATCG ATCTGGCGTT CGACTTCCTC GCGGTGCGGC TGAACGCGGC GAAGGCCGAG
GGCAAGCACA TCGTGGTGAA CTGGACCTTC ACCGATCTGA AGGAAACCTA CACCATGAAC
CTGGAGAACT CGGCGCTGAC CCACATCTCC GGCAAGCTGT CCGACAACGC CGACGTCAGC
GTCACCCTGA ACCGCGCCAC CTTCGACGCG ATCTCGCTGA AGCAGCGCGG CTTCCTCGGC
GCGGTGCTGA GCGGCGACCT CTGGGTCAGC GGCAATCCGC TGAAGCTGCG CGAACTGTTC
GGCCTGTTCG AAGACTTCTC ACCGAACTTC GAAGTGATCG AGCCGGTCAA GGCGAAGGTG
GAGTAG
 
Protein sequence
MARIIPSMIL ATTAALSLLA APLAAQPNDA EPATRAANEV VNKSLPLADR ADFEDAQRGL 
IASLPDGVVP GPAGAPAAWD LKQYDFLKGD QPSATVNPSL WRQAQLNLAS GLFQVAERVY
QVRGLDIANV TIVEGDTGLI ITDTTLTVQT AKAALDLYYQ HRPKKPVLAL MYTHSHIDHF
GGARGLIDEA DAASGKVKVI APTGFLEHAV AENVIAGNAM SRRAQFQFGT QLPVGERGQV
DAGLGKALAK GTVSLIAPND LIKQPYETRS IDGVEIEFHL VPESEAPSEM ISYFPQFKVL
NMAEDTTHTL HNLYTLRGAA IRDGRLWSKY IGEAIERYGD KTDVVIAQHN WPVWGRDRVV
GYLKKQRDVY KFIHDQSVRL LNHGLTPTEI AERLTLPPSL TSEFAARGYY GSVSHNAKAV
YQFYLGWYDA NPADLNPLPR AEQAKKEIDY MGGAAAVLAR ARDDYKAGQY RWVATVASKL
VFADPANTEA RALGADALEQ LGYQAEASTW RNAYLLGAQE LRNGLIKTDS VTSNPDLLKG
VSIDLAFDFL AVRLNAAKAE GKHIVVNWTF TDLKETYTMN LENSALTHIS GKLSDNADVS
VTLNRATFDA ISLKQRGFLG AVLSGDLWVS GNPLKLRELF GLFEDFSPNF EVIEPVKAKV
E