Gene Rpal_4858 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4858 
Symbol 
ID6412544 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp5221207 
End bp5222598 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content66% 
IMG OID642714735 
Productpeptidase M16 domain protein 
Protein accessionYP_001993822 
Protein GI192293217 
COG category[R] General function prediction only 
COG ID[COG0612] Predicted Zn-dependent peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGCAT CCGCCGTCTT GCGCCGTCGC CCGCTCGCCG CGCTCGCCGT CTTTGCCGCC 
GTGGTGGTGG GAACGCCTGC TGCGGCGCAG ACCGTCACTG CCGCGCCGCC CGCGACCTTC
ACGCTGCAGA ACGGCCTTCG GGTGGTGGTG ATTCCGGATC ACCGCACGCC GGTGGTGACC
CAGATGATCT GGTACAAGGT CGGCTCCGCC GATGAGACGC CCGGCAAGTC GGGGCTGGCG
CATTTCCTCG AACATCTGAT GTTCAAGGGC ACCGAGAAGC ATCCGGCCGG CGAGTTCTCG
CAGACCGTGC TGAAGATCGG TGGCAACGAG AACGCGTTCA CCTCCTATGA TTTCACCGGC
TACTTCCAAC GCGTGCCGCG GTCCCACCTC GAACAGATGA TGACGTTCGA GGCCGATCGC
ATGACCGGCC TGGTGTTGAA GGACGAGAAC GTGCTGCCGG AGCGCGACGT CGTGCTCGAA
GAGTACAACA TGCGGGTTGC TAACGATCCG GATGCGCGGC TGACCGAGCA GATCATGGCG
GCGCTGTACC TCAATCATCC CTACGGCCGG CCGGTGATCG GCTGGCATCA GGAAATCGCC
AAGCTCGATC GCGAGGATGC CCTGGCGTTC TATCGCCGGT TCTACGCGCC CAACAACGCC
ACGCTGGTGA TCGCCGGCGA TATCGAGGCC GATGAGGTTC GCCCGCTCGC CGAGCGGATC
TATGGAACGA TCCCGGCGCA GCCGGCGATC CCGCCGCAGC GCATCCGCCC GCAGGAGCCG
ACGCCGGCGG GGCCGCGCAC GGTGACGCTC GCCGATCCTC GCGTCGAACA ACCGGCGGTG
CGGCGCTATT ACCTGGTACC GTCGGCCCAC ACCGGCGCCA AGGGCGACAG CGCCGCGCTG
GAAGTGCTGG CGCAACTGCT CGGCCATGGC AGCAATTCGT ATCTGTACCG CGCGCTGGTG
ATCGACAATC CACTGGCGAT CACGGTGGGC GCCAACTACC AGGGCAATGC GCTCGACGAC
AGCTACTTCA TCGTCGCCGG CACGCCGAAG CCGGGTGTCG ATTTTGCCGC GATCGAGAAG
AAGATCGACG AAGTGATCGC CGACGTCGTC GCCAACCCGG TTCGCTCCGA GGACCTGGAG
CGGGTCAAGA CCCAGCTGAT CGCCGCCGCG GTCTATGCGC AGGACAATCA GGCGACTCTG
GCGCGCTGGT ACGGCCAGGC GCTGACCACC GGTCTCAGCG TCCAGGACGT GCAGAGCTGG
CCCGACCGCA TCCGCGCCGT TACGTCCGAT GATGTGCGTG CCGCCGCCAA GCAATGGCTC
GACCGCAACC GCTCGGCGAC CGGCTATCTG GTCACCGGAC CCGCCGCCAA GCAGGAGGAG
AAGCGCTCGT GA
 
Protein sequence
MTASAVLRRR PLAALAVFAA VVVGTPAAAQ TVTAAPPATF TLQNGLRVVV IPDHRTPVVT 
QMIWYKVGSA DETPGKSGLA HFLEHLMFKG TEKHPAGEFS QTVLKIGGNE NAFTSYDFTG
YFQRVPRSHL EQMMTFEADR MTGLVLKDEN VLPERDVVLE EYNMRVANDP DARLTEQIMA
ALYLNHPYGR PVIGWHQEIA KLDREDALAF YRRFYAPNNA TLVIAGDIEA DEVRPLAERI
YGTIPAQPAI PPQRIRPQEP TPAGPRTVTL ADPRVEQPAV RRYYLVPSAH TGAKGDSAAL
EVLAQLLGHG SNSYLYRALV IDNPLAITVG ANYQGNALDD SYFIVAGTPK PGVDFAAIEK
KIDEVIADVV ANPVRSEDLE RVKTQLIAAA VYAQDNQATL ARWYGQALTT GLSVQDVQSW
PDRIRAVTSD DVRAAAKQWL DRNRSATGYL VTGPAAKQEE KRS