Gene Rpal_2733 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_2733 
Symbol 
ID6410397 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp2969955 
End bp2971325 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content70% 
IMG OID642712609 
ProductPeptidase M23 
Protein accessionYP_001991717 
Protein GI192291112 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0739] Membrane proteins related to metalloendopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGTACC GTTCCGGCCA CCCATCCGCA GCGATTCACC CCCACGGCCA CCAACAAGTC 
CAGGCGCCCC GGCCGGCGGC GCCACGCCCG GCACGTCCGC CTCAGCGGCC GGCGCCGACC
GGTAACAGCT ACACCATCGC GCATGCCGGC CGTCAGGTGC GGATCGGGCC GGTGCTGTTC
TGGATCGTGG TCGGCAGCAT CGTGCTACTC GGCTGCTGGA GCGCGGCGAC CGCTACGTAC
TTCACCTTCC GTGACGACGT ACTGACGCGG CTGATCGCCC GCCAGGCCGA GATGCAGTAC
GCGTATGAGG ATCGCATCGC CGAGCTGCGC GCCAAGGTCG ATCGCACCAC CAGCCGGCAG
CTGCTCGACC AGGAGCAGTT CGACCAGAAG CTCGAACAGA TCATGCGGCG GCAGTCGCTG
CTTGAGTCGC GCGCCGGCGC GCTCAGCGCC CTGCCCGACG TCGGCGTCAC AGGCAGCATC
AAGCCGACCC GAACACCGTC GTTCGAAGCC GAGACCAACA GCCGGCCGAA GCCGTCGCCG
ATCAACGACA CCGTGATCTT CGTGGCGCCG CCGGACCGCG AGGCGCGGCT GGAGTCGCGA
ACGTCGCCGT CGGCACTCGA ACAGGCGCCA ACCCAATACG CCAAGAGCCA GGGCGTCGAG
AGCGTCCTGA TGCGGCTGCA GACCTCGCTC GATCAGGTCG AGCGCCGCCA GGTAGCCTCG
CTCGGCGCGG TCGAAGAGAG CTTCGAGTCG CGCGCGCGCC GGATGCGCGG CGTGCTGACC
GATCTCGGCC TCGACGCCCG CGGCATCGAA GCCTCCGCGC CGCGCGCCGC CGTTGGCGGC
CCGTTCGTGC CGGTGAAACA GCCGGGCGCC AACGCCAGCG CGTTCGACCG CCAGCTGTAC
CGGATCTACA TCAGCCGCTC GCAGTTCGAA CGCCTCAACC GCGCCCTCGC CCTGGTGCCG
TATCGCAAGC CGGTGCTCGG CGAAGTCGAA TTCTCCTCGG GCTTCGGCGT CCGCTCCGAT
CCGTTCCTCG GCCGTCCGGC GATGCACACC GGGCTCGACT TCCGCGCCTC CACCGGCGAT
CCCGTCCGCG CCACTGCGGT CGGCAAGGTG GTGAATGCCG GCTGGCAGGG CGGCTACGGC
CAGATGGTCG AGATCGACCA CGGCAACGGC CTGTCGACCC GCTACGGCCA TCTGTCGAAG
ATCATCGCCA AGGTCGGCCA GAGCATCCAG ATCGGCCAGG TGATCGGCGA AGTCGGCTCC
ACCGGCCGGT CCACCGGTCC GCATCTGCAC TACGAAACCC GCATCGACGG CGAAGCCGTC
GACCCGCAGA AATTCCTGCG CGCCGGCGTG CGGCTGGCGG GCGCGGGTTA G
 
Protein sequence
MSYRSGHPSA AIHPHGHQQV QAPRPAAPRP ARPPQRPAPT GNSYTIAHAG RQVRIGPVLF 
WIVVGSIVLL GCWSAATATY FTFRDDVLTR LIARQAEMQY AYEDRIAELR AKVDRTTSRQ
LLDQEQFDQK LEQIMRRQSL LESRAGALSA LPDVGVTGSI KPTRTPSFEA ETNSRPKPSP
INDTVIFVAP PDREARLESR TSPSALEQAP TQYAKSQGVE SVLMRLQTSL DQVERRQVAS
LGAVEESFES RARRMRGVLT DLGLDARGIE ASAPRAAVGG PFVPVKQPGA NASAFDRQLY
RIYISRSQFE RLNRALALVP YRKPVLGEVE FSSGFGVRSD PFLGRPAMHT GLDFRASTGD
PVRATAVGKV VNAGWQGGYG QMVEIDHGNG LSTRYGHLSK IIAKVGQSIQ IGQVIGEVGS
TGRSTGPHLH YETRIDGEAV DPQKFLRAGV RLAGAG