Gene Rpal_4033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4033 
Symbol 
ID6411716 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4322736 
End bp4324565 
Gene Length1830 bp 
Protein Length609 aa 
Translation table11 
GC content68% 
IMG OID642713915 
Productpeptidase M24 
Protein accessionYP_001993004 
Protein GI192292399 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0371285 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCGAGT CGCATTTCCA GAGCTTCGAG GAGCCCGAGA GCGGCGTCGC ACTCACCGCG 
CGGCTCTCGG CGTTTCGCGA AGAGCTGCTG CGGCGCAAGC TCACCGGATT TGCGATTCCC
CGCGCCGATC AGCAACAAAA CGAATATGTG CCGCCGTCCG ACGAGCGGCT GGCGTGGCTC
ACCGGCTTCA CCGGATCGGC TGGTCTCGCT TACGTGCTGA TCGATCAGGC CGCGCTGTTC
GTCGACGGCC GCTACACGTT GCAGGCCGCC AAACAGGTCG ACGGCAATGC CTGGCGCATC
GAGTCGCTGG TCGAACCGCC GCCGGAACGC TGGCTGGAGA CGCATCTGAA AGCCGGTGAT
CGCCTCGGAT TTGATCCGTG GCTGCACACT TCTTCGGCAG TCGAACGGAT GCAGGCCGCC
TGCGCCAAGG CCGGCGCCGA GCTGGTCGCG GTCGACGGCA ATCCGGTCGA CGCGGTGTGG
AGCGAACGTC CCGCGCCGCC GCTCGGACCG GTCACTGTCC ACGGCGTGGA ATTTGCCGGC
GAGAGCGAGG CCAGCAAGCT CGGCCGCATC AACGAGGAGC TGGCCCGGCT GAAGGCCGAC
GCGCTGGTGC TTTCGGATTC CCACGCCGTC GCCTGGACCT TCAACATCCG CGGCGCCGAC
GTCTCGCACA CGCCGCTGCC GCTATCCTAC GCGCTGCTGC CGAAGGATGG CCGCCCCACC
ATCTTCATCG ACGGCCGCAA GCTGTCGAAC AGCGTGCGCG ATCATCTTGA GCAGACCGCC
GACGTCGCCG AGCCGGCCGA GTTGGCGCCG ATGCTGCGCG AACTCGCCAA GACCGGCGCG
ACCATCGCGC TCGACAGCGC CACTGCGGCC GATGCACTGA CCCGGCTGAT CAAGGATGCC
GGCGGCAAGC CGCTGCGCGG CGCCGATCCG GTGGCGCTGC TCAAGGCCGT GAAGAACACC
GCCGAGATCG ACGGCACCCG CGCGGCGCAT CGCCGCGACG CGGTGGCGCT GGCGCGGTTC
CTGGCGTTCA TCGATGCGGA AGCGCCGAAG GGCGCACTGA CCGAGATCGA CGCGGTGGAG
GCGCTGGAGA CGTTCCGCCG CGACACCGGC GCGCTCAAGG ACGTGTCGTT CCCGACCATC
TCCGGCACCG GCCCGAACGG CGCGATCGTG CATTATCGCG TCACCCGCAA GAGCAACCGC
CGCATCCAGC CGGGCGACCT CTTACTGATC GATAGCGGCG CGCAGTATCA GGACGGCACC
ACCGACGTCA CCCGCACCAT CGCGGTCGGC GAGCCGACGG CGGAGATGCG CGACCGCTTC
ACCCGCGTAC TGCGGGGCCA TCTGGCAATC GCCCGCGCGG TGTTTCCCGA CGGCACCACC
GGCGCGCAGC TCGACACGCT GGCGCGGCAG TTCCTGTGGC AGGCCGGAAT CGATTTCGAG
CACGGCACCG GCCACGGCGT CGGCAGCTAT CTGTCGGTGC ATGAAGGCCC GGCGCGGATC
TCCAAACTCG GCACCACCCC GCTGAAGCGC GGCATGATCC TGTCCAACGA GCCGGGCTAC
TACAAGACCG ACGGCTTCGG CATCCGGATC GAGAACCTCG AACTGGTGGT CGAGAAACAG
ATCGACGGCG CCGAGAAGCC GATGAACGGT TTCGAGACCC TGACGCTAGC GCCGATCGAC
CGCCGCCTGA TCGACGTGGC GATGCTCAGC GCCGAGGAAC GCACCTGGCT CGACGCCTAC
CACGCCCGCG TCCGCGAAAC CGTCCGCCCT CACCTCGACG GCCCGACCCA ACTCTGGCTC
GACGCAGCGA CCGCGCCGCT GCAATCCTAA
 
Protein sequence
MFESHFQSFE EPESGVALTA RLSAFREELL RRKLTGFAIP RADQQQNEYV PPSDERLAWL 
TGFTGSAGLA YVLIDQAALF VDGRYTLQAA KQVDGNAWRI ESLVEPPPER WLETHLKAGD
RLGFDPWLHT SSAVERMQAA CAKAGAELVA VDGNPVDAVW SERPAPPLGP VTVHGVEFAG
ESEASKLGRI NEELARLKAD ALVLSDSHAV AWTFNIRGAD VSHTPLPLSY ALLPKDGRPT
IFIDGRKLSN SVRDHLEQTA DVAEPAELAP MLRELAKTGA TIALDSATAA DALTRLIKDA
GGKPLRGADP VALLKAVKNT AEIDGTRAAH RRDAVALARF LAFIDAEAPK GALTEIDAVE
ALETFRRDTG ALKDVSFPTI SGTGPNGAIV HYRVTRKSNR RIQPGDLLLI DSGAQYQDGT
TDVTRTIAVG EPTAEMRDRF TRVLRGHLAI ARAVFPDGTT GAQLDTLARQ FLWQAGIDFE
HGTGHGVGSY LSVHEGPARI SKLGTTPLKR GMILSNEPGY YKTDGFGIRI ENLELVVEKQ
IDGAEKPMNG FETLTLAPID RRLIDVAMLS AEERTWLDAY HARVRETVRP HLDGPTQLWL
DAATAPLQS