Gene Rsph17029_0398 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_0398 
Symbol 
ID4896280 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp411333 
End bp412538 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content67% 
IMG OID640110982 
Productpeptidase M24 
Protein accessionYP_001042286 
Protein GI126461172 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACGTC CCGAAAACTA CCGTTTCCAC AATGGCGAGA AGGCCGCGCT GCCCTTCCCG 
CCCGAGGAAT ACGAGGCCCG GCTCGAGGGC CTGCGCGACC TGATGGAGCT GCATTCGCTC
GACGCGGTCG TGCTGACCTC GATGCACAAC GTGGCCTACT ATTCCGGCTT CCTCTACCTG
TCGTTCGGCC GCCCCTACGC CTGTGTGGTC ACTCCCACCG ACTGCGTCAC CGTCAGCGCG
GGCATCGATG GGGGTCAGCC CTGGCGGCGG AGCGTGGGCG ACAACATCAC CTACACCGAC
TGGCAGCGCG ACAATTTCTG GCGGACGGTC GCGCAGGTCA CCGGCACGGG CCGCGCCATC
GGCTGCGAGG CGGACCATCT GACCATGGTG CAGGCCGAGA AGCTGAACGC CTTCCTTAGG
CCCACGCGCG GCATGGACAT CGCCCCCGGC ACGATGGCGC AGCGGATGCT GAAATCTCCC
GCAGAGATCG CGCTCATCCG ACACGGCGCG CAGGTGGCGG ATGTGGGCGG CTATGCCATC
CGCGAGGCGA TCCGCGAGGG CGCGACCGAG CTCGAGATCG CCATGGTGGG GCGCGACGCA
ATGGAGCGCG AAATTGCGGC CCGCTTCCCC GAGGCCGAAT ATCGTGACAG CTGGGTATGG
TTCCAGTCGG GCCCGAACAC CGACGGTGCG CATAACCCGG TGACGAACCG GGCGCTCCGG
CGCGGCGACA TCCTCTCGCT CAACTGCTTT CCGATGATCT CGGGCTATTA CACCGCGCTC
GAACGCACGC TGTTTCTGGG CGAGGTGGAC GATGCCAGCC TGAAGATCTG GGAGGCGAAT
GTCGCCGCCC ATGAATATGG CATCTCGCTG CTTCAGCCGG GGGCCTCCTG CGCCGACGTG
ACGGCGAAGC TCAACGCGTT CCTCGAAGAG CGCGACCTCT TGCGCTACCG CACCTTCGGC
TATGGCCATT CCTTCGGCCT GCTCTCGCAC TACTACGGCC GCGAGGCGGG GCTGGAACTG
CGCGAGGATA TCGAGACGGT GCTCGAGCCC GGCATGGTGA TCTCGATGGA GCCGATGCTG
ACGCTCGGCG CAGGCCAGCC CGGCGCGGGC GGCTACCGCG AGCACGACAT CCTCGTCATC
ACCGAGGACG GGCCCGAGAA CATCACGGGC TATCCCTACG GCCCCGGCTT CAACGTGGTG
GGCTGA
 
Protein sequence
MERPENYRFH NGEKAALPFP PEEYEARLEG LRDLMELHSL DAVVLTSMHN VAYYSGFLYL 
SFGRPYACVV TPTDCVTVSA GIDGGQPWRR SVGDNITYTD WQRDNFWRTV AQVTGTGRAI
GCEADHLTMV QAEKLNAFLR PTRGMDIAPG TMAQRMLKSP AEIALIRHGA QVADVGGYAI
REAIREGATE LEIAMVGRDA MEREIAARFP EAEYRDSWVW FQSGPNTDGA HNPVTNRALR
RGDILSLNCF PMISGYYTAL ERTLFLGEVD DASLKIWEAN VAAHEYGISL LQPGASCADV
TAKLNAFLEE RDLLRYRTFG YGHSFGLLSH YYGREAGLEL REDIETVLEP GMVISMEPML
TLGAGQPGAG GYREHDILVI TEDGPENITG YPYGPGFNVV G