Gene Rsph17029_0683 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_0683 
Symbol 
ID4895092 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp686494 
End bp688290 
Gene Length1797 bp 
Protein Length598 aa 
Translation table11 
GC content71% 
IMG OID640111267 
Productpeptidase M24 
Protein accessionYP_001042568 
Protein GI126461454 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCCAGA CGTTCCATGC GACTTCCTCC CCGGCTCAGG GGCCGGCCCG GCTTGCCGCC 
CTGCGGCAGG CGCTGGCCGC CGACGGGCTG GCCGGTTTCC TCGTGCCGCG CTCGGACGCG
CATCAGGGCG AATATGTGGC CGCGCGCGAC GACCGGCTGC AATGGCTGAC GGGCTTCACG
GGCTCGGCCG GCTTCTGCCT CGTGCTGCCC GAGGTGGCGG GCGTCTTCAT CGACGGTCGC
TACCGGGTGC AGGTAAAGCA TCAGGTGGAT CTGGCCCATT TCACGCCGGT GGCCTGGCCC
GAGATCCAGC CTGGCGACTG GCTGCGTGAA AAACTTTCCC AAGGCGCGAT CGGCTTCGAT
CCCTGGCTCC ATACCGCCGA CGAGATCGCG CGGCTCGAAA CGGCGCTGGC GGGCTCCGGC
ATCACGCTCA GGCCGGTGGA GAACCCGCTC GACCGGCTGT GGGCCGATCA GCCCGAGCCG
CCGATGGGCC GCGCCTTCGC TCATCCCGAC GCGCTCGCGG GCGAGACGGG CGAGGCCAAG
CGCCAGCGTC TGGCTCAGAC GCTCGCCGCC GCGGGCCGCA GGGCGGTCGT GCTGAGCCTG
CCCGACTCGA TCTGCTGGCT TCTGAACATC CGGGGATCGG ACGTACCGCG CAATCCGGTG
CTCCACGCTT TTGCCGTGCT GCACGACGAT GCCCGCGTGA CCCTCTTTGC CGAGGCTGCG
AAGTTCGACG AGGCCACCCG CGCGCATCTG GGCGCGGGCG TGACGCTGCG CCCGCCGCAG
GCCTTCGTGC CGGCGCTACG CACCCTCACG GGCCCGGTGC AGGTCGACCG CAAGACGGCG
CCGCTGGCTG TGCTCCTCGA GCTGCAGGAT GCCGGCGTGG AGGCGGTGGA CGGCGACGAT
CCCTGCCGCC TGCCCAAGGC CTGCAAGAGC GCGGCCGAGA TCGCGGGCAT GCGCGACGCC
CATCTGCGCG ACGGGGCGGC CATGGTCGAA TTCCTGACCT GGCTCGATGC CGAGGCCCCG
AAGGGCGGCC TGACCGAGAT CGACGTGGTG ACGGCGCTCG AGGGCTTCCG CCGCGCGACC
AATGCGCTCC ACGACATCAG CTTCGACACG ATCTGCGGCG CGGGTCCGAA CGGCGCGATC
ATGCATTACC GCGTGACCGA CGGCTCGAAC CGCCCGGTGC AGCGGGACGA GCTGCTGCTC
GTCGACTCGG GCGCGCAATA TGCCGACGGC ACCACCGACA TCACCCGCAC GGTCGCGGTG
GGCGATCCGG GGCAAGAGGC GCGGGAATGC TACACCCGCG TCCTGCAGGG GCTGATCGCG
ATCAGCCGCG CGCGCTGGCC CAAGGGGCTG GCCGGGCGCG ACCTCGACGC GCTCGCCCGC
TATCCGCTCT GGCTCGCCGG ACAGGATTAC GATCACGGCA CGGGCCACGG CGTCGGCGCG
TTCCTGTCGG TCCACGAGGG GCCGCAGCGC ATCGCCCGCA TCTCCGAGGT GCCGCTGGAG
CCGGGCATGA TCCTGTCGAA CGAGCCCGGC TATTACCGCG AAGGCGCCTT CGGGATCCGG
CTGGAAAACC TGATCGTCGT CGAGGAGGCC CCTGCCCTCG GCGACAACCG CCGGCAGTTG
GCCTTCGAGA CCCTGACCTT CGTGCCCTTC GACCGCCGCC TGATCCTGAC GCAGCTCCTG
TCCTCGGCAG AGCGGGACTG GATCGACGCC TACCACCGGG ACGTTCTCGA AAAGATCGGT
TCACGCCTGT CGCCCGCGGC CCGGGACTGG CTGGAAGCGG CGGCTGCGCC TCTTTGA
 
Protein sequence
MFQTFHATSS PAQGPARLAA LRQALAADGL AGFLVPRSDA HQGEYVAARD DRLQWLTGFT 
GSAGFCLVLP EVAGVFIDGR YRVQVKHQVD LAHFTPVAWP EIQPGDWLRE KLSQGAIGFD
PWLHTADEIA RLETALAGSG ITLRPVENPL DRLWADQPEP PMGRAFAHPD ALAGETGEAK
RQRLAQTLAA AGRRAVVLSL PDSICWLLNI RGSDVPRNPV LHAFAVLHDD ARVTLFAEAA
KFDEATRAHL GAGVTLRPPQ AFVPALRTLT GPVQVDRKTA PLAVLLELQD AGVEAVDGDD
PCRLPKACKS AAEIAGMRDA HLRDGAAMVE FLTWLDAEAP KGGLTEIDVV TALEGFRRAT
NALHDISFDT ICGAGPNGAI MHYRVTDGSN RPVQRDELLL VDSGAQYADG TTDITRTVAV
GDPGQEAREC YTRVLQGLIA ISRARWPKGL AGRDLDALAR YPLWLAGQDY DHGTGHGVGA
FLSVHEGPQR IARISEVPLE PGMILSNEPG YYREGAFGIR LENLIVVEEA PALGDNRRQL
AFETLTFVPF DRRLILTQLL SSAERDWIDA YHRDVLEKIG SRLSPAARDW LEAAAAPL