Gene Rsph17025_3072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_3072 
Symbol 
ID5083159 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp3141845 
End bp3143641 
Gene Length1797 bp 
Protein Length598 aa 
Translation table11 
GC content70% 
IMG OID640484644 
Productpeptidase M24 
Protein accessionYP_001169261 
Protein GI146279102 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.782089 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCCAGA CGTTCCATGC GACTTCCTCC CCGGCCCAGG GGCCGGCCCG GCTTGCGGCG 
CTGCGCGCGG CGCTGACGGC CGACGGGCTG ACGGGGTTCA TCGTTCCACG CTCGGATGCC
CATCAGGGCG AATATGTGGC CGCCCGGGAC GAGCGTCTCC AGTGGCTGAC GGGCTTTACC
GGCTCGGCCG GCTTCTGCAT CGTGCTGCCC GACCTCGCGG GCGTCTTCAT CGACGGCCGC
TACCGGGTTC AGGTGAAGCA TCAGGTGGAT CCCGGCCATT TCACGCCCGT TCCCTGGCCC
GAGGTGCAGC CGGGTGACTG GCTGCGTGAA AATCTTTCCC AAGGCACGAT CGGCTTCGAT
CCCTGGCTCC ATACGGCCGA TGAGATCTCG CGGCTCGAGG CGGCGCTGGC GGGCTCCGAC
ATCAGCCTGC GCGCGGTGGA GAACCCGCTC GACCGGCTCT GGGCCGACCA GCCCGAGGCG
CCGATGGGGC GCGCCTTTGC CCATCCCGAC GCCCTCGCAG GCGAGACGGG CGAGGCCAAG
CGCCAACGCC TCGCGGCTGC GCTTGGCCTC GCCGGGCGCA AGGCCGCGGT CCTGACGCTG
CCGGACTCGA TCTGCTGGCT GCTGAACATC CGGGGCGCCG ATGTGCCGCG CAATCCGGTG
CTGCACGCCT TTGCCGTGCT GCATGACGAC GCCCGCGTGA CGCTCTTTGC CGACGCCGCG
AAGTTCGACG AGGCAACCCT CGCGCATCTG GGCCAGGGTG TGACCCTGCG CCCGCCGCAG
GCCTTCGTGC CGGCCCTGCG CACACTCGGC GGCCCGGTGC AGGTGGATCG CAAGACCGCC
CCGCTGGCCG TGACGCTCGA GCTGCAGGAT GCCGGGATCG AGGTGGCCGA CGGCGACGAT
CCCTGCCGGC TGCCAAAGGC CTGCAAGACC CCGGCCGAGA TTGCCGGCAT GCGCGACGCC
CACCTGCGCG ACGGGGCCGC GATGGTCGAG TTCCTCTGCT GGCTCGACGC CGAGGCGCCA
AAGGGCGGCC TCACGGAAAT CGCCGTGGTG ACCGCGCTCG AGGGCTTCCG CCGGGCAACC
AACGCGCTCC ACGACATCAG CTTCGACACG ATCTGCGGCG CAGGCCCCAA CGGCGCGATC
ATGCATTACC GCGTGACCGA GGGCTCGAAC CGCCCCGTGC AGCGGGACGA GCTGTTGCTC
GTCGATTCGG GTGCGCAATA TGCCGATGGC ACGACCGACA TCACCCGCAC CATTGCCGTG
GGCGACCCCG GCGAGGAGGC GCGCGAGTGC TACACGCGGG TGCTGCAGGG CCTGATAGCC
ATCAGCCGCG CGCGCTGGCC GAAAGGTCTC GCCGGGCGCG ACCTTGATGC GCTGGCGCGT
TACCCGCTGT GGCTTGCGGG ACAGGACTAC GATCATGGCA CCGGCCACGG CGTCGGCGCC
TTCCTCTCGG TCCACGAGGG ACCGCAGCGG ATTGCCCGCA TCTCGGAGGT GCCGCTCGAG
CCGGGCATGA TCCTCTCGAA CGAGCCGGGC TACTACCGCG AGGGCGCCTT CGGCATCCGG
CTGGAAAACC TGATCGTCGT CGAGGAAGCG CCGGGGCTTG GCGATCATCG CCGGCAGTTG
TCGTTCGAGA CCCTGACCTT CGTGCCCTTC GACCGGCGGC TGATCCTGCC CCATCGCCTC
TCGCTCCCCG AGCGGGAATG GCTGGATGCC TACCATGCGG ATGTTCTCGA AAGGATCGGA
TCGCGCCTTT CACCCCCGGC GCGGGCGTGG CTGGGGGCGG CGGCTGCGCC TCTTTGA
 
Protein sequence
MFQTFHATSS PAQGPARLAA LRAALTADGL TGFIVPRSDA HQGEYVAARD ERLQWLTGFT 
GSAGFCIVLP DLAGVFIDGR YRVQVKHQVD PGHFTPVPWP EVQPGDWLRE NLSQGTIGFD
PWLHTADEIS RLEAALAGSD ISLRAVENPL DRLWADQPEA PMGRAFAHPD ALAGETGEAK
RQRLAAALGL AGRKAAVLTL PDSICWLLNI RGADVPRNPV LHAFAVLHDD ARVTLFADAA
KFDEATLAHL GQGVTLRPPQ AFVPALRTLG GPVQVDRKTA PLAVTLELQD AGIEVADGDD
PCRLPKACKT PAEIAGMRDA HLRDGAAMVE FLCWLDAEAP KGGLTEIAVV TALEGFRRAT
NALHDISFDT ICGAGPNGAI MHYRVTEGSN RPVQRDELLL VDSGAQYADG TTDITRTIAV
GDPGEEAREC YTRVLQGLIA ISRARWPKGL AGRDLDALAR YPLWLAGQDY DHGTGHGVGA
FLSVHEGPQR IARISEVPLE PGMILSNEPG YYREGAFGIR LENLIVVEEA PGLGDHRRQL
SFETLTFVPF DRRLILPHRL SLPEREWLDA YHADVLERIG SRLSPPARAW LGAAAAPL