Gene Rsph17025_2821 details

Gene Information

Locus tag: Rsph17025_2821
Symbol:
ID: 5085100
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009428
Strand:
Start bp: 2870106
End bp: 2871905
Gene Length: 1800 bp
Protein Length: 599 aa
Translation table: 11
GC content: 69%
IMG OID: 640484392
Product: peptidase U35, phage prohead HK97
Protein accession: YP_001169013
Protein GI: 146278854
COG category:
COG ID:
TIGRFAM ID: [TIGR01543] phage prohead protease, HK97 family
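
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and chosen for illustration only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2870106
END_BP = 2871905
GENE_LENGTH_BP = 1800
PROTEIN_LENGTH_AA = 599


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1800
print(expected_protein_length(GENE_LENGTH_BP))  # 599
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields listed in the record.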


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.445416
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGACA ACGTCACCTT GCTGACCCGC CGCGCCGACC TGGCACCGGC CAGTGCCAAC 
CGCGATGATC GCACTGTCGA GGTGATCTGG TCCACCGGCG CGCCCGTGCG CCGCCGCGAC
ATGGTAGGGC AATACATCGA ACGCCTCAGC CTTGCGCCAG AGGCGGTGGA CCTGTCGCGA
CTGCAAGGGG CCAGCGTGCT GGATGCCCAC CGGCAATCCG CCGTCCGTGA TGTGCTGGGC
AGTGTGCAGT CCGCCAGCGT CGACGGCCAG CGCGGCACGG CGCTGATCCG GTTCTCGGCC
CGCCCCGAGG TGGAGCCGCT CTGGCAGGAC GTGCTGTCGG GCATCCTGCG CCATGTCTCG
GTAGGCTATT CGGTCGAGGA ATGGGCAGAG ACCACCGAGA GCGGCGCGCG GGTACTGACC
GCCGTCCGCT GGACACCCCA CGAGATTTCC CTGGTGCCGA CGCCCGCCGA CCCCGGCGCC
CACATCCGCA TGGAGACCCA TATGACCGAC ACCACCATCC CTGCCCCGCC CGAGGCGCAG
ACTCGCGCCA CGATCAACAC CGAAATCCGA TCCATCGCCC GCATCGCCGG GCTGGACCAG
TCCTGGATCG ACGGTCAGAT CGATGCCGCA GCCGATGCCG ACACCGCCCG CCGCGCAGCA
TTCGAGGTGT TGGCGACCCG CAGCGCGCCG ACGATCCGCA CCGAACAGGT CCGCGTCGAA
ATGGGCGACA GCCAGGACGA CCCGGCACTG CGCGCGCGGC AGATGGGCGA GGCCCTCTAT
GCGCGCATCA ACCCGCGCCA TGACCTCAGC GAACCCGCCC GCCGCTATGC CTATGCCACG
CCTGTGGACA TGGCGAAGGA ACTGCTGACC CTGCGCGGCG AGTCGACCAT GGCCCTGTCG
CCCGCCAGTC TCGTCACCCG TGCCCTGCAT ACGACATCCG ACTTCCCGAT CATCCTTGGC
AACACCGTGG GCCGCGTGCT GCGCGATGCC TACCAGGCCG CCCCTTCCGG CATCCGCCGC
CTCGGTCGCC AGACCTCGGC GCGGGATTTC CGGGCGGTGA ACAAGATCAT GCTGGGCGAG
GCGCCGCTCC TCGAGAAGCT GAACGAGGCG GGCGAGATCA AGGCCGGGAC CATGGCCGAG
GCGCGCGAGG CCTACAAGAT CGAGACTTGG GCCCGGAAGA TCGGCATCAC CCGGCAGGTG
TTGGTGAACG ACGACCTCGG CGCCTTCGCG GACCTTGCCC GCCGCATGGG CCAGGGCGCA
GCCGAGACCG AGGCGCGCAT CCTCGTCACC CTGTTGGAGG CGAACAGCGG CAACGGCCCG
ACCCTGTCGG ACAACAAGGC GCTGTTCCAT GTCGATCACG GCAACCGCGC GACGACGGGT
GCTGTGATCT CCGACGCCAC CCTGTCGGCC GCGCGACTGG CGCTGCGGAC CCAGAAGGGC
ATCGAGGGCC GCGTGATCCG CGTGACGCCG AAGAACCTGC TGGTCCCGCC CGCGCTTGAG
ACCGTGGCCG AGAAGTGGCT GGCGACCATC GCACCCGCCA CAGCCGCCGA TGTGAACCCG
TTCTCGGGGG CGATGTCGCT GGTCGTTGAA CCCCGCCTGT CCAGCGCGAC CCGCTGGTAT
GTCACCGCCG ACCCCGGCGA GATCGACGGG CTGGAGTTCG CCTACCTCTC GGGCAACGAG
GGGCCCCAGG TGGAAAGCCG GTCAGGGTGG GATGTGGACG GTGTGGAAAT CCGGGTGATC
CTGGACTTCG GCGCAGGCTT CATCGACCAC CGCGGCTGGT TCCAGAACCC CGGGGCGTAA
 
Protein sequence
MTDNVTLLTR RADLAPASAN RDDRTVEVIW STGAPVRRRD MVGQYIERLS LAPEAVDLSR 
LQGASVLDAH RQSAVRDVLG SVQSASVDGQ RGTALIRFSA RPEVEPLWQD VLSGILRHVS
VGYSVEEWAE TTESGARVLT AVRWTPHEIS LVPTPADPGA HIRMETHMTD TTIPAPPEAQ
TRATINTEIR SIARIAGLDQ SWIDGQIDAA ADADTARRAA FEVLATRSAP TIRTEQVRVE
MGDSQDDPAL RARQMGEALY ARINPRHDLS EPARRYAYAT PVDMAKELLT LRGESTMALS
PASLVTRALH TTSDFPIILG NTVGRVLRDA YQAAPSGIRR LGRQTSARDF RAVNKIMLGE
APLLEKLNEA GEIKAGTMAE AREAYKIETW ARKIGITRQV LVNDDLGAFA DLARRMGQGA
AETEARILVT LLEANSGNGP TLSDNKALFH VDHGNRATTG AVISDATLSA ARLALRTQKG
IEGRVIRVTP KNLLVPPALE TVAEKWLATI APATAADVNP FSGAMSLVVE PRLSSATRWY
VTADPGEIDG LEFAYLSGNE GPQVESRSGW DVDGVEIRVI LDFGAGFIDH RGWFQNPGA