Gene Rsph17025_1750 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_1750 
Symbol 
ID5083656 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp1785521 
End bp1787233 
Gene Length1713 bp 
Protein Length570 aa 
Translation table11 
GC content63% 
IMG OID640483310 
Productphage terminase 
Protein accessionYP_001167948 
Protein GI146277789 
COG category[R] General function prediction only 
COG ID[COG4626] Phage terminase-like protein, large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCTCG ATGTGAGCTT CGCCTGCCCG GATTGGGCAG ACAGATTGAA GCGTGGCGAG 
GTGCCGTTTC CGGCTTTGCC GCTCGACCCG GTTGCAGCGG AGGCGGCGGT CGATCTCTTC
AACCTCCTGC GAATCCCGGA TGTCACCGGG CAGCCGACGA TGGGCGAGGT CGCTGGTGAG
TGGTTCCGCG AGGTGATCCG GGCGGCTTTC GGATCGATCG ATCCCGCGAC GGGAAAGCGG
TTCGTGGGGG AGATCTTCAA CCTCATCCCG AAGAAGAACT CGAAGACGAC GAACGCTGCG
GCTTTGGGGC TGATCGCGCT CCTGATGAAT CGGCGCCCGA ACATCGACGG CGTGATCATC
GGGCCGACGC ATGAGGTTGC CCAGAAGTGC TTCGATCAGG CAGCCGGGAT GATCGAAGCC
GATCCCTACC TGCGCAAGCG GTTCAAGGTG ATCGAACACA AGAAGACGAT CCTGGACCTG
CACAAGGATG AGAGCACCGG GACGCGGATG AATGCGAAGC TGAAGATCAA GAGCTTCGAT
CCGAAGGTTG TTACCGGCTC GATCCCGGCA TTCGCCATCA TCGACGAGCT GCACCTCATG
GCGGAGATGA GCCATGCGGA GCGCGTGATC GGACAGATCC GCGGCGGCAT GATCACGAAC
GATGAGAGCC TTCTGATCAT CATCACGACC CAGTCGGAGA TTGTCCCCAC AGGCGTCTTC
AAGTCGGAAC TGGATTACGC TCGAGGCGTC AGGGACGGCC GGATCACCGC ATCCGTGCGC
ATGCTGCCGA TCCTCTACGA GTTCCCGGAA GAGGTGCAGC GCGACGAGGC GAAGCCGTGG
CGCGACCCGA AGCTGTGGCC GATGGTCCTG CCGAATCTGG GCCGGTCCGT CACGATCGAG
CGCCTGGTGC AGGACTATCA CACCGCGGTG GAGAAGGGTG CCGCCGAGGA GATCAGGTGG
GCGTCCCAAC ACCTCAACAT CGAGATCGGT CTGGGTCTCC ACGCGAACCG CTGGGTGGGC
GCAGACTACT GGCTGAAGAA CGCGGACCCC GACCTGACCT TCGAGCGCAT GCTCGAAGAA
TGCGAGGTCA TCGTATTGGG CGGCGACGTG GGTGGCGCGG ACGACCTATG CAGCCTCGCC
GCCATAGGTC GGCATCGGGA GACCAGGCTG TGGCAAGCGT GGGGCTGGGC GTGGTGCGTC
AGGGACGTCC TCGTCAGGCG CAAGGAGATC GCGCCCAGGC TTGAGGAGCT GCGGGACGCA
GGTGAGTTGC GGATCACCGC GACGGCAGAC GAGCACACAA TCGAGATGGT CGAGATCTGC
GCCCGTGTCC GCGATGCAGG TCTGATGCCG GAAAAGCTGG GGATAGGGCT CGATCCGCAT
GGCGTGGCGG CTCTGGTCGA TGCCCTGGAG GCAGAGGGTT TCGACCCGAA CGTGCACATC
ATGGCCGTTG GGCAGGGCTA CAAGCTGAAC GGCGCGGTCA AGGGGCTTGA GCGCCGCCTC
CTTGACGGGA GGCTGCGTCA CGGGGGTCAG CGCCTCATGA ACTGGGCGGT CGGCAACGCG
AAGTCGGAGC AGAAGGGCAA CAACGTGTAC ATCACGAAGC AAACTGCTGG CATCGCGAAG
ATCGATCCAC TGATCGCGTT GTTCAACGCG GCGATCCTGA TGGACATGAA CCCGACGGCT
GCTTCTGCGG CCTTCGAATA CACCGGGATG TAA
 
Protein sequence
MPLDVSFACP DWADRLKRGE VPFPALPLDP VAAEAAVDLF NLLRIPDVTG QPTMGEVAGE 
WFREVIRAAF GSIDPATGKR FVGEIFNLIP KKNSKTTNAA ALGLIALLMN RRPNIDGVII
GPTHEVAQKC FDQAAGMIEA DPYLRKRFKV IEHKKTILDL HKDESTGTRM NAKLKIKSFD
PKVVTGSIPA FAIIDELHLM AEMSHAERVI GQIRGGMITN DESLLIIITT QSEIVPTGVF
KSELDYARGV RDGRITASVR MLPILYEFPE EVQRDEAKPW RDPKLWPMVL PNLGRSVTIE
RLVQDYHTAV EKGAAEEIRW ASQHLNIEIG LGLHANRWVG ADYWLKNADP DLTFERMLEE
CEVIVLGGDV GGADDLCSLA AIGRHRETRL WQAWGWAWCV RDVLVRRKEI APRLEELRDA
GELRITATAD EHTIEMVEIC ARVRDAGLMP EKLGIGLDPH GVAALVDALE AEGFDPNVHI
MAVGQGYKLN GAVKGLERRL LDGRLRHGGQ RLMNWAVGNA KSEQKGNNVY ITKQTAGIAK
IDPLIALFNA AILMDMNPTA ASAAFEYTGM