Gene Rsph17029_3357 details


Gene Information

Locus tag: Rsph17029_3357
Symbol:
ID: 4898453
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009050
Strand:
Start bp: 407432
End bp: 409594
Gene Length: 2163 bp
Protein Length: 720 aa
Translation table: 11
GC content: 72%
IMG OID: 640113956
Product: phage terminase GpA
Protein accession: YP_001045225
Protein GI: 126464112
COG category: [R] General function prediction only
COG ID: [COG5525] Bacteriophage tail assembly protein
TIGRFAM ID:
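
The start, end, gene length, and protein length values above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information table above.
start_bp = 407432
end_bp = 409594
gene_length_bp = 2163
protein_length_aa = 720


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # prints 2163
print(expected_protein_length(gene_length_bp))  # prints 720
```

Both results agree with the reported Gene Length and Protein Length.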


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.27922
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCGAGA TGCTCGACCG CGGCATCGGG CGGCTCACCC GCATTCCGCC CCTGCCGCCC 
TTCACCGCCC CCGAGGAGAT CCTGGCCGAC GCCCTGCCGC TCCTCGATCC GCCGAGCCGG
GTCACGGTGA CCGAGGCGGC CCAGCGGCAC ATGCGCGTGC CGGTGCAGGG CAACTGGGTG
CCCTTCGACC GGGCGGTGAC GCCCTATACC GTCGAGCCCG CGGACATGAC CCAGTCGCGC
CGCTTCAAGG CCGTGGTCTT TCTCGGGCCG TCGCAGAGCG GCAAGAGCCA GATGATGCAG
TCGGTCTCGG CCCATGCCGT CACCTGCGCG CCGGGCCCGG TGCAGGTCAT CCACATGACC
AAGACCGATG CCGATGCGTG GGTGGAGGAG AAGCTCGACC CCACGATCCT GAACAGCCCG
GCGCTCCGCG AGCGGCTGGG GACCGGGCGC GACGACAGCA CCTTCAGCCG CAAGCGCTTC
AAGGGCATGC GGCTCACCAT CGGCTATCCG GTGCCGAACC AGCTCTCGAG CCGGTCGCAG
CGCCTCGTGA TGCTGACCGA TTACGATCAC ATGCCCCAGA AGCTGGGGCC GAAGGACAGC
CCGGAAGGTT CGCCCTTCGG CATGGCGCTG CAGCGGATCC GCACCTTCAT GAGCCGGGGC
TGCGTCCTGG CGGAATCCTC GCCCGCCTTC CCGGTGGACC CGAATGCGGA CTGGGCGCCG
CATGCGGGCC ATCCGCACAT GCTGCCGCCG GCCACGGCCG GGCTCGTGCC GATCTACAAC
GAGGGCACGC GCGGGCGCTG GTACTGGGAA TGTCCGGACT GCGGCGATCT CTTCGAGCCG
CGCTTCGACC GGCTGCATTA CGACGCGGAG CTCGATCCGG GCGCCGCCGG CGAGCAGGCG
ATGATGGAAT GCCCGCACTG CGGAACGCTC ATCGCCCACC GTCACAAGGT CGGCCTCAAC
CGCGCCGCGC TCGAGGGTCG TGGCGGCTGG CTGCACGAGG GCCGCCACAT CGAGGCGAAC
GGGCGCCGGG CGCTGGTCCG GATCGACGAT CCCGACATCC GGCGCACGCC CATCGCGAGC
TACAGTCTGA ACGGGGCCGC CGCGGCCTTC GCCTCGTGGG AAGAGCTGGT CCAGCGCTAC
GAGACCGAGC GGCGGCGGTT CGAGGCGCTC GGCGACGACA CCGACTTCGC CCGGGTGCAT
TACACCGACA TCGGCGTGCC CTACCGGCGC CCGGAGGCCG AAGAGGAGGG CGCCCTCACC
GCGGCGCAGA TCCGTGAGCA CATGCGCGAG CAGGAGAGGC GCCTCGCCCC GGCCTGGACG
CGCTTCGTCA CGGTCTCGAT CGACGTGCAG GGCAACCGCT TCGAGGTGCT GGTCATGGCC
TGGGGCGCGC AGGGCGAGCG GATGCCGATC GACCGGTTCG CCGTGGCGCA GCCTCCCGAC
CATGCCCCGC GCGCGAAGGG CTGTGACGAC CGCTACCGGG CGCTCGACCC CGGGCGCTAT
GTCGAGGATG CCGATGCGCT CCTCGATCTG CCCGAGCGTC TCTACCCGGT GGAGGGGGCG
AGCTGGAGCC TGAAGCCCTG CGCGCTGGTG ATCGACTTCA ATGGCCCTGC CGGCTGGTCG
GACAATGCCG AGAAGTTCTG GCGCGCGCGC AGGCGCGACG GTCAGGGCGG GCTCTGGTGG
CTCTCGATCG GCCGCGGCGG CTTCCAGCAG CGCGACCGGG TCTGGCACGA GGCGCCGGAG
CGGGGCTCGA AGGGCAGGCG GGCGCGCGGC ATCAAGCTGC TGAACATGGC GACCGACCGG
ATGAAGGAGA GCGTCCTCGC GGCCGTCGGC CGGTTCGAGG GCGGTCAGGG CGCCCAGCAT
GTGCCCTCTT GGCTCGAGGC GGAGCATCTC GACGAGCTCC TCGCCGAGCG CCGGGGCGCC
AAAGGCTACG AGAAGCGCAC GCCCGCCGCC CGCAACGAGA CGCTCGACCT CTCGGTGCAG
GCGTTGGCCG TGGCGGAGTT CAAGGGGCTG AACCGGATCG ACTGGGAGGC GCCGCCCGCC
TGGGCCGAGG CGGGGCCCGC CAACCCGTTC GCCGTGGCCG TGTCCGCGGC TGCGGCAGAG
GCCGCACCGG CCCCGCGCCG GCGCGCGCGG ACCTCGCGCT CGCGATACAT GGAGGGATCA
TGA
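
The gene sequence is displayed in blocks of 10 bases separated by spaces. The following is a rough sketch of how such a listing could be cleaned and its GC fraction computed for comparison with the reported GC content. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first display line is used as a sample here.

```python
def clean_sequence(raw: str) -> str:
    """Drop the spaces and newlines used for display and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; an empty sequence is rejected."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First display line of the gene sequence above, used as a small sample.
first_line = "ATGGTCGAGA TGCTCGACCG CGGCATCGGG CGGCTCACCC GCATTCCGCC CCTGCCGCCC"
sample = clean_sequence(first_line)
print(len(sample), round(gc_fraction(sample), 2))
```

Passing the full listing to `clean_sequence` and then to `gc_fraction` gives a value that can be compared with the GC content field in the Gene Information table.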
 
Protein sequence
MVEMLDRGIG RLTRIPPLPP FTAPEEILAD ALPLLDPPSR VTVTEAAQRH MRVPVQGNWV 
PFDRAVTPYT VEPADMTQSR RFKAVVFLGP SQSGKSQMMQ SVSAHAVTCA PGPVQVIHMT
KTDADAWVEE KLDPTILNSP ALRERLGTGR DDSTFSRKRF KGMRLTIGYP VPNQLSSRSQ
RLVMLTDYDH MPQKLGPKDS PEGSPFGMAL QRIRTFMSRG CVLAESSPAF PVDPNADWAP
HAGHPHMLPP ATAGLVPIYN EGTRGRWYWE CPDCGDLFEP RFDRLHYDAE LDPGAAGEQA
MMECPHCGTL IAHRHKVGLN RAALEGRGGW LHEGRHIEAN GRRALVRIDD PDIRRTPIAS
YSLNGAAAAF ASWEELVQRY ETERRRFEAL GDDTDFARVH YTDIGVPYRR PEAEEEGALT
AAQIREHMRE QERRLAPAWT RFVTVSIDVQ GNRFEVLVMA WGAQGERMPI DRFAVAQPPD
HAPRAKGCDD RYRALDPGRY VEDADALLDL PERLYPVEGA SWSLKPCALV IDFNGPAGWS
DNAEKFWRAR RRDGQGGLWW LSIGRGGFQQ RDRVWHEAPE RGSKGRRARG IKLLNMATDR
MKESVLAAVG RFEGGQGAQH VPSWLEAEHL DELLAERRGA KGYEKRTPAA RNETLDLSVQ
ALAVAEFKGL NRIDWEAPPA WAEAGPANPF AVAVSAAAAE AAPAPRRRAR TSRSRYMEGS
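
The protein sequence corresponds to the gene sequence translated with translation table 11. The following is a minimal sketch of that translation, written without external libraries. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. The sketch does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace from the block display is ignored. Trailing bases that do not
    form a full codon are skipped. Ambiguous bases (for example N) raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two display blocks of the gene sequence above.
print(translate("ATGGTCGAGA TGCTCGACCG"))  # prints MVEMLD
```

The output matches the first six residues of the protein sequence listed above.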