Gene RSP_4021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSP_4021 
SymbolrepC2 
ID3712040 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides 2.4.1 
KingdomBacteria 
Replicon accessionNC_007488 
Strand
Start bp22769 
End bp24070 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content69% 
IMG OID640069318 
Productreplication initiation protein RepC 
Protein accessionYP_345185 
Protein GI77404611 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCTGCGTG ACATCGCCAT GCACCACATT TCCCTGACGC CCTTCGGGCG CCAGCCGGTG 
ACGGCTGGGA TCCTCGCGAG CCAGCGGCTC GCCGAGGCGC GGTCCCCCCT GCCCGAGATC
GACAAATGGG CGCTGTTTGC CGACCTCCGC ACCGCGCGCG CTCGCTTCGG CGTCTCGGAC
CGGGATCTGG CCGTCCTCTA TGCGCTGCTG AGCTTCCTGC CCGGCCGGGT TCTGACCGAC
GCGGCGCCGC TCGTGGTTTT TCCGTCGAAT GCGACGCTTT CGGACCGGGC GCATGGCATG
GCCGAGAGCA CGCTGCGCCG TCATCTCGCG GCGCTGGTCG AGGCAGGGCT CGTGGCGCGG
CGCGACAGTG CGAACGGCAA GCGCTACGCC CACCGCAACC GCGCGGGCGA GCTGGTGACG
GCCTATGGCT TCGACCTGCG CCCCCTGCTC GTCCGGGCGG CGGAGATCGC CGAGACGGCG
GCGGCTCTCG AGGCTGAAGC CGAAGCGATG GCGCGGGCCC GGACGGCCTT GGTGCTGAAG
CTCCGGGATG CGACCAAGCT CGCCGCCTAC GCCGATCCGG AGGGCCGCGC TGCGGGGATC
GAGCCGCGGC TGATGCCGAT CCGGCGTGCC CTGCGCCGGC GACTGACGCT GGCCGATCTG
GCCGAGATGG CGCGGACGCT CGACCTGGTG CTCGGGGACA TCCGGAGGAC GCTGGCCCTT
CGCGCAGCGG AAATGGACGG CACAGTCGGC CGGATCGAGC GGCACCATAC AGAATCAAAG
ACAGAATCTT CTGATCTTGA AGGCGGGCAT GCCTGTGGAA AGACCAGCGG CGAGGCTGAG
AAAGATTGCC GAAACATCGG CGGTGAGAGC CCGGACGAAA AGGCTAACAG ACCCTCACCC
AACCGGGCGG AACCTGACGA AGCGCGCCCC GAGGAACTCG CCTCGCTGGA GATTAACGAC
AATCAGCGGC CTGCCCCGAC ACTTCCGCTG GGACTTGTCC TGAAAGCCTG CCCGGATCTG
GAACCCTATG CGCCCGAGAT CCGCAACTGG CGCGACCTGA CCGCCGCTGC GAGCCGGCTG
CGGGGCATGA TGGGCATCAC ACCCTCCGCC TGGGAAGAGG CAAAGGCGGA AATGGGCGCC
GAGACGGCGT CCGTTGCGCT CGCCGCCATC CTGCAGCGCT TCGCCTCGAT CCGGAATCCG
GGCGGCTATC TCCGGGCCCT CACCCGTCGT GCCAGCGAGG GTGCCTTCTC GCCCACGCCG
ATGATCATGG CGCTGCTCGC CGAGACACGT GAGGCCGCTT GA
 
Protein sequence
MLRDIAMHHI SLTPFGRQPV TAGILASQRL AEARSPLPEI DKWALFADLR TARARFGVSD 
RDLAVLYALL SFLPGRVLTD AAPLVVFPSN ATLSDRAHGM AESTLRRHLA ALVEAGLVAR
RDSANGKRYA HRNRAGELVT AYGFDLRPLL VRAAEIAETA AALEAEAEAM ARARTALVLK
LRDATKLAAY ADPEGRAAGI EPRLMPIRRA LRRRLTLADL AEMARTLDLV LGDIRRTLAL
RAAEMDGTVG RIERHHTESK TESSDLEGGH ACGKTSGEAE KDCRNIGGES PDEKANRPSP
NRAEPDEARP EELASLEIND NQRPAPTLPL GLVLKACPDL EPYAPEIRNW RDLTAAASRL
RGMMGITPSA WEEAKAEMGA ETASVALAAI LQRFASIRNP GGYLRALTRR ASEGAFSPTP
MIMALLAETR EAA