Gene Rsph17029_3072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3072 
Symbol 
ID4898949 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp85376 
End bp87100 
Gene Length1725 bp 
Protein Length574 aa 
Translation table11 
GC content70% 
IMG OID640113674 
Productheme peroxidase 
Protein accessionYP_001044944 
Protein GI126463831 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0304079 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.293212 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGCCCG CCTATCCGCT TCCCAACCCC GACCCGCAAC CCTCTCCTGG GCTCGAGCCG 
CGCGAGGTAT TCCGCTTCTA CTTCCGCAAT GCCCCCGAGG TGGCGGCCGA TCCCGAGATC
GAGGGCAGGA TCCTCGCGCT GGCCGACGCC ATGGCCGAGG CGAGCCTGAC CCAGGCCGAT
CCCGAAGGAG AGTCCAGCAT TCCGGCGGTC TTCACCTACT TCGGCCAGTT CATCGACCAC
GACATCACGG CCAATACCGA CCGCGACGAG CCCGGCTTCG ACATCGACCG GGCCGACATC
GACCCGCAGC CGCGCGATCT GGTGGAAAAG GAGAAGCTGA ACCTGCGCAA CGGCCGGCTG
AACCTCGACA GCCTCTACGG GGACGGCCCC GGCCAGTCGC CCGCGGCGGC GAAGATGGAA
GCGGCGATGC GCGACCCCGC CGATCCCGCC AAGATGCGCC TCGGGCCGGT GCAGCCGGTC
AGCAACGGAT CGCCGGGCTT CATCCGGCCG GATCTGCCCA CGGACAACCT GGCCGACATC
CCCCGGGTCG GCGCCGCCAT CGACGACAGC AGCCTGACAC TGGAGGAAGC CCGCGAGCTG
ATCGGCGAGG CCGACGATGC CAAGCTGCGC CTCTCCGCGC TGATCGGCGA CGGGCGCAAC
GACGAGAATC TGATCGTGGC CCAGCTGCAC CACAGCTTCC TGCGGCTGCA CAATGCGCTG
GTGGACAAGC TCCGGGCCGA TGGCGCGGCC ACGGGCGACG ACGCGCTCTT CGCTCTGGCG
CGGCAGCACA CGACCTGGAT CTACCAGTGG ATGGTGGTGA ACCTCTTCCT GAAGGCGGTC
TGCGACCCCT CCGTCGTGGA GGATGTGCTC GAGAAGCGCG CCCCGCTCTA CCGGGCCTTC
TTCGAGGCGC ACAAGGCCTC GGTGCCCGAA GGCGCCCTGC CGATGCCGCT CGAGTTCAGC
GTGGCGGCCT TCCGCTACGG CCATTCGATG GTGCGGGGCG ACTACGATTA CAACCGCAGC
TTCGGCGTCG CGGTCGACGG CACGCCGCAG ACCCGCGCCA GCTTCGAGCA GCTCTTCGCC
TTCACCGGCG GGGGCAACAT GTCGGGCTTC GGACTGACCT CGCTGCCCGA CAACTGGATC
ATCGAGTGGG ATCGGTTCAT CCGCGCCGAT GGCCCGCCCG GGCGGACGGC CCGCAAGATC
GACAGCCGGC TCGCGCTGCC CTTGAAGGAG ATGCGCAACC CCGCCCCGGG CGTCAGCGGG
ATCATGCGCC ATCTGGCGGC ACGCAACCTG CGCCGGGGCT ATGTGTTCAA CCTTCCCGAC
GCGCAGGCCA TCCTCTCCGA ACTGGCCTAT CAGGGCACGC GGATCGAACC GCTCACGGCG
GATGAGATCG CCTCGGGCGC GACCGGGGCG GCGATCCGCG ACGGCGGCTT CGACAGCTCG
ACGCCGCTGT GGTTCTACGT CCTGAAAGAG GCGGAGGTGA GGGCCGAGGG CAACCATCTC
GGCCCGCTCG GCAGCCGTCT GGTGGCCGAG ACGCTGATCG GCCTCCTCGT GACCGATGTG
TCGAGCTTCC TCCATCGGGG AGCCTTCGGC AGCTGGACAC CGGCCGAGGC GGCGCAGCCG
AAGGGCGAGC CGATCACGAG CTTTGCCGCC ATGCTCGTAG CCTGCGGCCT GCTCGCGCCC
GTGCCCGCGA CGCCGGCGGC GCCCCAGCCG GCCGCCACGG TCTGA
 
Protein sequence
MRPAYPLPNP DPQPSPGLEP REVFRFYFRN APEVAADPEI EGRILALADA MAEASLTQAD 
PEGESSIPAV FTYFGQFIDH DITANTDRDE PGFDIDRADI DPQPRDLVEK EKLNLRNGRL
NLDSLYGDGP GQSPAAAKME AAMRDPADPA KMRLGPVQPV SNGSPGFIRP DLPTDNLADI
PRVGAAIDDS SLTLEEAREL IGEADDAKLR LSALIGDGRN DENLIVAQLH HSFLRLHNAL
VDKLRADGAA TGDDALFALA RQHTTWIYQW MVVNLFLKAV CDPSVVEDVL EKRAPLYRAF
FEAHKASVPE GALPMPLEFS VAAFRYGHSM VRGDYDYNRS FGVAVDGTPQ TRASFEQLFA
FTGGGNMSGF GLTSLPDNWI IEWDRFIRAD GPPGRTARKI DSRLALPLKE MRNPAPGVSG
IMRHLAARNL RRGYVFNLPD AQAILSELAY QGTRIEPLTA DEIASGATGA AIRDGGFDSS
TPLWFYVLKE AEVRAEGNHL GPLGSRLVAE TLIGLLVTDV SSFLHRGAFG SWTPAEAAQP
KGEPITSFAA MLVACGLLAP VPATPAAPQP AATV