Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_3072 |
Symbol | |
ID | 4898949 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | + |
Start bp | 85376 |
End bp | 87100 |
Gene Length | 1725 bp |
Protein Length | 574 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640113674 |
Product | heme peroxidase |
Protein accession | YP_001044944 |
Protein GI | 126463831 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0304079 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.293212 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGCCCG CCTATCCGCT TCCCAACCCC GACCCGCAAC CCTCTCCTGG GCTCGAGCCG CGCGAGGTAT TCCGCTTCTA CTTCCGCAAT GCCCCCGAGG TGGCGGCCGA TCCCGAGATC GAGGGCAGGA TCCTCGCGCT GGCCGACGCC ATGGCCGAGG CGAGCCTGAC CCAGGCCGAT CCCGAAGGAG AGTCCAGCAT TCCGGCGGTC TTCACCTACT TCGGCCAGTT CATCGACCAC GACATCACGG CCAATACCGA CCGCGACGAG CCCGGCTTCG ACATCGACCG GGCCGACATC GACCCGCAGC CGCGCGATCT GGTGGAAAAG GAGAAGCTGA ACCTGCGCAA CGGCCGGCTG AACCTCGACA GCCTCTACGG GGACGGCCCC GGCCAGTCGC CCGCGGCGGC GAAGATGGAA GCGGCGATGC GCGACCCCGC CGATCCCGCC AAGATGCGCC TCGGGCCGGT GCAGCCGGTC AGCAACGGAT CGCCGGGCTT CATCCGGCCG GATCTGCCCA CGGACAACCT GGCCGACATC CCCCGGGTCG GCGCCGCCAT CGACGACAGC AGCCTGACAC TGGAGGAAGC CCGCGAGCTG ATCGGCGAGG CCGACGATGC CAAGCTGCGC CTCTCCGCGC TGATCGGCGA CGGGCGCAAC GACGAGAATC TGATCGTGGC CCAGCTGCAC CACAGCTTCC TGCGGCTGCA CAATGCGCTG GTGGACAAGC TCCGGGCCGA TGGCGCGGCC ACGGGCGACG ACGCGCTCTT CGCTCTGGCG CGGCAGCACA CGACCTGGAT CTACCAGTGG ATGGTGGTGA ACCTCTTCCT GAAGGCGGTC TGCGACCCCT CCGTCGTGGA GGATGTGCTC GAGAAGCGCG CCCCGCTCTA CCGGGCCTTC TTCGAGGCGC ACAAGGCCTC GGTGCCCGAA GGCGCCCTGC CGATGCCGCT CGAGTTCAGC GTGGCGGCCT TCCGCTACGG CCATTCGATG GTGCGGGGCG ACTACGATTA CAACCGCAGC TTCGGCGTCG CGGTCGACGG CACGCCGCAG ACCCGCGCCA GCTTCGAGCA GCTCTTCGCC TTCACCGGCG GGGGCAACAT GTCGGGCTTC GGACTGACCT CGCTGCCCGA CAACTGGATC ATCGAGTGGG ATCGGTTCAT CCGCGCCGAT GGCCCGCCCG GGCGGACGGC CCGCAAGATC GACAGCCGGC TCGCGCTGCC CTTGAAGGAG ATGCGCAACC CCGCCCCGGG CGTCAGCGGG ATCATGCGCC ATCTGGCGGC ACGCAACCTG CGCCGGGGCT ATGTGTTCAA CCTTCCCGAC GCGCAGGCCA TCCTCTCCGA ACTGGCCTAT CAGGGCACGC GGATCGAACC GCTCACGGCG GATGAGATCG CCTCGGGCGC GACCGGGGCG GCGATCCGCG ACGGCGGCTT CGACAGCTCG ACGCCGCTGT GGTTCTACGT CCTGAAAGAG GCGGAGGTGA GGGCCGAGGG CAACCATCTC GGCCCGCTCG GCAGCCGTCT GGTGGCCGAG ACGCTGATCG GCCTCCTCGT GACCGATGTG TCGAGCTTCC TCCATCGGGG AGCCTTCGGC AGCTGGACAC CGGCCGAGGC GGCGCAGCCG AAGGGCGAGC CGATCACGAG CTTTGCCGCC ATGCTCGTAG CCTGCGGCCT GCTCGCGCCC GTGCCCGCGA CGCCGGCGGC GCCCCAGCCG GCCGCCACGG TCTGA
|
Protein sequence | MRPAYPLPNP DPQPSPGLEP REVFRFYFRN APEVAADPEI EGRILALADA MAEASLTQAD PEGESSIPAV FTYFGQFIDH DITANTDRDE PGFDIDRADI DPQPRDLVEK EKLNLRNGRL NLDSLYGDGP GQSPAAAKME AAMRDPADPA KMRLGPVQPV SNGSPGFIRP DLPTDNLADI PRVGAAIDDS SLTLEEAREL IGEADDAKLR LSALIGDGRN DENLIVAQLH HSFLRLHNAL VDKLRADGAA TGDDALFALA RQHTTWIYQW MVVNLFLKAV CDPSVVEDVL EKRAPLYRAF FEAHKASVPE GALPMPLEFS VAAFRYGHSM VRGDYDYNRS FGVAVDGTPQ TRASFEQLFA FTGGGNMSGF GLTSLPDNWI IEWDRFIRAD GPPGRTARKI DSRLALPLKE MRNPAPGVSG IMRHLAARNL RRGYVFNLPD AQAILSELAY QGTRIEPLTA DEIASGATGA AIRDGGFDSS TPLWFYVLKE AEVRAEGNHL GPLGSRLVAE TLIGLLVTDV SSFLHRGAFG SWTPAEAAQP KGEPITSFAA MLVACGLLAP VPATPAAPQP AATV
|
| |