Gene Information |
Locus tag | Rsph17025_3181 |
Symbol | |
ID | 5085666 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009429 |
Strand | + |
Start bp | 42796 |
End bp | 44898 |
Gene Length | 2103 bp |
Protein Length | 700 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640484753 |
Product | hypothetical protein |
Protein accession | YP_001169370 |
Protein GI | 146279212 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2015] Alkyl sulfatase and related hydrolases |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
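The length fields above follow from the coordinates. The sketch below is a minimal check, assuming that Start bp and End bp are 1-based and inclusive and that the unspliced CDS ends with a stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the terminal stop codon


# Values taken from the record above.
length_bp = gene_length(42796, 44898)
print(length_bp)                  # 2103
print(protein_length(length_bp))  # 700
```

Both results agree with the Gene Length and Protein Length rows.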
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGACCA CCAGACGACA GATCCTCACG ACGCTTCCCG TCACTGGCGC GGCCTTCGCG CTTGGCGGCA ACTTCCTCGA AGAAGGCACG GCGCGGGCGC AGGGCGTGCG CGCGCCCCTC GAGGGACATT TCCACCCGAA GGGCAAGGCG CCCTCCGCGC ATACGATGAA GGTGCTGGCA GAGGCGCGCA AGGGGCTGCC CTTCTCGGAT CGCCGTGACA TCGAAGAGCA GGCGCGCGGG CTGATCGCCG AGCGTGCCGC CCCTCAGATC ATGGGCGACG CGGGCAACGT GGCCTTCGAC CGCAGCGAAT ACGACTTCCT CGACGGGAAC GATGACTTCG ACAGCATCCA TCCGTCGATG ACGCGCATCG CGCGGCTGAA CAACCACTTC GGGATCTATG AGGTGATCCC CGGCATCTAC CAGGTGCGCG GCCTGGATCT TTCCCACATG AGTTTCGTCC GCGGCAAGAC CGGCTGGATC GTCTTCGACA CGCTGACCAC GACAGAGACC GCACGCGCCG CCTGGGAGCT GTTCCAGGAG CACAGGGGCG AGGGGCTGCC GGTCACCGCT GTGATCTACT CCCACACCCA CGCGGACCAC TGGGGCGGCG TTCGCAGCCT CGTCGCCGAG GAGGATGTGG CCGCGGGCCG GGTGGAGATC ATCGCGCCCG ACGGCTTCAT GCAGTTCCTG ATCTCGGAGA ACGTCTTTGC CGGCAACGCG ATGAACCGGC GGCTGTTCTA TCAGTATGGG TTGCTTCTGC CGGTCGCGCC CCACGGCTTC GTCACCCAGG GTTTGGGGCA CCGGGTTCCG GCGGGCGTGA ACGGGCTGAT CCCGCCCACC CGCCTGGTTT CGGAACCGAT CGAGGAGTTC GAGGTCGATG GCGTGCGGAT GATCTTCCAG AACACGCCGA ACACCGAGGC GCCTCGGGAG ATGAACACCT ACATCCCCGA CCTCAAGGCG CTGTGGATGG CCGAGAACGT GACCGCCCAG CTCCACAACA TCTACACGCT GCGCGGGGCG CCGGTCCGCG ACCCGCTCAA CTGGTCGAAA TACATCAACG AGGCGCTCTA CCGTTTCGGG CAGGAGGCGG AGGTGATGTT CGCCTCGCAC CACTGGCCGC GCTGGGGCAA CGAGCGCATC CGCGAGATCC TGCGCGACCA GCGCGACCTC TACGCCAACA TGAACAACCA GGTTCTGCAC TACGCCAACC AGGGCGTCAC CATCAACCAG GTCCACAACG TCTATCGGAT GCCCGAGAGC CTGCAGGCCA AATGGCACTG CCGGGGCTAC CACGGCTCTC CCCAGCACAA TGCGCGCGGC GTGATCCAGC GTTACCTCGG CTTCTGGGAC TGCAACCCGA CCACGCTGAT CCCGCTTTCG CCCGCCGACT CGGCGCCGCT CCATGTCGAG ATGATGGGCG GCGGCGCGGC GATCCTCGCG AAGGCGGGCG AACTGCACGA AGCGGGTGAC TATTTCCGCG CGACCGAGAT CCTCGACAAG CTGGTTCAGG CCGAACCGGG CAACGGCGAG GCCAGGGATG CGCTCGCCGA CGCCTTCGAG CAGATCGGGT ATCAGCAGGA GAATCCGGGG CTCCGGAACA GCTTCCTCGC GGCGGCCTAT GAGTTGCGCT CGGGCATCCC GCAGGGGGCG ATGGTCAGCA GCTCCAGCCC CGACGTCATC CGCGGCATGT CCACCGAACT GTTCCTGAAC TTCCTTGCCA TCCGCATGGA TGGCCGCCGC GCCACCGGTC TTGCCTTCGT CATGAATCTC GAGACTCCCG ATACGGGTGA GACCTTCGTG 
GTCGAGCTTG CCAACGAGAC GCTCACGAAC ATCGCAGGCT TCCGCGCGGA CGCAGCCGAC CTGACCCTGA AGGTGAACCG TGCCGACCTC GAGCGCGTCA TGGCGGGACA GAGCGGGCTG GACGCGCTGC TGGCCGACGG CACCGCGCAG GCCGAGGGCG ATGTCGCGAT CCTCGAACGG CTTGCGGGTC TCATGGTCGA GTTCGACCCC CGCTTCGAGA TCATGCCCGG CACGGCAAAG CGCACCGAGC TTGCGGCGGC CCAGCCCTTC GAATGCGACA TCGGGGCGCC GATCGCGGAG TGA
|
Protein sequence | MMTTRRQILT TLPVTGAAFA LGGNFLEEGT ARAQGVRAPL EGHFHPKGKA PSAHTMKVLA EARKGLPFSD RRDIEEQARG LIAERAAPQI MGDAGNVAFD RSEYDFLDGN DDFDSIHPSM TRIARLNNHF GIYEVIPGIY QVRGLDLSHM SFVRGKTGWI VFDTLTTTET ARAAWELFQE HRGEGLPVTA VIYSHTHADH WGGVRSLVAE EDVAAGRVEI IAPDGFMQFL ISENVFAGNA MNRRLFYQYG LLLPVAPHGF VTQGLGHRVP AGVNGLIPPT RLVSEPIEEF EVDGVRMIFQ NTPNTEAPRE MNTYIPDLKA LWMAENVTAQ LHNIYTLRGA PVRDPLNWSK YINEALYRFG QEAEVMFASH HWPRWGNERI REILRDQRDL YANMNNQVLH YANQGVTINQ VHNVYRMPES LQAKWHCRGY HGSPQHNARG VIQRYLGFWD CNPTTLIPLS PADSAPLHVE MMGGGAAILA KAGELHEAGD YFRATEILDK LVQAEPGNGE ARDALADAFE QIGYQQENPG LRNSFLAAAY ELRSGIPQGA MVSSSSPDVI RGMSTELFLN FLAIRMDGRR ATGLAFVMNL ETPDTGETFV VELANETLTN IAGFRADAAD LTLKVNRADL ERVMAGQSGL DALLADGTAQ AEGDVAILER LAGLMVEFDP RFEIMPGTAK RTELAAAQPF ECDIGAPIAE
|
| |
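The GC content and the protein sequence can both be derived from the gene sequence. The sketch below is one way to do that in plain Python, under these assumptions: the sequence is pasted as space-separated blocks as shown above, it contains only A, C, G and T, and translation starts at the first base. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons), so the standard codon table is used here. The helper names are hypothetical.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base and stop at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on non-ACGT input
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
prefix = "ATGATGACCA CCAGACGACA GATCCTCACG"
print(translate(prefix))  # MMTTRRQILT
```

The translated prefix matches the first block of the protein sequence. Applying `gc_fraction` to the full gene sequence gives the value that the GC content row reports as a rounded percentage.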