Gene Information |
Locus tag | Rsph17029_3870 |
Symbol | |
ID | 4898524 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | + |
Start bp | 997542 |
End bp | 999158 |
Gene Length | 1617 bp |
Protein Length | 538 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640114474 |
Product | hypothetical protein |
Protein accession | YP_001045721 |
Protein GI | 126464608 |
COG category | [S] Function unknown |
COG ID | [COG1262] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
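The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; the function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon contributes one residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(997542, 999158)
print(length)                     # 1617
print(protein_length_aa(length))  # 538
```

Both values agree with the Gene Length and Protein Length fields listed in the record.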
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0563781 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTAGGG CATGGGGTTT TGCGCTGGCG ATCCTCTTGG CCGCGCCGCC CGCCGCGGCG GGCGGGATCA CGGTGGACGA GACATATTGG AACCCGCAGC CGGCCGAGGG CGACCTCGTG CTGCCGATGC CCTGCGGCGG CGCGATGGCG TTCCGCCCGG TGGCGACACC CAATGCCGAC GGTGCGGTGG GCGACGTTCC GGTCATCCTC GGTCGCGAGG ACGAGGACCA GCCCTATCTG GACGGCACGC GGCGCTCCTA TGTCTCGGGA GGGTTTCCGG GGACGGGCGA GGCGGAAGCC AAAGCCATGT TCTACATGGC GAAATACGAG ATCGCCGAGG CGCAGTACCG CGCTGTGACG GAAGGCTGCC CGCAGAAGGA GCCGCGCCGG CGCGACTTCC TGCCGGTGAC GGGTGTGACG AAGACCGAGC TTGACGCCTT CGCGCAGGCC TGGACGGTCT GGCTGATGCG GAACGCGCCG GAAAGCCTCG CCCTCGCCGG TGCGGCGCCG GCCCATCTGC GGCTTCCGAC CGAGGAGGAG TGGGAGTTCG CCGCCCGGGG CGGTCTGGCC GTCGATCCGG CGCTGTTCCG CGGCGCGCTG CCGCCGATCC CGCCCGGTCA CTCCGCGTCC GAATACATCG CGCATGGCGG CAACGACAGC GCGGGCGGCA AGGTGCAGGC GATAGGCACG CTGGCGCCGA ACCCGCTCGG GCTGCACGAC ATGCTGGGCA ACGTCTCGGA ATATGTCCTG ACCCCCTTCG CGATGGTGCG CCACGGGCGG CTGCACGGGC AGGCCGGCGG CTACGTCAAG CGCGGCGGCG ATGCACGCAC GCCGCTCGAC CAGATCACCA GCGCGACCCG GTTCGAGGTG CCGCCCTTCG ACGCGCATTC GAAGGATGTG ACGCGCGAGG CTTTCACCGG GGGGCGGCTC GTGCTGTCCA CGCTCTCGAT CACCTCGGCC GAGCAGGCGA AGGCGGTCGC GGCGGCGCTC GAGACCCTGT CGCGGGCCGA CCCTGCGCTC GACTCGGCGG CCTCGGAGGC CGAGGTTCTG GCGCTGCTCG ACCGGCTTCA GCGCGAGGCC GGCGATGCCG CCGACCGCAG CCGCTTTGCC ACCATCGCCC GGACGATCCG CGAGGCCCGC GCCGAGACGA ACGCGCAGCG CGACCGGACC ATCCGGATGA TCCTCGGCTC GTCGGTGCTG ACCTGCGACC AGATCGTGCA GCGCTACCTG AACGCGCTGG CCATCGCGGC GCTGGTGCCG AGCTATGACG GGCTGGCCGC CGAGGCCGAG GCCAGCGGCG ACACCGCACT GGCGCAGGAG GTGGCCGAGG CCCGCGCCGA AGCAGAGGCG AAGCTGCGCG AGATGGAGGA GGCGGTCGGC CGCGAAAGCG TCGACTACGC CAACATGATC GAGGGGCTTT CGGCCGAATT CTCGCAGGAG CTGCTGGCGG CGCAGATCGC GGCCGTGCGG GGCGAGCACG AGAGCCGCGG CCCTCGGCGC GGTGCCTGCC TCGGCGCGGT GCAGGCCCAT CTCGACCGGC GGGCCCGGAG CGGGATGAAC GACCTCGCCG CCATCCGGTC CGACATGCAA CAGATCGCGG CCGCGCAGGC CCGCTAA
|
Protein sequence | MARAWGFALA ILLAAPPAAA GGITVDETYW NPQPAEGDLV LPMPCGGAMA FRPVATPNAD GAVGDVPVIL GREDEDQPYL DGTRRSYVSG GFPGTGEAEA KAMFYMAKYE IAEAQYRAVT EGCPQKEPRR RDFLPVTGVT KTELDAFAQA WTVWLMRNAP ESLALAGAAP AHLRLPTEEE WEFAARGGLA VDPALFRGAL PPIPPGHSAS EYIAHGGNDS AGGKVQAIGT LAPNPLGLHD MLGNVSEYVL TPFAMVRHGR LHGQAGGYVK RGGDARTPLD QITSATRFEV PPFDAHSKDV TREAFTGGRL VLSTLSITSA EQAKAVAAAL ETLSRADPAL DSAASEAEVL ALLDRLQREA GDAADRSRFA TIARTIREAR AETNAQRDRT IRMILGSSVL TCDQIVQRYL NALAIAALVP SYDGLAAEAE ASGDTALAQE VAEARAEAEA KLREMEEAVG RESVDYANMI EGLSAEFSQE LLAAQIAAVR GEHESRGPRR GACLGAVQAH LDRRARSGMN DLAAIRSDMQ QIAAAQAR
|
| |
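The sequences above are printed in space-separated blocks of ten characters, so the spacing has to be removed before any computation. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its codons translated. It assumes the standard codon assignments, which translation table 11 shares with table 1 for amino acids; all function names are hypothetical, and only the first 30 bases of the record are used as input.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw):
    """Remove the block spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate complete codons, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three ten-base blocks of the gene sequence in this record.
prefix = clean_sequence("ATGGCTAGGG CATGGGGTTT TGCGCTGGCG")
print(translate(prefix))  # MARAWGFALA
```

The translated prefix matches the first ten residues of the listed protein sequence. Calling `gc_fraction` on the full cleaned gene sequence would give the value to compare against the GC content field.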