Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_1859 |
Symbol | |
ID | 4896845 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | + |
Start bp | 1963520 |
End bp | 1966621 |
Gene Length | 3102 bp |
Protein Length | 1033 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640112451 |
Product | hypothetical protein |
Protein accession | YP_001043735 |
Protein GI | 126462621 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCACCC ACAAGGAAAT CGGCTTCGAG GAAGTGATCT GCGCCCATCT CGCCTCGCGG GGCTGGCTCT ATTCCGACGG CGACGCGCAG CAGTATGACC GCACCCGGGC GCTTCTGCCC GCCGATGTCG CCGCCTGGGC CGAGACCACG CAGCCGGAGG CCTGGGCGGA GGCTGTGGCC AAGCATGGCG AGGCGGCGAT CCTCGACCGG CTGCGGAAAC TCCTGAACGA GCGCGGCACG CTCGAGGTGC TGCGGCGGGG GATCGACATC GTGGGCCTCA AGGTGCCGCT GGCGCTGGCC CAGTTCCGCC CGGCCATGGG CATGAACCCC GAGATCACCG CGCGCTATCA CGCGAACCGG CTGCGGGTGG TGCGGCAGGT GCGCTATTCG GGGACGAACG AGAAATGCCT CGACCTCGTG CTGTTCCTGA ACGGCATTCC GGTGGCGACG GCGGAGCTCA AGACCGACTT CACCCAAGGC GTGCAGGACG CGGTCGACCA ATACCGCTTC GACCGCAAGC CGCATGAGAA GGGACAGGCG CCGGAGCCGC TCCTCTCCTT CCCCGGCGGC GCGCTGGTGC ATTTCGCGGT CAGCGACAGC GAGGTGATGA TGACGACCCG GCTCGAGGGG CCGGGCACGC GCTTCCTGCC GTTCAACAAG GGCGACCATG GTGCCAAGGG CAATCCGGTG AACCCGATGG GCCACCGCAC CGCCTATCTG TGGGAGGAGA TCTGGGCGCG CGACAGCTGG CTCGAGATCC TCGGCCGCTA CATGGTCGGC ACGCGAGACG CGAAGAAGGC TCTGACGGGG ATGATCTTCC CGCGCTTTCA CCAGCTCGAC GCCACGCGCA AGCTGCGGGC GGCGGTGCTG GAGGAAGGCC CGGGCGGCAA GTACCTGATC CAGCACTCGG CGGGCTCGGG CAAGACCAAC TCGATCGCCT GGACGGCGCA TCAGCTGGCC GACCTGCATG ACGCGGCGGA CGCGAAGGTG TTCTCCTCGG TGGTGGTGGT GTCGGATCGC AATGTGATCG ACACGCAGCT GCAGGAGGCA ATCTTCGCCT TCGAGCGCAC GACCGGCGTG GTGGCGACCA TCACGAACGA CAGCGGTGCG AAGAGCGGTC AGCTGGCCGA GGCGCTGGCG GGCGGCAAGA AGATCGTGGT CTGCACGATC CAGACCTTTC CTCACGCGCT GAAGGCGGTG CAGGAGCTGG CGGCGGCGAC GGGCAAGCGC TTCGCGGTGA TCGCGGACGA GGCGCATTCC AGTCAGACCG GCGAGGCAGC GGCGAAACTG AAGCAGGTGC TCTCGGCCGA GGAACTGGCG GCGGTCGCGG ATGGCGGCGA GATCGGCACC GAGGACATCC TCGCCGCGCA GATGGCCGCG AAGGCGTCGA GCGCGGGCAT CAGCTTCGTG GCCTTCACCG CGACGCCGAA GGCCAAGACG CTGGAGCTGT TCGGCCGCCG CCCCGATCCC TCGGCGCCCG CGGGGCCAGG CAACCTGCCC GCGCCCTTCC ACGTCTATTC GATGCGGCAG GCGATCGAGG AGGGCTTCAT CCTCGACGTG CTGAAGAACT ACACGCCCTA TTCGCTGGCC TTCAAGCTGG CGAGCGACGG GCGGGAGCTG GACGACGCCG AGGTCGAGCG CAGCTCGGCG CTGAAGTCGC TGATGAGTTG GGTCAAGCTG CACCCCTGGA ACATCAGCCA GAAGGTGGCG ATCGTGGTCG AGCATTACCG CGCGATGGTG ATGCCGCTGC TCGAGGGCCG CGCCAAGGCG ATGGTGGTGG TCGAGAGCCG GAAGGAGGCG GTGCGCTGGC AGCTGGCGAT GCAGAGCTAC ATCGCCGAGA AGGGGTATCC GCTCGGCACG CTGGTCGCCT TCTCGGGCGA GGTGACCGAC CCGGAGAGCG GCCCCGACCC GTTCTCCGAA ACCAGCAAGG CGCTGAACCC GGGGCTGAAG GGCAACATCC GCGAGACCTT CAAGGGAGCC GGATTCCATG TGCTGATCGT GGCGAACAAA TTCCAGACCG GGTTTGACCA GCCGCTCCTG TGCGGGATGT ACATCGACCG GCGGCTGGCA GGGATCCAGG CGGTGCAGAC GCTGTCGCGG CTGAACCGGG CCTACCAGCA GGGCAGCGTG GTGAAGGACA CGACCTATGT GCTCGACTTC GCCAACAGCG CCGACGACAT CCTGGCGGCC TTCAAGACCT ACTACGAAAC TGCCGAGCTG CAGGACGTCA CCGACCCGAA CCAGGTGCTG GACCTGCGCG CCAAGCTCGA CGGGCTCGGC TACTACGACG ATTTCGAGGT CGACCGCGTG GTGAAGGCCG AGCTGAACCC GAAGGCCACC CACGCCGACC TGTCCGCCGC GATCGAGCCG GTGGCCGACC GCCTGCTGAA AGCCTTCAAG GCCGCGCGCG ACGTGGAAGC GAAGGCGCGC GCCGAGGGCA ACAACAAGGC GGCGGTGGCG GCGAAGCAGG AGCAGGACGC GCTGATCGTC TTCCGCTCGA ACATGGGCGC CTTCCTGCGG CTCTATGCCT TCCTGTCGCA GATCTTCGAC TATGGCGCGA CGGCGATCGA GGCGCGCAGC CTGTTCTATC GGCGTCTGAT CCCGCTGCTG GACTTCGGCC GCGAGCGTGA GGGTGTCGAC CTGAGCAAGC TGGTCCTGAC CCACCACAAG CTGTCCACCG TCGGAAAGAA GGACCTGCTC CTCGGCGGGG CGGCCGAGAA GCTGAAGCCT ATGACCGAAA CCGGCTCGGC CAATGTACAG GACAAGGAGA AGGCGCTGCT GTCCCAGATC GTGGCGAAGC TAAACGACCT CTTCACCGGC GACTTGACCG ACAACGACCA GCTCTCCTAC GCGATGACGG TCAAGGGCAA GCTCCTGGAG AACACGACCC TCGCCCAGCA GGCCGCAAAC AATCACAAGG AGCAGTTCGG CAACTCGCCC AACCTGATGA ACGCCCTGAT GGACGCGATC ATCGACGCGC TGGACGCGCA TTCAGCGATG AGCAAGCAGG CCCTCGACTC ACCGGTAATC CAGAAAGGCC TGAAGGACGC CCTCATGGGC CCCGGTCGCC TCTATGAGGC CCTGCGGGAG CGAGCGGCAT GA
|
Protein sequence | MATHKEIGFE EVICAHLASR GWLYSDGDAQ QYDRTRALLP ADVAAWAETT QPEAWAEAVA KHGEAAILDR LRKLLNERGT LEVLRRGIDI VGLKVPLALA QFRPAMGMNP EITARYHANR LRVVRQVRYS GTNEKCLDLV LFLNGIPVAT AELKTDFTQG VQDAVDQYRF DRKPHEKGQA PEPLLSFPGG ALVHFAVSDS EVMMTTRLEG PGTRFLPFNK GDHGAKGNPV NPMGHRTAYL WEEIWARDSW LEILGRYMVG TRDAKKALTG MIFPRFHQLD ATRKLRAAVL EEGPGGKYLI QHSAGSGKTN SIAWTAHQLA DLHDAADAKV FSSVVVVSDR NVIDTQLQEA IFAFERTTGV VATITNDSGA KSGQLAEALA GGKKIVVCTI QTFPHALKAV QELAAATGKR FAVIADEAHS SQTGEAAAKL KQVLSAEELA AVADGGEIGT EDILAAQMAA KASSAGISFV AFTATPKAKT LELFGRRPDP SAPAGPGNLP APFHVYSMRQ AIEEGFILDV LKNYTPYSLA FKLASDGREL DDAEVERSSA LKSLMSWVKL HPWNISQKVA IVVEHYRAMV MPLLEGRAKA MVVVESRKEA VRWQLAMQSY IAEKGYPLGT LVAFSGEVTD PESGPDPFSE TSKALNPGLK GNIRETFKGA GFHVLIVANK FQTGFDQPLL CGMYIDRRLA GIQAVQTLSR LNRAYQQGSV VKDTTYVLDF ANSADDILAA FKTYYETAEL QDVTDPNQVL DLRAKLDGLG YYDDFEVDRV VKAELNPKAT HADLSAAIEP VADRLLKAFK AARDVEAKAR AEGNNKAAVA AKQEQDALIV FRSNMGAFLR LYAFLSQIFD YGATAIEARS LFYRRLIPLL DFGREREGVD LSKLVLTHHK LSTVGKKDLL LGGAAEKLKP MTETGSANVQ DKEKALLSQI VAKLNDLFTG DLTDNDQLSY AMTVKGKLLE NTTLAQQAAN NHKEQFGNSP NLMNALMDAI IDALDAHSAM SKQALDSPVI QKGLKDALMG PGRLYEALRE RAA
|
| |