Gene Information |
Locus tag | Rpal_3857 |
Symbol | |
ID | 6411537 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 4143765 |
End bp | 4146686 |
Gene Length | 2922 bp |
Protein Length | 973 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642713739 |
Product | hypothetical protein |
Protein accession | YP_001992830 |
Protein GI | 192292225 |
COG category | [V] Defense mechanisms |
COG ID | [COG1002] Type II restriction enzyme, methylase subunits |
TIGRFAM ID | |
|
|
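The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the final codon of the CDS is a stop codon. The record does not state either assumption, though its values are consistent with both. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Rpal_3857.
length = gene_length(4143765, 4146686)
print(length)                  # 2922
print(protein_length(length))  # 973
```

Both results agree with the Gene Length (2922 bp) and Protein Length (973 aa) fields listed in the record.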
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.663936 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCCCG CCGAATTCAT CAAGAAATGG AAGCCTGTCG CGCTGACGGA GCGCGCGGCC GCACAGACTC ACTTCCTCGA TCTGTGCAAG CTGTTCGAGC ACGAAGATCC GGTGTCGGCC GACCCGACGG GTGAATGGTT CACCTTCGAG AAAGGCGCAA CCAAAACCGG CGGCGGCGAC GGCTTTGCGG ACGTCTGGAA GAAGAACTAC TTTGCCTGGG AATATAAGAA GAAGAAGCGC GACCTCGGCG TCGCAATGAA CCAGCTCGTT CGCTACGCTG CAGCACTGGA AAATCCGCCG CTGCAGGTCG TTTGCGACAC CGATCGCTTC GTCATCCGCA CCGCCTGGAC CAATACGGTT CCGAAGGAAT ACGAGATCGA GCTTGACGAT CTTGCCGATC CGGAGAAGCG CAAGATTCTC TGGGCGGTGT TTCACGACCC CGAGCAGTTG CGGCCGCAGC AGACCCGCAC CGCGATCACT AAGGAAGCCG CCGACAAATT CTCAACCATC GCGCTACGCC TGCAGGGCCG CGGCACGCCG GAAGAGATCG CGCACTTTGT CAATCAGTTG GTGTTCTGTT TCTTTGCGAG CAGTGTCAAG CTGCTGCCGG AAGGCTTCTT TCCGAAGCTA CTGAAGCGCG CAGCGCAAAA GCCACAACAT GCCATCGACT ACTTCAACAA GCTGTTTGAG GCGATGGAGA ACGGGGGTGA ATACGACCTG ACTGACATCG CGCATTTCAA CGGCGGGCTA TTCGACGGGC GCCGCGCGCT CAAACTCGAC GAGGGCGATA TCGGCCTCTT GATCGAGGCC GGCAGTCTCG ATTGGGGCCA GATCGATCCG ACGATCTTCG GCACGCTGTT CGAGCGTTTC CTCGATCCCG ACAAGCGGGC TCAGATCGGC GCGCACTACA CCGACCCCGA CAAGATCCTG ATGATCGTTG AGCCGGTGAT TCTGCGGCCG CTTCGAGCGG AATGGGACGC CGCACGCGCG AAAATTGCTG AGATCGCTGG AGAAGCCAAC GCGTTGCAAC AAACCGGCTT CAGCAAGCAA GGCGCCAAGA GCTTCGACAA GAAGATCACG AATATTCGTG CGAAAGCCGA AGTCATCCGG GATCAATTCA TCGAGCGGCT GCGCGGCATC ACCATCCTCG ATCCAGCTTG CGGCTCGGGC AACTTCCTGT ATCTCGCCTT GCAAGGGGTT AAGGATATCG AGCTCCGCGC CAACCTCGAA TGCGAAGCGC TCGGACTGTC GCCGCGACTT CCGGTAATTG GCCCGGAAAT CGTCCACGGC CTCGAGATCA ATGAACTCGC CGCGGAGCTG GCGCGCACCA CGATCTGGAT CGGCGACATC CAATGGCGCA TCCGCAACGG CATCTACTCC AACCCGCGTC CGATCCTGCG CAAGCTGGAT TCGATCGAAT GCCGTGACGC GCTGATTACT AAGCTAACAG ATGGAACTTA CGCAGAGGCC GAATGGCCTA CGGCTGAATT TATCGTGGGC AATCCGCCAT TCTTGGGCGA CAAATTTATG CTTGATCGCT TGGGAGTGAG ATACACCCAA GCACTTCGCG AAGCTTTTCT CGGCAGAGTC CCGGGAGGCT CAGATCTTGT TTGTTACTGG CTAGAGAAGG CTCGAGCACA GATACTTTCA AATGAGACGT TTGGCGCAGG ATTTGTCGCG ACCAATTCAA TACGCGGCGG AGCAAATCGC ACCGTTGTCG ACAGAGTCAC GGCTGATCTA GACATTTTCT GCGCTTGGGC CGACGAAGAC TGGACAATCG AAGGCGCCGA CGTCCGTGTC 
TCACTCATTT GCTTTTCCTC GAAAGGCCGG GCCCAATTGC TTGTCGAACT CAACGGTCAG AGCGTCGCCC GCATATTCTC GGACCTGACA AGCAGCGCAA CCGATTTTAC ACGCGCTCGT AGCCTGAGAT CGTGCCGTGA GGTTGCGTTT ATCGGCAATC AAAAGGGCGG CGCATTCGAT CTACCGGGAT CAATCGCCCG TTCATTCCTC ACTCTGCCTC AGAACCCTAA CGGAAACTCG AATGCGGATG TCGTCAAGCC ATGGATCAAT GGACTCGACA TCGTTCGACG CCCCAGGGAC TATTGGATCA TTGACTTCAC TGGCTTACAA GAATCCGAGG CTGCTCTTTA CGAGGGGCCA TTTCAGTACA TTCTGGAGCA CGTTAAGGAG TATCGGAACG AAGAAGCTCA CGAATCGAGC AAGATGAATT GGTGGATACA TCAGAGGCCA CGCCATGCGC TTCGATTAGC CATAGACGGA CAGTCACGCT ACTTGGCCAC CGCACGTGTC GCCAAGCACC GGCTATTCAT TTGGGTTGAT CATCAAGTCG TACCTGACAG TCAGGTTGTA GCTATTGCAC GAAGCGACGA TGCGACCTTT GGCATTCTAC ACTCTAGCTT CCACGAGTCA TGGACGCTTC GCCTCTGCAC ATGGCTCGGC GTTGGCAACG ACCCACGCTA TACACCGACC ACTACCTTCG AAACCTTCCC TTTCCCCGAA GGCTTGACGC CGGACATCCC GGCGGGGGAC TACGCCGACG ACCCGCGCGC GCAGGCGATC GCGAAGGCGG CGAAGCGGCT CGACGAACTG CGCAAAGCGT GGCTCAATCC GCCCGATCTG GTCCGGATCG AGCCGGAGGT CGTGCCGGGT TATCCCGACC GCATCCTGCC GAAGGACACA AAGGCGGCGT CCGAATTGAA GAAGCGGACG TTGACCAATC TTTACAACGC GCGTCCGCAA TGGCTGGCCG ACGCGCATCG CGATCTCGAT GCCGCGGTCG CTGCGGCCTA TGGCTGGCCC GCCGACATTA CGGAAGACGA CGCACTGGCG AAGCTGCTGG AGCTGAATCT GTCGCGCGCG GGCGCGTCGA GCCCGCCTCC GGCCAACAAG GATGAAGGTT AG
|
Protein sequence | MTPAEFIKKW KPVALTERAA AQTHFLDLCK LFEHEDPVSA DPTGEWFTFE KGATKTGGGD GFADVWKKNY FAWEYKKKKR DLGVAMNQLV RYAAALENPP LQVVCDTDRF VIRTAWTNTV PKEYEIELDD LADPEKRKIL WAVFHDPEQL RPQQTRTAIT KEAADKFSTI ALRLQGRGTP EEIAHFVNQL VFCFFASSVK LLPEGFFPKL LKRAAQKPQH AIDYFNKLFE AMENGGEYDL TDIAHFNGGL FDGRRALKLD EGDIGLLIEA GSLDWGQIDP TIFGTLFERF LDPDKRAQIG AHYTDPDKIL MIVEPVILRP LRAEWDAARA KIAEIAGEAN ALQQTGFSKQ GAKSFDKKIT NIRAKAEVIR DQFIERLRGI TILDPACGSG NFLYLALQGV KDIELRANLE CEALGLSPRL PVIGPEIVHG LEINELAAEL ARTTIWIGDI QWRIRNGIYS NPRPILRKLD SIECRDALIT KLTDGTYAEA EWPTAEFIVG NPPFLGDKFM LDRLGVRYTQ ALREAFLGRV PGGSDLVCYW LEKARAQILS NETFGAGFVA TNSIRGGANR TVVDRVTADL DIFCAWADED WTIEGADVRV SLICFSSKGR AQLLVELNGQ SVARIFSDLT SSATDFTRAR SLRSCREVAF IGNQKGGAFD LPGSIARSFL TLPQNPNGNS NADVVKPWIN GLDIVRRPRD YWIIDFTGLQ ESEAALYEGP FQYILEHVKE YRNEEAHESS KMNWWIHQRP RHALRLAIDG QSRYLATARV AKHRLFIWVD HQVVPDSQVV AIARSDDATF GILHSSFHES WTLRLCTWLG VGNDPRYTPT TTFETFPFPE GLTPDIPAGD YADDPRAQAI AKAAKRLDEL RKAWLNPPDL VRIEPEVVPG YPDRILPKDT KAASELKKRT LTNLYNARPQ WLADAHRDLD AAVAAAYGWP ADITEDDALA KLLELNLSRA GASSPPPANK DEG
|
| |
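The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the following shows one way to strip that formatting, compute a GC percentage, and translate the coding sequence using the amino-acid assignments of translation table 11. The helper names are hypothetical. The sketch does not handle alternative start codons, which table 11 permits; the gene here begins with ATG, so that does not matter for this record.

```python
# Amino-acid assignments for NCBI translation table 11, in TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(raw: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean_sequence(raw)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
first_blocks = "ATGACCCCCG CCGAATTCAT CAAGAAATGG"
print(translate(first_blocks))  # MTPAEFIKKW
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the same kind of check against the GC content and Protein sequence fields.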