Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_2815 |
Symbol | |
ID | 6410481 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 3060023 |
End bp | 3063016 |
Gene Length | 2994 bp |
Protein Length | 997 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 642712693 |
Product | outer membrane insertion C-terminal signal |
Protein accession | YP_001991799 |
Protein GI | 192291194 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3637] Opacity protein and related surface antigens |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.71405 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAATT TTCCAGCGTG GCTGAGCGTC TGGTCGGTGG TGGCCGTTGC GTGCGGTAGT GCGCATGCGG CCGATCTGCA GAACGGCGCG GTCCCGTCGG CCGTGCCGGC GGCAACTAAC TGGACCGGCT TCTACATCGG TACCCACGTC GGCGCCGCAG CGTCGGACAG CGCGTGGAAC AGCATTGCTG GAAGCGGGGC CGGACTGGCG GCGGGGCCGT TCCCTGGTCA TGGCGTCAGC GGCAATGCGA TCGCCGGCAT CCAGGGCGGC TACAACGCGC AGTTCGGCAA TTACGTGCTC GGCATCGAAG GCGACGCCAG CTTCGGTTCC ATCAACGGCT TGGCCCGCTG CCTGCACGGC ACCTTTGCCT GCACCAGCCG GATCGACGAG CTGTGGACGC TCGCCGCGCG CTTCGGCTAC GCCGCGGGCG ATTTGCTGGT CTATGGCAAG GCCGGCGCCG CCTGGGCCGA TGTCCATCGC CGGATGGCCA GCGGCAACTT CGTCAACGTC CTCGAAGCGT CGGAGACGCG GTCCGGCTGG CTGCTCGGCG CCGGCGTCGA ATATGCCTTC CTGCCGGGCC TGTCGGCCAA GATCGAGTAC AACTACGCAG ATTTCGGCAG TCGCTCCCTG ACGATGGCCG ATCAGTTCGG CGACGTTTCG GACGTGTCGA TCGGCCAGAC CGCGCATCTG GTCAAGGTCG GCCTCAACTA CCGGCTCGGC GCGGCCTCGG TCAGTGCAGC GCCGCTTCCC GGCGTGCCGC TGCCGCAGTG GAGCTGGACC GGGATCTATC TCGGCGTCCA TGCCGGCGGC GGCTTCGGCA GCAACGATTG GGACTCCGCG ACGGGCGCGC TGCTCGCGGC GTCGACGTCG GGCGGGTTCC CCGGGCGCGG CGACAGTTCC GGGCTGTTCG GCGGCGGCCA GATCGGCGCG AACTATCAAT TCGGCCGCTG GGTCGCCGGC GTCGAAGCGT CCGCCGCCGC GGCGGATATC GGCGGCTACG CCAAATGCGC CACCGATGTT GGGACACGGA GAAATTTCAC CTGCCACAAT GAGCTGTCGT CGCTCGGCAC CATCACGGGC CGGCTTGGGC AGACCTGGGG CAACCTGTTG ATCTATGGCA AGGCCGGCGC GGCTTGGGCA ACCGGCAGCA GCGATGCGCA GCGCGCAGGC AGCGCCAGCC GCTTCACCGA AAGCGGCACG CGCTGGGGCT GGGTCACCGG AACCGGCCTG GAATATGCGC TCAGTCCGAA CCTGTCGGCC TTCGTGGAGT ACAACCACGT CGATTTCGGC ACCCAAGACA CCGCTTACGT CGATCAGTTC GGCAATGCCT CCGAGGTCGG CTTCAAGCAG AAGTTCGATC TGGTGAAGGC GGGTCTGAAT TATCGGCTCG GCTCGGGCGC GCCGACGCTC GGCGCGGGCG GCGACGTGCC GCTGTTCGTC AAGGCCGCCG CGCTGCCGCT CGGCTGGCAG GTCGAGGCCG GCACCCGCTA TTGGGGCAGC TCGGGGCGGA TGCAGAAGGA TCTGAACGAC AACGTCTCGC CGAGCCGGCT GAATTCGCGG CTGATCTATG GCGATCAGAC CGGACATTCG CTCGAAGCCT TCGTGCGCGT CGATCACGCC TCGGGCCTGT TCGCCAAGGC CAATCTCGGC CTCGGCCATC TCGTCAACGG TCAGCTCAAC GACGAAGACT TCCCGAGCGA AGTGAACTAT TCGAACACGA TCTCGGAGAT GCGCGACGGC CGCCTGGCCT TCGGCAGCGC GGACATCGGC TACAATTTCA TCAACGATGG CGGCCGTAAG CTCGGTGGCT TCGTCGGTTA TCGTTCGCTC TACCAGACTG GCAACGGCTT CGGCTGTCGG CAGATCGCGA CCGACTTCGA CACCTGTGGC GTCCCGTTTC CCACCAACTT CGTCGGTCTC AGCGAGACCG AATCCTGGCG CGGCGTCGCG CTCGGCCTCA ACGTTCGCGC GCCGCTGACC GAACGGCTGC GGCTGGAGGT CGATGCGGCC TATCTGCCTT ACGTCAATCG CGCCGGCTTC GACAATCATT GGTTTCGCGC CGACATCAAT CCGCAGTCGG AGGTCGGCCA CGGCTGGGGC ACGCAGTTCG AAGCGATCCT GTCCTACGCC GTTACCGATC GCTTCAGCGT CGGCGTCGGC GGCCGCTATT GGTACTTCGC GACCGACAGC GCCAGCACGA TCTTCCCGAG AGAAGCGACG ACGTCGCCGA TGCAGTTCTA TTCCGAGCGC TACGGCGTCT TCCTCCAGAC CTCGTACAAG TTCGGCGACG TCGATGCGGC AGAAGCTTCG GCGCACGGCA TCACCAAGGC GCCGCCGCGG ATCGCGCCCA CCAACTGGAC CGGCCTCTAT GTCGGCGCCA GCGTCGGCGC GGGCTGGGGC CGCACCACCT ATGCGGACCC GTTCCCGACC CCGACGACGG GTGATCGGGC TGACCTCGGC GGCGCGCTGC TCGGCGGCCA GATCGGCGCG AACTACCAGT TCGGTCATCT CGTCGCCGGC GTCGAAGCCT CGGCCAACTG GGCGAACATC CGCGGCACCG ACACCTGCTT CGGCGCGTAT CCCAATCCTG TGGTGGCCGG CTTCAACTGC GGCAGCAGGA TCGACGCGAT CGGCGCTTTC ACAGCGCGCG GCGGCTACGC GATCGATCGC ACGCTGCTCT ACGTCAAGGG CGGCGCGGCC TGGGATCGCC AGCAGGATCA GTTCAACACC GTCGGCGTCG GCGGCACGAC ACTGGCGAAC ACCAGCACCA ATTGGGGCTG GACCGTCGGC GGCGGTCTCG AATACGCGCT GGCGCCGTCG TGGTCGATGG CGCTGGAATA CAAGTACTTC GACTTCGGCG CGTCCCCGGC GTTCAGCACG TCGGTGACGC CGGCATTGGA CGGCGTCAAT CTGGCGCCGC AGAGCAACAA GCTGCAGACG GCGTCGCTCG GGGTGAACTA CAAGTTCGGG CCGTCGTTGT TCGTAAGGGA CTGA
|
Protein sequence | MKNFPAWLSV WSVVAVACGS AHAADLQNGA VPSAVPAATN WTGFYIGTHV GAAASDSAWN SIAGSGAGLA AGPFPGHGVS GNAIAGIQGG YNAQFGNYVL GIEGDASFGS INGLARCLHG TFACTSRIDE LWTLAARFGY AAGDLLVYGK AGAAWADVHR RMASGNFVNV LEASETRSGW LLGAGVEYAF LPGLSAKIEY NYADFGSRSL TMADQFGDVS DVSIGQTAHL VKVGLNYRLG AASVSAAPLP GVPLPQWSWT GIYLGVHAGG GFGSNDWDSA TGALLAASTS GGFPGRGDSS GLFGGGQIGA NYQFGRWVAG VEASAAAADI GGYAKCATDV GTRRNFTCHN ELSSLGTITG RLGQTWGNLL IYGKAGAAWA TGSSDAQRAG SASRFTESGT RWGWVTGTGL EYALSPNLSA FVEYNHVDFG TQDTAYVDQF GNASEVGFKQ KFDLVKAGLN YRLGSGAPTL GAGGDVPLFV KAAALPLGWQ VEAGTRYWGS SGRMQKDLND NVSPSRLNSR LIYGDQTGHS LEAFVRVDHA SGLFAKANLG LGHLVNGQLN DEDFPSEVNY SNTISEMRDG RLAFGSADIG YNFINDGGRK LGGFVGYRSL YQTGNGFGCR QIATDFDTCG VPFPTNFVGL SETESWRGVA LGLNVRAPLT ERLRLEVDAA YLPYVNRAGF DNHWFRADIN PQSEVGHGWG TQFEAILSYA VTDRFSVGVG GRYWYFATDS ASTIFPREAT TSPMQFYSER YGVFLQTSYK FGDVDAAEAS AHGITKAPPR IAPTNWTGLY VGASVGAGWG RTTYADPFPT PTTGDRADLG GALLGGQIGA NYQFGHLVAG VEASANWANI RGTDTCFGAY PNPVVAGFNC GSRIDAIGAF TARGGYAIDR TLLYVKGGAA WDRQQDQFNT VGVGGTTLAN TSTNWGWTVG GGLEYALAPS WSMALEYKYF DFGASPAFST SVTPALDGVN LAPQSNKLQT ASLGVNYKFG PSLFVRD
|
| |