Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RSP_0331 |
Symbol | |
ID | 3719091 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides 2.4.1 |
Kingdom | Bacteria |
Replicon accession | NC_007493 |
Strand | - |
Start bp | 2057613 |
End bp | 2059451 |
Gene Length | 1839 bp |
Protein Length | 612 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 640071543 |
Product | hypothetical protein |
Protein accession | YP_353408 |
Protein GI | 77463904 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3405] Endoglucanase Y |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.378113 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGAGAC GGACCATCCT GACATCGGCC GCCGCCGCGC TGATGCTGGC CCCTGCAGGA CGCCTCCTCG CGCAGTCGGG CAGAGAGGCT TTGCCTGCGG ACCACCCGCT CCAGGCGGCC TGGCGCAGCT GGAAGGATGC GTTCCTGCTG CCCGCCGGCC GCATCGTCGA CGGGCCGCAG CAGAATGCGA GCCATTCCGA AGGGCAGGGC TACGGAGCCA CGCTCGCCGC GATCTTCGGC GACGAGGAGG CCCTGCGGCG CATCGTCGAC TGGACCGAGG CGAACCTTGC GCGGCGCGAG GACAAGCTTC TGAGCTGGCG CTGGCTGCCC GGTGTGGCGC TGGCCGTGCC CGACGAGAAC AACGCCACCG ACGGCGATCT CTTCTACGCC TGGGGTCTCG CCATGGCCGC GCAGCGGTTC GGCAAGGCCG ATTACGCCGG GCGGGCGACC GAACTGGCGC GCGCCATCGC GCTGCATTGC GTGCGTCCGC ATCCGGACGG CTCCGAGCAG CTCGTGCTGC TGCCGGGGGC CAGCGGCTTC GAGACGCCGG ACGGGGTGGT GCTCAACCCC TCCTACTACA TGCCCCGCGC CCTGACCGAG CTCGCCGCCT TCAGCGGCCA GGACCGGCTG GCGCGCTGTG CCCGCGACGG GGCGGACTGG ATCGCGTCGC TCGGGCTTCC GCCGGACTGG GCGCTGGTCA CGCCCTTCGG CACACAGCCG GCGCCGGGCC TGTCCCACAA CAGCGGCTAC GATGCGCTGC GGGTGCCCCT GTTCCTGCTC TGGTCCGGGC TGACCGCCAA TCCCGCGCTG CGCCGCGCGG TGGAGGCGGC CGGGGACGCC GCAGCCGGCG ACACGCCGGT GAGGTTCGAC CGCGACACGG GGGCGGTGCT GGAACGGTCC GCCGATCCGG GCTTCCGCGC CGTGCTCGCG CTTGGCGATT GCGCCCTTTC GGGTCGTCCG GGGGCGGCGA TCCCGCCCTT CGACGCGCGC CAACCGGGCG GCGGATGCCG AGCTGCGGCG CCTCCGCGCG CAGTTTCCCG ACTGGGACGT GCCGTCCGAC CTCACGACGC TGGGCCAGCA GCGGTCTCCC GCCGCCGAGA TCGACCGGAT CTACCGGCAG ATCGCGGCCG GAGACCTGAC CGAGGCCCGG CAGGCGATGG ACGAGACGTC GCGCAACTTC CCCGGATGGA CGCCGCCGCC CGAGATGGAG CGTCTTCTGG CCACGGCCGA GGCACAGGCC GCCTTCGATG CGGCCGCCAG TGCGGGCAAT GCGGGCGCGG CAATCGAGAT CGCGCGGCGG ACGCCCGCGA TCCTGCGCTG CGACCGGGTG AACAACGCCT GGCGGCTGGC CGAGCTGCAG GCGGCGGCGG GCCAGAAGGC GGCCGCGCTG CAGAGCTATC GCGGGGTGAT CGCCTCCTGC TCGGGCCTGT CCGAGGTGAC GGCGACGCTC GAGAAGGCGG AGGCCGTGGC CAGCGATGCG GAGCTGGTCG AGCTCTTCCG GCTGGCCAAT GCGCAGCTTC CGGGCTCGGG ACCTGCGCTG AAGGCGCTCG AGACACGGCT GAGGGCGGGA CGCGGCGACA CGGCGCCCGA GGCATCGGCG CCGGCTGCCG CAGCAACGGG CGGAGCCAAG CGCACGCCGG GCCGCACTGC GGTGGCCGAG GCGGATCTGC CCGCGGCGGG GCGCCCGCGC ACTGCGGGCG TGGCGCGCAG CGGCGGAGGG GCGGGGCTGT CCGCGGTCCG CGCGGCAGCG CAACGCGGCG ACTGGCGGAC CTGCACCGGC CTCACCAGCG GCGCCACCAG CGCCGACATG CTCTACGAGC GGGCCTGGTG CGTCTATAA
|
Protein sequence | MRRRTILTSA AAALMLAPAG RLLAQSGREA LPADHPLQAA WRSWKDAFLL PAGRIVDGPQ QNASHSEGQG YGATLAAIFG DEEALRRIVD WTEANLARRE DKLLSWRWLP GVALAVPDEN NATDGDLFYA WGLAMAAQRF GKADYAGRAT ELARAIALHC VRPHPDGSEQ LVLLPGASGF ETPDGVVLNP SYYMPRALTE LAAFSGQDRL ARCARDGADW IASLGLPPDW ALVTPFGTQP APGLSHNSGY DALRVPLFLL WSGLTANPAL RRAVEAAGDA AAGDTPVRFD RDTGAVLERS ADPGFRAVLA LGDCALSGRP GAAIPPFDAR QPGGGCRAAA PPRAVSRLGR AVRPHDAGPA AVSRRRDRPD LPADRGRRPD RGPAGDGRDV AQLPRMDAAA RDGASSGHGR GTGRLRCGRQ CGQCGRGNRD RAADARDPAL RPGEQRLAAG RAAGGGGPEG GRAAELSRGD RLLLGPVRGD GDAREGGGRG QRCGAGRALP AGQCAASGLG TCAEGARDTA EGGTRRHGAR GIGAGCRSNG RSQAHAGPHC GGRGGSARGG APAHCGRGAQ RRRGGAVRGP RGSATRRLAD LHRPHQRRHQ RRHALRAGLV RL
|
| |