Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_1637 |
Symbol | |
ID | 4895209 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | + |
Start bp | 1729800 |
End bp | 1730918 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640112230 |
Product | aromatic hydrocarbon degradation membrane protein |
Protein accession | YP_001043519 |
Protein GI | 126462405 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG2067] Long-chain fatty acid transport protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.381986 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGTCT GGACCATGGC GCTGGGAGGG CTGGCGCTCG GCGCCGGTGC TGCGGCGGCG GGCGACATCG AACGGTCGCA GCAGAGCGTG GGCATCCTGT TCGAGGAAGG GCGCTATGCC GAATTCACCT TCGGCGCGGT CAACCCGGAC GTGAGCGGTT CGGTGGGCGG GCTCGGCTCG GGCAACATGG CGGGCAACTT CAACACCTGG TCGCTCGGCT ACAAGCAGCC TCTGGGCGCC AACATGGATC TCGCGCTGAT CCTCGACCAG CCGATCGGCG CCGACGTGGA CTACCCCGAC GAAGGCCTCT ACCCGCTGGC GGGCACGACG GCCGAGCTGC GCTCGAACGC GATCACGGCC CTCCTGCGCT ACAAGTTCCA GAACAACGTC TCGCTCTACG GCGGGCTCCG CGCGCAATCG GTCGAGGGCA AGGCCCACAT CCTGTTCGAC ATCTACCAGA GCGGGACCTT CGTCGCGACG ACCGACTACA ATCTCGAGAC GAACCGGGAC TGGTCGCTGG GCTATGTGCT GGGCGTGGCC TGGGAGAAGC CCGAGATCGC CGCCCGCGTG GCGCTGACCT ACAACTCGGC CATCGACCAT ACGCTCGAGG GCGACGAGAG CGGCACCAAC GCGGTTTTCG GCAATTTCTC GGGCAAGAGC GACTTCGACA CGACGGTGCC GCAGTCGGTC AACCTCGAGT TCCAGACCGG CATCGCCGAG GACACGCTGC TGTTCGGCTC GGTGCGCTGG GTGGACTGGA CCGAGTTCCT GATCGATCCC GAGATCTACC AGTATTACTT CCCCGGCATC CCTCTGGTGG CCTACGAGTC CGACCGCTGG ACCTACACGC TGGGCGTCGG CCGCCGCTTC AACGAACACT GGTCGGGCGC CGTCACCGTC TCCTACGAAC CTCAGACGGG CGACGACACG GGCAACCTCG GGCCGAACGA CGGGTTCCGC TCGATCGGGA TCGGCGCGAC CTACTCGCAC GAGAACATGA AGATCACCGG CGGCATCCGC TATGTCGAAC TCGGCAATGC GACCACGCGC GGCGTGGGTG CCGAATTCGA CGACAACAAT GCCGTCGCCG CGGGCATCCG CGTGGGCTTC ACCTTCTGA
|
Protein sequence | MKVWTMALGG LALGAGAAAA GDIERSQQSV GILFEEGRYA EFTFGAVNPD VSGSVGGLGS GNMAGNFNTW SLGYKQPLGA NMDLALILDQ PIGADVDYPD EGLYPLAGTT AELRSNAITA LLRYKFQNNV SLYGGLRAQS VEGKAHILFD IYQSGTFVAT TDYNLETNRD WSLGYVLGVA WEKPEIAARV ALTYNSAIDH TLEGDESGTN AVFGNFSGKS DFDTTVPQSV NLEFQTGIAE DTLLFGSVRW VDWTEFLIDP EIYQYYFPGI PLVAYESDRW TYTLGVGRRF NEHWSGAVTV SYEPQTGDDT GNLGPNDGFR SIGIGATYSH ENMKITGGIR YVELGNATTR GVGAEFDDNN AVAAGIRVGF TF
|
| |