Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_0383 |
Symbol | |
ID | 5537845 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 480347 |
End bp | 482203 |
Gene Length | 1857 bp |
Protein Length | 618 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640892546 |
Product | extracellular solute-binding protein |
Protein accession | YP_001430533 |
Protein GI | 156740404 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.531894 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.778896 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTTCGA TGCGTGACGT ATGGCGGCGA AGCGGCGCGC TGGCGCTGCT TCTTATCCTG ATCATCCCGG TCCTGGCGGC GTGTGGCGGT CAGCAACCGG CAGCGCAACC GACGGCAGCG CCAGCGCAAC CGACGGCAGC GCCAGCGCAA CCGACGGCAG CGCCAGCGCA ACCGACGGCA GCGCCAGCGC AACCGACGGC AGCGCCGACG GCGCCACCTG CCGCACAGCG GGGCGGGCGC CTGAAAATCC TCTACTGGCA GGCGGTGACG ACCCTCAACC CGCACCTGGC GACCGGTACG AAGGACTTTG ATGGCGCGAC CGTTATCCTC GAACCGCTGG CGCGTTACAA CGAAAAAGAT GAACTGGTGC CCTTCCTGGC GGCTGAGATT CCCACCATCG AGAATGGCGG CGTCGCTCCG GATGGAACCA GCGTGACCTG GAAACTCAAA CCAGGGCTCA AGTGGTCGGA TGGGAGCGAT TTCACAGTCG ACGATATCAT CTTTACCTGG CAGTACTGTG CTGATCCGGC GACGGCCTGC ACGACGAAGG CGGTCTTCGA TCCGATCGCC AATGTCGAGA AGGTCGATGA TACGACGGTC AAGATCACCT GGAAAGAGCC TACTGCCTAC CCCTACATCG CCTTCGTCGG TCCGAATGGC ATGATCCTCC AGAAGAAGCA GTTCGAGAAG TGCATCGGCG CGGCAGCCAG CACCGATGCG GCATGCCAGG CGGCGAATCT GGCGCCGATC GGTACGAATG CATGGAAGCT GAAGGAGTTC AAGCCGGGCG ATGTGGTGAT CTATGAGCGT AACCCCTTCT TCCGCGATGC CGACAACGTC TTCTTCGACG AGGTCGAGAT CAAGGGCGGC GGTGATGCTG CCTCAGCCGC GCGCGCCGTC TGTGAGACGG AAGAGGTGGA TTTTGCGTGG AACTTGCAGA TTCCGAAGGC GGTGCTCGAG CCGATCCTTG CGAGCGGTAA GTGCGACCCG CTGGCCGGCG GTTCGTTCGG TGTCGAGCGT ATTGTCGTCA ACTTCGCCAA CCCCGATCCG GCTCTCGGCG ATAAGCGCAG TGAACCGGAT CAACCGCATC CGTTCCTGAC CGATCCTGCC GTGCGCAAGG CGATCTCGCT GGCGATTGAC CGCAAGGCGA TTGCTGAGCA GTTGTACGGA CCGACCGGCA AGCCGACCTG CAATGTGCTG GTCGTGCCTG CATCGGTTAA CTCACCGAAC CTGACGTGTG AGCGCGATGT CGAAGCGGCG AAGAAGTTAC TCGAGGATGC GGGCTGGAAG TTGAACGGCT CGGTGCGCGA GAAGGAAATC GGCGGGAAAC CGGTCCGGCT TGTCGTCAGT TTCCAGACCT CAATCAACCC GCTGCGCCAG AGCACGCAGG CGATCATCAA GTCGAACCTG GCGGAGATCG GCATTCAGGT GAACGTCAAA GCCATCGATG CCAGTGTCTT TTTCGGCGGT GATGAGGGCA ACCCGGATAC GCTGAACAAG TTCTACGCCG ACCTCCAGAT GTATACGAAC GGTCCGAGCA GCGCCGATCC GCAGCAATAC CTCCAGGGGT GGCTCTGCTC CGAGCGCGCG TCGGCGGCGA ACCGGTGGAA TGGCAACAAC GACGGACGTT ATTGCAACCC GGAGTATGAC GCCCTCTTCG AGCAGTTGAA GAAGGAACTT GATCCGAAGC AGCGCGCCGA ACTGGCGATC AAGATGAACG ATCTGCTGGT GACCGATGGC GCCATCATTC CGCTTATCAA CCGCCAGACG CCGAATGCGA AGGTGAAGGC GCTCAAAGGT CCGACCTTCA ATACGTTCGA CTCGAGCATC TGGAATATCG CCTCCTGGAG CAAGTAA
|
Protein sequence | MRSMRDVWRR SGALALLLIL IIPVLAACGG QQPAAQPTAA PAQPTAAPAQ PTAAPAQPTA APAQPTAAPT APPAAQRGGR LKILYWQAVT TLNPHLATGT KDFDGATVIL EPLARYNEKD ELVPFLAAEI PTIENGGVAP DGTSVTWKLK PGLKWSDGSD FTVDDIIFTW QYCADPATAC TTKAVFDPIA NVEKVDDTTV KITWKEPTAY PYIAFVGPNG MILQKKQFEK CIGAAASTDA ACQAANLAPI GTNAWKLKEF KPGDVVIYER NPFFRDADNV FFDEVEIKGG GDAASAARAV CETEEVDFAW NLQIPKAVLE PILASGKCDP LAGGSFGVER IVVNFANPDP ALGDKRSEPD QPHPFLTDPA VRKAISLAID RKAIAEQLYG PTGKPTCNVL VVPASVNSPN LTCERDVEAA KKLLEDAGWK LNGSVREKEI GGKPVRLVVS FQTSINPLRQ STQAIIKSNL AEIGIQVNVK AIDASVFFGG DEGNPDTLNK FYADLQMYTN GPSSADPQQY LQGWLCSERA SAANRWNGNN DGRYCNPEYD ALFEQLKKEL DPKQRAELAI KMNDLLVTDG AIIPLINRQT PNAKVKALKG PTFNTFDSSI WNIASWSK
|
| |