Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_2800 |
Symbol | |
ID | 5540287 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 3620835 |
End bp | 3622727 |
Gene Length | 1893 bp |
Protein Length | 630 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640894927 |
Product | extracellular solute-binding protein |
Protein accession | YP_001432889 |
Protein GI | 156742760 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGACA ACAAACGACG ACTGACCCGG CGAACATTCC TGCGCGCTGC GGCTATCGGC ATTGGTTCAG CGACACTTGC CGCCTGTGGC GGCGGCGGCG GCGCCACTGC TCCTACAGCG CCGCCCGCAA CGGTTCCTCC GCCGACGACC GCTCCAACCC TCGCGCCGGT GCAACCCACG CCGCCTCCGA CGACAGCCCC CGCGCCCACG ACTGCACCAG CCAGCGCCGT GACCAACTCG CTCGGCGTCA CCCTGCCGGC AAACGCAGCG CCGCTTGAGC ATCAGGCATT TGTCGTCTAC TTCGACATCA CTGCCGACTT CACCAGCCCC AATCAGATGG AGACGATCTA CAAGTCGGGA GGGTTTGGCA GCATCACCAA CCTGACCGGC GATACGCTCG TGCGCCTGAA TAAAGATTTT CAGGTACAAC CGGCTGCTGC GCTTTCGTGG TCGTCGGATG AAACCGGCAA GGTCTGGACG TTCAACCTGG ACCCCAACCT GGTCTGGAGC GATGGCACAC CGGTGACAGC CGAAGACTTC GTGGCGACCT TCCGCTACGC TGCCGATCCG AAGCATGCCT GGGATTTCGC CTGGTACTAC AGCGCACCCG GCGCAATCAA GAACTGGGAC AAGTGCGTTG CTGGCGAACT GCCGCTGGAA GAGCTTGGCG TGACCGCCAA AGATGCCCAT ACGCTGGTTA TCGAAACCGA AACGCCAGCC CCTTTCTTGC CCGCAAAACT GGTCTACAGC GAGGTGTTGA GCGCCGCAAA ACTGAAGGAA TATGGTTCTG GTCTCTATAC CGCCGATCCG GCAAAGACGA TTTCGTGCGG ACCGTACCTG CTGAAAGAGT TCAAGCCGGG CGAGCGGGTG GTCTTCGAGA TCAACCCGAC GTACAAAGGC ACCAACCGCC CGCGCATTGA GCGTGTCATC CAGATCGCTG CGCGACCGGA AGCCATGTTC GCCGGGTATC AGGCAGGCGA GGTGGACCGC GTGACCGGAG AGCAGTTGCA GACTGCCGAT AACGAAATCA TTGCCCGCGA CCCCGAACTG TCGAAACAGG TGCGCCTCAC TGCTGCCGAT TTCCGCACGG ACTACCTCTT CTTCGACTGC CAGAACCCGC CGTTCAACGA CGTGCGGGTG CGCCAGGCGT TCAGCCACAT CATCGATCGT GACACGCTGA TCAAGACGAT CATCACGCCG ACGCAAGGCA TCCCGGCATA CTCGTTCCTG ATGCCGGGAT TCCCGGCGTC GAACTCCGAA GGGCTGAAGG ATATTCAGCG CTACGATCCC GAACGTGGCC GCGCGCTGCT GAAGGAAGCC GGCTATGAGG GAGGCAAGGG CTTCCCCAAA CTGACCCTCT GGCTGCGCAA CGAACCGCAG ATTCGCCAGG CGCTCGCAGC GGCGATTGCG GCGGCGATCA CGCAGGAGTA CGGCATCGAA GTCGAGGTCT CGAACAAAGA GTTCAAGACC TTTATGGACG CGCTCAACGC CAAGCCGACC CAGATTCAGT TCGGTATGGT GTCGTATGGC ATCGACTTCC TCGATCCGTC GAATATGCTC GGCGTCTGGC TCAGCACGGG GCGCCACAAC TGGTTCAACA AGAAGTTCGA CGAGATGGTG CTGAAAGCGG CAGAGATGAC CGATCAGGAA GCGCGCATCA AAATCTTCCA GGATGCCGAA CGGTTGCTCT GCGAAGAAGC GCCGGCGGTC TTTATCTATC ACCGCACCGT TGCCGATATC TACAAGCCGT ATGTCGTCGG CGAGTGCTTC GAGCCGAACA TCGCCGGATT CGCCGGATTG CAGTGGCCCG GCTTTACATC GATGAGCGAC TCACTCCAGA CGCTGTACAT CAGCGATGAA GTGACAAAAT ATCGCAAGGC GCCGCCGAAG TAG
|
Protein sequence | MTDNKRRLTR RTFLRAAAIG IGSATLAACG GGGGATAPTA PPATVPPPTT APTLAPVQPT PPPTTAPAPT TAPASAVTNS LGVTLPANAA PLEHQAFVVY FDITADFTSP NQMETIYKSG GFGSITNLTG DTLVRLNKDF QVQPAAALSW SSDETGKVWT FNLDPNLVWS DGTPVTAEDF VATFRYAADP KHAWDFAWYY SAPGAIKNWD KCVAGELPLE ELGVTAKDAH TLVIETETPA PFLPAKLVYS EVLSAAKLKE YGSGLYTADP AKTISCGPYL LKEFKPGERV VFEINPTYKG TNRPRIERVI QIAARPEAMF AGYQAGEVDR VTGEQLQTAD NEIIARDPEL SKQVRLTAAD FRTDYLFFDC QNPPFNDVRV RQAFSHIIDR DTLIKTIITP TQGIPAYSFL MPGFPASNSE GLKDIQRYDP ERGRALLKEA GYEGGKGFPK LTLWLRNEPQ IRQALAAAIA AAITQEYGIE VEVSNKEFKT FMDALNAKPT QIQFGMVSYG IDFLDPSNML GVWLSTGRHN WFNKKFDEMV LKAAEMTDQE ARIKIFQDAE RLLCEEAPAV FIYHRTVADI YKPYVVGECF EPNIAGFAGL QWPGFTSMSD SLQTLYISDE VTKYRKAPPK
|
| |