Gene Information |
Locus tag | Rcas_0516 |
Symbol | |
ID | 5537979 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 671225 |
End bp | 672691 |
Gene Length | 1467 bp |
Protein Length | 488 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640892678 |
Product | extracellular solute-binding protein |
Protein accession | YP_001430664 |
Protein GI | 156740535 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| |
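The coordinate and length fields above are mutually consistent, which can be checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that does not contribute to the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 671225
end_bp = 672691

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1467
print(gene_length % 3)  # 0, i.e. a whole number of codons
print(protein_length)   # 488
```

Under these assumptions the computed values match the listed Gene Length (1467 bp) and Protein Length (488 aa).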
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATAA GACAGTACCC CTGGATTCTC CTGCTCGGTG CGTTGATGCT GGTGCTGGCA GCATGTGGCG GGCAACAAGG CGCCTCCCCC ACGGCAGCGC CGGGCGAAGC GCCAACCACT GCGCCGGCCA GTGACCTGCC GACTCCTACG CCGGTTATTG AGTTTGCGCA GCAACCACAG TCCGGTCAGA AGGTGCTGGT CTGGATGGTG CGCATCAACA CGGTCGAAAA CCGCTGGGAG CGTGATGTTG TTCTGCCGGC ATACCAGCAG GTCGCCCCCG ATGTATTCGT GAAGGTGCTC AATATCAACC AGGACGATAT CGCCGTCAAG CGTGAGGCGA TGATCGCCGC CAAGGAACCG CTGCACGTCT GGTCGTCGAA CTGGGGCGGC GATGGCTTCG CCAGCGACCG CTTCCGCGGC TTGCTGGCAG ACCTGACGCC GCTCATCGAA CGCGACAAGT GGGATACGAG CGACTTTATC CCCGAAGTGT TCGGCATCTA CAACGTCGAG GGCAAGCAGT ACGGCATTCC GTTCCTCACG ACCGGCAGTT ATGTGTACTA CAACATGAAA CTGTTCGACG AGGCTGGCGT GCCCTATCCG CCGAGCGACT GGAACGATAA GTCGTGGACG TGGGATGCAT TCCTTGAGAC TGCCAAGAAA TTGACGAAGA ACCCGGATGA TCCGAGCACG GCGGTGTACG GCGGTGTCAA CGGTTTGTGG CCCCCCTTCG ACAGTATTCC GATGATCTGG GGAAAGGACC CGTTCACAGC ACAAGCGCTC GAGACCGGCT TCTCCGATCC GATCAAACTG GATGAACAGA CGGCTGCGGC GTTCCAGGCG GTTCACGATC TGGTCTACGT CCATAAGGTC GCGCCCGACC AGGCGGCGTC TCAGGCGCTC GATCAACTGG GTGGCGCCTT CCTCTCCGGT CGAGTCGCCA TGTTTATGAC CGGCGGCTGG GGTCACTGGA ACTATAAGGA TATCATCGAC GATCCCAACG GTTTCTGCTG GGGTGCAGCG CCACTCCCCT GGGGGACGCC CGATGCCACT ATCCGCGCGA CGATCTTTAC CGATCCCTGG GTCGTTACTG CTGGCATGGA TGCAGAGAAC ACCGATATGG CATGGAACTT CGTGAAGTTC CTGGCATCGG CAGAGCAGCA GCGCGCTTAT ACCCTGGCTA CCGGAACCCC GCCGGTGCGC CAGAGCCTGC TCAATGATTA CTACAAGCAG TATGAGAAGT GCGTCCCGGC AGAAAAGACG AAAGAGTCGT TCCAGGGCGC CTTTAGCCAC GGACGCGAGT CATCGAACCA CCTGCTGGTC AAGTTCGATG AACTCAGCCA GACCTGGGAT AACCTCCTGA GTCCGTTCTG GAACGATCCG AACGCGAAAG CCTCGGATCT GATGCCGCTC CTCGAGTCGG ACGTCAATGC CGCTCTGGAG CGCATCCGCA AAGAGGCGGG CAGGTAA
|
Protein sequence | MKIRQYPWIL LLGALMLVLA ACGGQQGASP TAAPGEAPTT APASDLPTPT PVIEFAQQPQ SGQKVLVWMV RINTVENRWE RDVVLPAYQQ VAPDVFVKVL NINQDDIAVK REAMIAAKEP LHVWSSNWGG DGFASDRFRG LLADLTPLIE RDKWDTSDFI PEVFGIYNVE GKQYGIPFLT TGSYVYYNMK LFDEAGVPYP PSDWNDKSWT WDAFLETAKK LTKNPDDPST AVYGGVNGLW PPFDSIPMIW GKDPFTAQAL ETGFSDPIKL DEQTAAAFQA VHDLVYVHKV APDQAASQAL DQLGGAFLSG RVAMFMTGGW GHWNYKDIID DPNGFCWGAA PLPWGTPDAT IRATIFTDPW VVTAGMDAEN TDMAWNFVKF LASAEQQRAY TLATGTPPVR QSLLNDYYKQ YEKCVPAEKT KESFQGAFSH GRESSNHLLV KFDELSQTWD NLLSPFWNDP NAKASDLMPL LESDVNAALE RIRKEAGR
|
| |
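The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is an illustrative sketch, not the database's own method: the helper names `clean`, `gc_content`, and `translate` are hypothetical, and the codon table is the standard elongation code, which translation table 11 shares (table 11 differs from the standard table only in its permitted start codons). The two short inputs are the first 21 and last 15 bases of the gene sequence listed above.

```python
from itertools import product

# Standard codon-to-amino-acid mapping in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 21 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGAAGATAA GACAGTACCC C"))  # MKIRQYP
print(translate("GAGGCGGGCAGGTAA"))          # EAGR
```

The outputs match the first seven and last four residues of the listed protein sequence. Passing the full gene sequence to `gc_content` gives the value to compare against the GC content field.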