Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_3940 |
Symbol | |
ID | 5210924 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 4932077 |
End bp | 4933396 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640597536 |
Product | extracellular solute-binding protein |
Protein accession | YP_001278242 |
Protein GI | 148658037 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2182] Maltose-binding periplasmic proteins/domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAACTC CTGCCCTGAC CATCCGCCGG TATGCCTGTG CGCTGGCGAT CATGCTGCTG GCGGCATGCA GTCAACCCGG TGCAGCGCCG CCAACACCGG CGCCTGCTCC AACGGTTGCG CCGACAACGC CGGCAACGCC GCTGCCTTCG CCGACCGCTC CACCGACCGC AACGCCGCCA CCCACACCCA CTCCGGCGCC CGAACCGCTG ACGGTGTGGG TGGCGGCTGA CGAAGCGCAC CGCGACGCAC TGACCCGCCT GCTCACCGAC GCCGCTGCCG AAACCGGCGT TCCGGTCCGC ATCATGAGCG GCAGCCCCGA CGCCATGATC GCCCGGTTGC GCGTTGATCA ACTCGACGGG CGCCCGCCGC CAGACCTGAT CTGGGGGAAC GGCAATGACC TGGCAATCCT GCGCACGATG GGGCTGATTC AGGCGACGCG TCGGACGGCT TCTCCCAACG ACACGCTGCC CGCTGTCATC ACCGGCGCCA CTACCGACGG TCAACAGTGG GGTGAGCCGG TTGCGGCGCA GGGATTCTTG CTGCTGCTCT ACAATCGGAA ACTCGTGGAA CATCCGCCGC GCACCGTCGA TTCACTGATC GCCACTGCGC GTGCCAACAC TGGCGGCAAT CGGGTCGGAC TGGTCGCCGG ATGGACCGAA GCGCGCTGGT TTGCGTTGTG GCTGGATATG ACCGGTGGAA CGATGCTTGA TGCTGATGGG ATGCCGCTAC TCGACACACC CGCAGTTATC GCAGCGCTCG ATCTGCTGCG CACCATGCGA CGGTATGGAC CGACATCCCC CTCGACCTAT GACGAAGGCG CACGGTTGTT CCGTCGCGGC AGGGCAGCGC TGGCAATCGA CGGCGACTGG GCGCTGGAGA GTTATCGCGG ATTGACCGAG ACGCTGGAGT TGGGCATTGC GCCGCTGCCG TTGACCAGCC GCGGAACGCC AGCGACAGCG CCGCTGACCG GCGTCTACCT GATGTACGGC GCCGCGCTCG ACGCCTCACG CCTGGCACAG GCGGAAGCGC TTGCACAGAC CCTGCGCGAA CCGGCATGGC AGGCGCGCAT TGCCCGCGAC ACAGGGATGC TCCCGGCTTC CATCGCTGCA CTGAGCGACC CGGCGGTCAC TGACGATCCG GCGCTTGCCG CCGCAGCGCA GTACGCCAGA AACGCACCCG GCATCCCGCC CGACCGCCCG ATCCGCTGCG CCTGGGATGC TATCGAAGCG GCGCTCTCCC CGTTTTTGCT TGGCAAACGC ACCGCCGCCG AAACCGCATC AGCGATGCAA CAGCGCGCCG ACGCCTGTGC ACGTCAGTAG
|
Protein sequence | METPALTIRR YACALAIMLL AACSQPGAAP PTPAPAPTVA PTTPATPLPS PTAPPTATPP PTPTPAPEPL TVWVAADEAH RDALTRLLTD AAAETGVPVR IMSGSPDAMI ARLRVDQLDG RPPPDLIWGN GNDLAILRTM GLIQATRRTA SPNDTLPAVI TGATTDGQQW GEPVAAQGFL LLLYNRKLVE HPPRTVDSLI ATARANTGGN RVGLVAGWTE ARWFALWLDM TGGTMLDADG MPLLDTPAVI AALDLLRTMR RYGPTSPSTY DEGARLFRRG RAALAIDGDW ALESYRGLTE TLELGIAPLP LTSRGTPATA PLTGVYLMYG AALDASRLAQ AEALAQTLRE PAWQARIARD TGMLPASIAA LSDPAVTDDP ALAAAAQYAR NAPGIPPDRP IRCAWDAIEA ALSPFLLGKR TAAETASAMQ QRADACARQ
|
| |