Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_4599 |
Symbol | |
ID | 5211585 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 5763828 |
End bp | 5765693 |
Gene Length | 1866 bp |
Protein Length | 621 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640598178 |
Product | extracellular solute-binding protein |
Protein accession | YP_001278880 |
Protein GI | 148658675 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.593059 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTTCGA TGCGTAACGT GTGGCGGAAG AGCGGTGCGC TGGCGCTGCT CCTCATCCTG ATCGTCCCGG TTCTGGCAGC GTGTGGCGGG CAGCCAGCCA CAGCACCGAC CACGGCGCCA GCGGCGCAAC CGACGACTGC ACCCACTGAG GCGCCAGCGG CGCAACCGAC GACTGCGCCC ACTGAGGCGC CAGCGGCGCA ACCGACGACT GCGCCGACCA CTCCGCCAGC GGCGCAGCGC GGCGGTCGCC TGAAGATCCT GTACTGGCAG GCGGTGACGA CCCTCAACCC TCACCTGGCG ACAGGTACGA AGGACTTCGA TGGCGCAACG GTTATCCTCG AACCGCTGGC GCGCTACAAC GAGAAGGATG AACTGGTGCC CTTCCTGGCG GCGGAGATTC CAACGATCGA GAACGGCGGC GTCGCTCCTG ATGGAACCAG CGTCACGTGG AAGCTCAAAC CAGGGCTGAA GTGGTCGGAT GGCAGCGATT TCACGGTTGA CGACATTATC TTCACCTGGC AGTACTGCGC CGATCCTGCG ACTGCCTGCA CCACCAAGGC GGTCTTCGAC CCGATCGAGA AGGTCGAGAA GGTCGATGAT ACCACTGTCA AGATCATCTG GAAGGAGCCG AACGCCAACC CGTACATCTC GTTCGTCGGT CCGAACGGCA TGATCCTGCA GAAGAAGCAG TTCGAGAAGT GCATCGGCGC GGCAGCCAGC ACCGATGCGG CGTGCCAGGC GGCGAACCTC GCGCCGATCG GTACAAATGC CTGGAAGTTG AAGGAGTTCA AGCCGGGCGA TGTGGTGATC TACGAACGCA ACCCCTTCTT CCGCGATGCC GATAATGTCT TCTTCGACGA AGTGGAGATC AAGGGCGGCG GTGATGCTGC GTCAGCAGCG CGCGCCGTGT GTGAGACGGA AGAGGTCGAT TTCGCCTGGA ATCTCCAGAT CCCGAAAGCG GTTCTCGAGC CGATCCTTGC TTCCGGTAAG TGCGACCCGA TCGCCGGCGG TTCGTCCGGA GTTGAGCGCA TTGTGGTCAA CTTCTCGAAC CCCGACCCGG CACTGGGCGA CAAGCGCAGC GAGCCGGATC AACCCCACCC CTTCCTGACC GACCCTGCAG TGCGCAAGGC GATCTCGCTG GCGATTGATC GCAAGGCAAT CGCCGAGCAG TTGTACGGTC CTACCGGCGA ACCTACCTGC AACGTGCTGG TGGTGCCGAC TGCGGTCAAT TCGCCAAACC TGACATGCAA CCGCGATGTC GAAGCAGCGA AGAAGTTGCT CGAAGATGCG GGCTGGAAGT TGAAAGGCTC GGTGCGGGAG AAGGAGATCG GGGGGAAAAC GGTCAGACTG GTCGTCAGTT TCCAGACCTC GATCAATCCG CTGCGCCAGA GCACCCAGGC GATCATCAAG TCGAACCTGG CGGAGATCGG TATTCAGGTG AACGTCAAAG CCATTGATGC CAGCGTCTTC TTCAGTGGCG ATCCAGGCAA CCCGGATACC CTGAACAAGT TTTACGCCGA CCTCCAGATG TACACCAACG GTCCGAACAA CGCCGATCCA CAGCAGTACC TCCAGGGCTG GACCTGCGAA GAGCGCGCTT CGGTCGCCAA TCAGTGGAAC GGCAACAATG ACGGTCGCTA CTGCAACCCG GAGTACGACA AGCTTTTCGA GGAGTTGAAG AAGGAACTTG ACCCGAAGAA GCGCGTCGAA CTGGCGATCA AGATGAACGA TCTGCTGGTG ACCGATGGTG CGGTGATTCC GCTGATCAAC CGCCAGACGC CGAATGCGAA GGTGAAGGCA CTCAAGGGAC CCACCTTCAA CACGTTCGAC TCCGACATCT GGAATATTGC CTCCTGGAGC AAGTAA
|
Protein sequence | MRSMRNVWRK SGALALLLIL IVPVLAACGG QPATAPTTAP AAQPTTAPTE APAAQPTTAP TEAPAAQPTT APTTPPAAQR GGRLKILYWQ AVTTLNPHLA TGTKDFDGAT VILEPLARYN EKDELVPFLA AEIPTIENGG VAPDGTSVTW KLKPGLKWSD GSDFTVDDII FTWQYCADPA TACTTKAVFD PIEKVEKVDD TTVKIIWKEP NANPYISFVG PNGMILQKKQ FEKCIGAAAS TDAACQAANL APIGTNAWKL KEFKPGDVVI YERNPFFRDA DNVFFDEVEI KGGGDAASAA RAVCETEEVD FAWNLQIPKA VLEPILASGK CDPIAGGSSG VERIVVNFSN PDPALGDKRS EPDQPHPFLT DPAVRKAISL AIDRKAIAEQ LYGPTGEPTC NVLVVPTAVN SPNLTCNRDV EAAKKLLEDA GWKLKGSVRE KEIGGKTVRL VVSFQTSINP LRQSTQAIIK SNLAEIGIQV NVKAIDASVF FSGDPGNPDT LNKFYADLQM YTNGPNNADP QQYLQGWTCE ERASVANQWN GNNDGRYCNP EYDKLFEELK KELDPKKRVE LAIKMNDLLV TDGAVIPLIN RQTPNAKVKA LKGPTFNTFD SDIWNIASWS K
|
| |