Gene Information |
Locus tag | RoseRS_0093 |
Symbol | |
ID | 5207026 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 109432 |
End bp | 111246 |
Gene Length | 1815 bp |
Protein Length | 604 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640593725 |
Product | extracellular solute-binding protein |
Protein accession | YP_001274484 |
Protein GI | 148654279 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
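The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. The record does not state either convention explicitly, and the function names `gene_length` and `protein_length` are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for RoseRS_0093.
length = gene_length(109432, 111246)
print(length)                  # 1815
print(protein_length(length))  # 604
```

Both results agree with the Gene Length (1815 bp) and Protein Length (604 aa) fields, which suggests the assumed conventions fit this record.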
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.503977 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCGAC AACGACTCAG AAGCGCCGCA TGGTTGATCG TGCTGCTTGC CGTTGCAGGC ATCGCCCTGG CAGGTTGCAC AATTGAAGGC GGTGTTCCGA CCCCGACCAT CGCACCAACA CGAGCGCCGA TCGGTCCGAT CCCCACCAGC GCGGTCATCG CTGAACGCAT GGCGGCGCGT AGCGACACAT GGATGATCGG GATGGTCGAT CTTCCGCCGG ACATCTACCC GTATCCCCAA TCGGCAGCGA CCCAGCGCGC AACGGCGCCG ATTACCGAAC TGCTGTTTCC GGCGCCGATC CTGACCTACA ACTACGGGTA CACCGTGACC GGTGTCCTCG AGCGTATTCC GACCCTCGAC AATGGTGATG CCGAACTGCG CAAGGTCGAT GTGTATCTGG ATGCGACCGG CGCGATAACC ACGACAGCGA CCGATGTTGT TACCCAGGTT GATCAACTGG TCATCACCTT TCGCTGGAAC CCGCAATTAC GCTGGTCTGA CGGCACACCC GTCACCGCTG ATGATTCGGT GTTCGCCTAC GAACTGGCGA AAGCGGCGCC GCCGGGCGAT GCCGCCGCCG AGTTGCTGGC GCGGACAGTT GCATACGAGA AGGTCGATGA CCATACGACG CGCGCCGTTC TGCGCCCGGA TTACGTTGGT CCAGCGTATT TCATGAGTTA CTGGACTCCT CTGCCGCGCC ATCTGCTTCG GGGCGTTGAT CCAGTGCGGG TGCGCGAGAG TGAGTTCGCG CAACGTCCAG TCGGATACGG TCCTTATGCA CTTGTCGAGC GAACTTCCAC CGAACTGCGC TTCGAGCGCA ATCCGTATTA TTTCGGTCCG GCGCCTTCCG CGTCACGCCT GGTGATCCGC GCTTTTGCCG ATCTTGAACT GTTGCGCGCC AATCTGCTCA ACGGCAATCT TGACCTGGGA TTTGCCGATC GCATTCCGCC GGCAATGCTG GATCGTTTCG CCAGCGATGC CAGCGAACAA ACCTTGCAGG TAATGACCGT TCCCAACCCT ATCTGGGAGC ATATTGTGTT CAACCTGGAT GTTCCAATCC TTCAGGATAT TCGTGTGCGG CGCGCTATCG CATACGGTAC GAACCGGCAG GCAATCGCCG ATGCGCTGTT CGGCGGGCGG ACACCGGTGC TGGACAGTTG GGTGCTGCCG GGGGATCAAC TCGCCGCTCC GCCTGACCAC TTGACCCGCT ACCCCTACGA TCCCGATCAG GCGCGACAGT TGCTTGAAGA GGCAGGATAT GCCGATCCTG ACGGCGATGG CATCCGTGCC ACCGCCGAGG GGGTCGCACT GACGCTGCAA TTGCTGACCA CCCAGGGGAG TGCAGTGCGC AGCGAGATTG CCCGGCGATT CCAGCAGGAT ATGCACGCCC TCGGTATCGA AATCGAGATC AACGAAGCCC CATCCGAAGA GATGTTCGAT GCCGACGGTC CTCTCTACCT GCGACAGTTC GATCTGGCGC TCTTTGGATG GATTGCCGGA GCAGAGCCGG GCGGATTGCA ACTCTGGAGT TGCGCTGCGG TTCCCTCTGA GAGCAACGGG TATCGCGGCG AAAACTTCGC CGGTTGGTGC TTCCGCGATG CCGATCGCGC CGTGCGAACC GCCGATACAA CCCTTGATCC GGTCGAGCGC GCAGAAGCAT ATCTGCGTCA GCAGCAACTC TGGACACAGG AACTGCCAGC GCTGCCCCTC TTTCAGCGGT TGAGCATCGT CGCGGCAAAC CCCGGCGTCG AGGGGCTTTC GCCCGACGCC CTTGCCCCTG TGACGTGGAA TGTATCGGCG 
TGGAAGCGGA AGTAA
|
Protein sequence | MIRQRLRSAA WLIVLLAVAG IALAGCTIEG GVPTPTIAPT RAPIGPIPTS AVIAERMAAR SDTWMIGMVD LPPDIYPYPQ SAATQRATAP ITELLFPAPI LTYNYGYTVT GVLERIPTLD NGDAELRKVD VYLDATGAIT TTATDVVTQV DQLVITFRWN PQLRWSDGTP VTADDSVFAY ELAKAAPPGD AAAELLARTV AYEKVDDHTT RAVLRPDYVG PAYFMSYWTP LPRHLLRGVD PVRVRESEFA QRPVGYGPYA LVERTSTELR FERNPYYFGP APSASRLVIR AFADLELLRA NLLNGNLDLG FADRIPPAML DRFASDASEQ TLQVMTVPNP IWEHIVFNLD VPILQDIRVR RAIAYGTNRQ AIADALFGGR TPVLDSWVLP GDQLAAPPDH LTRYPYDPDQ ARQLLEEAGY ADPDGDGIRA TAEGVALTLQ LLTTQGSAVR SEIARRFQQD MHALGIEIEI NEAPSEEMFD ADGPLYLRQF DLALFGWIAG AEPGGLQLWS CAAVPSESNG YRGENFAGWC FRDADRAVRT ADTTLDPVER AEAYLRQQQL WTQELPALPL FQRLSIVAAN PGVEGLSPDA LAPVTWNVSA WKRK
|
| |
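The gene sequence above is displayed in space-separated blocks of 10 bases, so it has to be normalized before any computation. The sketch below is one possible way to do that and to recompute a GC fraction for comparison with the GC content field. The helper names `clean_sequence` and `gc_content` are hypothetical, and the example uses only the first two blocks of the sequence rather than the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two 10-base blocks of the gene sequence, as displayed in the record.
fragment = clean_sequence("ATGATCCGAC AACGACTCAG")
print(len(fragment))         # 20
print(gc_content(fragment))  # 0.5
```

Applying the same two functions to the full gene sequence would give a value to compare against the reported 61%. The fragment's figure is not expected to match, since it covers only 20 of the 1815 bases.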