Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_3050 |
Symbol | |
ID | 5210018 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 3828692 |
End bp | 3830965 |
Gene Length | 2274 bp |
Protein Length | 757 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640596642 |
Product | extracellular solute-binding protein |
Protein accession | YP_001277364 |
Protein GI | 148657159 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0190244 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.13375 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTTTCC GAAACAGATG TTCACCACGT TCCTCACACT CTGACATCAG CTACGAGGAG GTACCTGTGT CTAGAACACG CACACTGAGC CGACGACGAT TCCTGACGCT TTCGGCGATG ACTGCTGCCA GTGCGGCGAT TGCTGCGTGT GGCGGCGGTC AGCCTGCTCA GGCGCCAACG TCTGCACCGG CGCCGACATC TGCACCGGCG CCGACATCTG CTCCCGCGCC AACTGTTGCC ATCCCGCCCA CCACGCCGCC CCAGGCGACT GCTGTGCCCC AGGCGGTCAG CAGGTTCAAA GAGTCGCCCG AACTGGCAAA ACTGGTCGCC GAAGGGAAAC TGCCGCCCGT TGATCAGCGT CTGCCGAAGA ACCCTTACGT TGTGCCGCAC AAATGGCTCA GTGTCGGCAC GTATGGCGGG ACGCTCAACT TCACCAACTC GTGGGGACCC GATGGTATGG CGACGATTGT GCAGGAGAGC CAGTACGGTC ACTCGATCCT GCGCTGGCTT GACGATGGCT TGAAGATTGG TCCGGGGCTT GCTGAAAGTT GGGAAGCGAA TGCCGATGCC AGCGAGTGGA CGTTCAAGTT CCGCGAGGGG CTCAAGTGGT CTGACGGGCA TCCCTGGACG GTCGATGACA TTCTGTACTG GTGGGAGTAT ATGGTCGGCG GCAACGGGAA GGAGAAAGAG TTTCCGGAGG GGTTGAAGCC GATCGAGCCG CCGCCGGACG AGGGACGTTC CGGTACTGGA ACGCTGGCGA CTCTCATCAA AGTCGATGAC TACACGCTGA CGATGAAGTT CGATGCGCCA GCGCCGCTGA CTGCTGATCG CCTGGCGATG TGGGTCAACG CAGGCATCGG ACCGCGCTGG ATGGCGCCGC GGCATCATAT GGAGCAGTTC AACCCGGTGC TCAACCCTGC AAAATACAAA GACTGGGACG AACATACCAA GCGCATCCGC TTCGTCACCA ACCCCGATTG CCCGACGATG ACGGGATGGA AGTGTGAAAG TTTTGAGGAG GGCGTGCGTG GCGTTTATAC CCGCAACCCG TATTACTGGT GTGTCGACGC TGAGGGGAAT CAACTGCCAT ACATCGACCG CATCGTTTCG ACCACCTTCC AGAACAGCGA AGTCGAGAAG TTGAATGTCA TCCAGGGGAA GAGCGACTTC TCGCACCACT GGGTGCTCAG TCTCGATGAT GTGCCCAACC TGCGCCAGAA TGCTGAAGCA GGCGGGTATG AAGTGCGGTT CTGGGAGAGC GGATCCGGGT CGGGAACCTC GTTCTTCTTC AGTTACGACT ACAACGATCC GAAGTGGCGG GAATTGATCC GCAATCCGAA GTTCCGGCAG GCGCTCTCGC TGGCGTTCAA CCGCGCTGAA CTGCAAAAAA CGCAGTACTT CGGCACCGGT GAGTTGACAA CCGGCACCTT CAGCCCGAAA GCCATCGAGT ACAATATCGA CGAAAAGGGG AACCCGGACC GGGCAAACGC GCGGTACCGC GAATGGCGCG ATAGTTACGT GGCGTTCGAT CAGGAACGGG CAAAGAAACT GCTCGACGAC ATTGGGGTCA AGGTTGGTCC CAACGGTTTC CGTACCTTCC CCGACGGTTC ACCGCTGGAG ATTTTGCTGG TCGTCGCCGC CAACACCAGC AAGAACACGA TTGCTCAGAA TGAGCAGATG ATCCGCGACT GGGGGCAGAT CGGCATCAAG GCGACGCTGA CCCCGGTGCC GCCGCAGGGA CGCCGTGAGG ACTGGTTCGC CGGCAAACTG ATGTCGAACG CCGACTGGGG CATCGGTGAT GGTCCGAACC ATCTGGTGTA TCCGCAGTGG CTGGTGCCGA TTGAGCCGGA ACGCTGGGCG CCGCTTCACG GACAGGGGTA TCAGTTGCGC GGTACGGCAA GCGAGAAGGA AGAACTGGAC AAAGATCCGT GGCAACGCGC GCCTGCCCGA ATCGTGCCGA CCGATAAGGA TTTCGACCCG ACCATCGCAA AACTGCACGA GATCTACGAT AAGACGAAAG TCGAACCGGA TTTCCTGAAG CGCACACAGA TGGTCTGGGA GATGATCAAG ATCCACGTCG AGAACGGTCC GTTCGTGCAG GGGTCGGTAG CGAACTTTGA CCGCGTGTTC ATCGTCAAGA AGGGCTTGAT GAACGTTCCC CGGAAAGAAG ACCTTGCGCT CGGCGGCTTC ACCGATCCCT GGATCCATCC AACGCCGGCG GTGTACGATC CTGAAGCATG GTATTGGGAC GATCCGGCGA AGCATAGCGG TTGA
|
Protein sequence | MFFRNRCSPR SSHSDISYEE VPVSRTRTLS RRRFLTLSAM TAASAAIAAC GGGQPAQAPT SAPAPTSAPA PTSAPAPTVA IPPTTPPQAT AVPQAVSRFK ESPELAKLVA EGKLPPVDQR LPKNPYVVPH KWLSVGTYGG TLNFTNSWGP DGMATIVQES QYGHSILRWL DDGLKIGPGL AESWEANADA SEWTFKFREG LKWSDGHPWT VDDILYWWEY MVGGNGKEKE FPEGLKPIEP PPDEGRSGTG TLATLIKVDD YTLTMKFDAP APLTADRLAM WVNAGIGPRW MAPRHHMEQF NPVLNPAKYK DWDEHTKRIR FVTNPDCPTM TGWKCESFEE GVRGVYTRNP YYWCVDAEGN QLPYIDRIVS TTFQNSEVEK LNVIQGKSDF SHHWVLSLDD VPNLRQNAEA GGYEVRFWES GSGSGTSFFF SYDYNDPKWR ELIRNPKFRQ ALSLAFNRAE LQKTQYFGTG ELTTGTFSPK AIEYNIDEKG NPDRANARYR EWRDSYVAFD QERAKKLLDD IGVKVGPNGF RTFPDGSPLE ILLVVAANTS KNTIAQNEQM IRDWGQIGIK ATLTPVPPQG RREDWFAGKL MSNADWGIGD GPNHLVYPQW LVPIEPERWA PLHGQGYQLR GTASEKEELD KDPWQRAPAR IVPTDKDFDP TIAKLHEIYD KTKVEPDFLK RTQMVWEMIK IHVENGPFVQ GSVANFDRVF IVKKGLMNVP RKEDLALGGF TDPWIHPTPA VYDPEAWYWD DPAKHSG
|
| |