Gene Information |
Locus tag | RoseRS_2808 |
Symbol | |
ID | 5209777 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 3499400 |
End bp | 3501583 |
Gene Length | 2184 bp |
Protein Length | 727 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640596407 |
Product | extracellular solute-binding protein |
Protein accession | YP_001277129 |
Protein GI | 148656924 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
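
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. The function names are hypothetical (they do not belong to any database API). The calculation assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the "Start bp" and "End bp" fields of this record.
length = gene_length_bp(3499400, 3501583)
print(length)                     # 2184, matching "Gene Length"
print(protein_length_aa(length))  # 727, matching "Protein Length"
```

Both results agree with the "Gene Length" and "Protein Length" fields, which is consistent with the record describing an unspliced CDS.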
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCAACG CACGGAAGCT CAACCGGCGC ACATTCCTGC GCCTGTCAGC CGTCACGGCG GCCAGCGCCG CTATCGCAGC CTGTGGCGGC CAGCCTGCCG CACCTCAACC GACCACTGCG CCTGCCGCGC CTCAACCGAC CACTGCGCCC GCCGCGCCCG CTCCGACCAC TCCTCCCGCA GCGACTGCCG TTCCTCCGGT GACAACCAAG TTCAAAGAAG CGCCGATGCT GGCGAAACTG GTGCAGGAGG GCAAGTTGCC GCCGGTGGAT GAGCGCCTGC CGAAGAACCC CTACACGCCG CCCCACTCCT GGCTTACCGT CGGCAAGTAT GGCGGTGTGC TGAAGAAGAC CTACAACAAC AACTGGGGTA TTACCGGTTT CATCCACGAG ATGCAGTATG GCTCGTCGCC GCTGCGCTGG CTCAAGGATG GACTGGCGAT CGGTCCCGGT TTTGTTGAAA GCTGGGAGTC GAATGCGGAC GCGAGCAAGT GGACGTTCAA GATCCGCGAG GGGATCAAGT GGAGCGATGG TCAGCCGTTC ACCACAAAGG ACATCATGTA CTGGTGGGAG TATACGGTTG GCGGTAACGG CAAGGAGAAG GAGTTCCCCG CCGGCCTCAA GCCGATCAAC GCCCCGCCGG ATGAAGCGCG ATCCGGCACC GGCACGCTGA TGACCCTCAA CGCACCGGAT GACTATACCT TCGAGATGGT CTTCGATGCA CCGGCGCCAC TCACGGCGGA CCGCCTGGCG ATGTGGGTCA ATATGTTCAT CGGTCCGGCA TGGGTGATGC CGCGCCACTA TATGGAACAG TTCAACCCGG TGCTCAACCC CGATAAGTAC AAGGACTGGG AAGAGCATCA GCGCAAGTTC AACCACAACA ACCCCGACTG CCCGCGTCTG ACCGGCTGGA AACTGGATGT GTTCGAGGAA GGCGTCCGCG CTGTCTGGTC GCGCAACCCC TACTATTGGG CAGTCGATAA GGAAGGCAAT CAGTTGCCGT ATATCGATCA GATCATTGTG ACGGCGGTCA AGGACAAGGA GATCGAGAAA CTGGCGTACA CGGAGGGACG CGCCGACCAC GCGCACTTCC ACGGTCAGGG TCTGGCAGAT GTCCAGTCGC TACGCGATGC CGAGTCCAAG AGCCAGCTCG AAGTTCGCTT CTGGGACTCT GGATCGGGCA CCGGTTCGCT CTACTTCTTC AACATGGACT TCAAAGACCC GAAGATGCGC GCGGTGTTCC GCGATCCGAA GTTCCGCCAG GCGCTGTCGC ACGCCTACAA CCGCGCCGAT GTGCAGAAGG CGGTCTACTT CGGGCTTGGC GAGTTGACCA CCGGCACCTT CAGCCCGAAG GCGATCGAGT ACAACATCAA CGATCAGGGC AAGCAGGTGT ACGCTGCATG GCGCGATAGC TACGTCAAGT ACGATCCGGC GCTGGCGGAG AAGATTCTGG ATGAAGCAGG CTACAAGAAG GGTCCCGACG GCAAGCGCAC CATGCCGGAT GGCAGCCCGC TGCAGATCCA GGTGACCTAT GGCGCCAACG CAACGCCTGG CGGCGAGCAC CTGTCGAAGA ACGAGCGTCT GGTGCGCGAC TGGCAGGCGA TCGGCATTGA TGCAGTGCTG ACGCCCATTC CAGGTGAGGG CGCCGATGAG AAATGGCGCG CTGGTGAAAT TCCGATGAAG ACAGAGTGGG AAGTTGGCGA CGGCCCGAAC CACCTGGTCT TCCCATCGTG GCTGGTGGCG GACGAGACCG AGCGCTGGGC GCCGCTGCAC GGGCGCGGGT ACACACTGCG CGGCACGGCG 
TCGGAGAAGG AAGAACTGGA CAAGAATCCG TGGGATCGCA ACCCGCCGCG TATCAATCCC ACCGATCCGG ATTATATGCC GGCGATTAAG AAACTCCACG ACCTGTTCGA CAAGAGCAAG GTGGAACCGG ATGCCATGAA GCGCCACCAG CTCGTGTGGG ACATGATCAA GGTCCACATC GAGGAGGGGC CGTTCTTCAC CGGAACGATC GCCAACCCGC CGCGCATTAT CCTGGTCAAG AAGGGCCTGA TGAACGTACC AACCCGCGAT GACCTGTTGA AGGAAGGGTT GGGTGGCTTC GTCAACCCGT GGATCATCCC GTCGCCGGCG GTCTATGACC CGGAGACCTG GTACTGGGAT AACCCCGACG CGCACCAGGC GTAG
|
Protein sequence | MSNARKLNRR TFLRLSAVTA ASAAIAACGG QPAAPQPTTA PAAPQPTTAP AAPAPTTPPA ATAVPPVTTK FKEAPMLAKL VQEGKLPPVD ERLPKNPYTP PHSWLTVGKY GGVLKKTYNN NWGITGFIHE MQYGSSPLRW LKDGLAIGPG FVESWESNAD ASKWTFKIRE GIKWSDGQPF TTKDIMYWWE YTVGGNGKEK EFPAGLKPIN APPDEARSGT GTLMTLNAPD DYTFEMVFDA PAPLTADRLA MWVNMFIGPA WVMPRHYMEQ FNPVLNPDKY KDWEEHQRKF NHNNPDCPRL TGWKLDVFEE GVRAVWSRNP YYWAVDKEGN QLPYIDQIIV TAVKDKEIEK LAYTEGRADH AHFHGQGLAD VQSLRDAESK SQLEVRFWDS GSGTGSLYFF NMDFKDPKMR AVFRDPKFRQ ALSHAYNRAD VQKAVYFGLG ELTTGTFSPK AIEYNINDQG KQVYAAWRDS YVKYDPALAE KILDEAGYKK GPDGKRTMPD GSPLQIQVTY GANATPGGEH LSKNERLVRD WQAIGIDAVL TPIPGEGADE KWRAGEIPMK TEWEVGDGPN HLVFPSWLVA DETERWAPLH GRGYTLRGTA SEKEELDKNP WDRNPPRINP TDPDYMPAIK KLHDLFDKSK VEPDAMKRHQ LVWDMIKVHI EEGPFFTGTI ANPPRIILVK KGLMNVPTRD DLLKEGLGGF VNPWIIPSPA VYDPETWYWD NPDAHQA
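
The gene and protein sequences above can be related with a short translation sketch. It is an illustration only, written without external libraries, and its helper names (`clean`, `gc_percent`, `translate`) are hypothetical. It assumes that the codon-to-amino-acid assignments of translation table 11 match the standard code, and that a GTG, TTG, or ATG codon in the first position is read as methionine; table 11 defines further initiation codons that this sketch does not handle. Only the first 30 bases and the last 12 bases of the gene sequence are used.

```python
from itertools import product

# Codon table in the compact NCBI layout: bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only the three most common bacterial start codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_30 = "GTGAGCAACG CACGGAAGCT CAACCGGCGC"  # start of the gene sequence
last_12 = "CACCAGGCGTAG"                       # end of the gene sequence

print(translate(first_30))  # MSNARKLNRR
print(translate(last_12))   # HQA
```

The two outputs match the first ten and last three residues of the protein sequence. The gene starts with GTG rather than ATG, which is why the first codon needs special handling. Passing the full gene sequence to `gc_percent` would give a figure to compare with the "GC content" field; the 30-base excerpt is too short to be representative.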