Gene Information |
Locus tag | Rcas_2110 |
Symbol | |
ID | 5539590 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 2710683 |
End bp | 2712890 |
Gene Length | 2208 bp |
Protein Length | 735 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640894244 |
Product | extracellular solute-binding protein |
Protein accession | YP_001432213 |
Protein GI | 156742084 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
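The length fields above can be cross-checked from the coordinates. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the stop codon is counted in the gene length but not in the protein length; the variable names are illustrative and are not part of the record.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
start_bp = 2710683
end_bp = 2712890

# Inclusive span on the replicon.
gene_length = end_bp - start_bp + 1

# A non-spliced CDS is a whole number of codons; the final (stop) codon
# is assumed not to contribute a residue to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the Gene Length (2208 bp) and Protein Length (735 aa) rows.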
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.233193 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCTGTGT CCAAGTCACG CATCCTGAGT CGTCGCAGAT TTCTGACACT TTCAGCCATG ACCGCTGCCA GCGCCGCGAT TGCTGCTTGC GGCGGCGGGC AACCCGCGCA AGCGCCAACT ACGGCGCCGG CGCCAACGGC AGCGCCAACC ACGGCGCAAG CGCCAACCGT CGCTATTCCG CCAACTACGC CGCCGCAGGC GACCGCTGTT CCCCAGACGG TCAGCAAGTA TAAGGAGGCG CCAGAACTGG CAAAGCTGGT TGCCGAAGGC AAATTGCCGC CCGTTGACGA GCGACTGCCG AAAAACCCGT ATGTCGTACC GCACAAGTGG CTCACCGTCG GCAAGTATGG CGGAACGATC AACTTTACCA ACTCTTGGGG ACCCGACGGT ATGGCGACGA TTGTGCAGGA GAGTCAGTAT GGTCACTCAA TCCTGCGCTG GCTCGATGAT GGCCTGAAGA TCGGTCCGGG TCTTGCCGAG AGTTGGGAAG CGAACGCCGA TGCCAGCGAG TGGACGTTCA AGTTCCGCGA AGGGCTGAAA TGGTCGGATG GGCATCCCTG GACAGTTGAT GACATTCTCT ACTGGTGGGA GTACATGGTC GGCGGCAACG GCAAGGAGAA GGAGTTCCCC GAAGGGCTGA AACCGATCGA GCCGCCGCCG GACGAAGGGC GCTCCGGTAC GGGAACGCTG GCGACGCTGA TCAAAGTCGA TGACTATACG CTGACGATGA AGTTCGACGC CCCCGCACCG CTGACGGCGG ATCGTCTGGC GATGTGGGTG AATGCCGGCA TTGGGCCGCG CTGGATGGCG CCGCGTCACC ATATGGAGCA GTTTAACCCC GTCCTCAATC CCGATAAGTA CAAAGACTGG GATGAGCACA CCAAGCGCAT TCGCTTCGTC ACCAACCCCG ACTGCCCGAC GATGACCGGT TGGAAGTGTG AGAGTTTCGA GGAGGGTGTG CGCGGTGTCT ATACGCGCAA TCCGTACTAC TGGTGCGTGG ACGCTGAGGG GAATCAGTTG CCCTATATCG ACCGCATTAT CTCGACCACC TTCCAGAACA GCGAGGTCGA AAAACTGAAC GTGATCCAGG GCAAGAGCGA CTTTACCCAT CACTGGGTGC TCAGCCTCGA TGATGTGCCC AACCTGCGCC AGAATGCCGA TGCTGGCGGG TATGAGGTGC GCTTCTGGGA GAGCGGGTCG GGTTCGGGCA CGTCGTTCTT CTTCAGTTAC GACTACAACG ATCCAAAGTG GCGCGAACTG ATCCGCAACC CGAAGTTCCG TCAGGCGTTG TCGCTGGCGT TCAACCGCGC CGAATTGCAG AAGACGCAAT ACTTCGGCAC CGGTGAGTTG ACGACCGGCA CCTTCAGCCC GAAAGCGATC GAGTACAATA TCGACGAGAA AGGCAATCCG GATCGGGCCA ATGCACGCTA CCGCGAATGG CGCGATAGTT ATGTGGCGTT CGATCAGGAG CGGGCGAAGG CGTTGCTCGA CGAAATTGGC GTCAAAGTTG GCGCTGACGG GTTCCGCACC TTCCCGGACG GCTCGCCGCT CGATATTCAG GTGATCGTCG CATCGAACAC GAGTAAGAAC ACTATCGCTC AAAACGAGCA GATGATCCGC GACTGGGGGC AGATCGGCAT CAAGGCGACG CTTACTCCGG TGCCGCCGCA GGGTCGCCGC GATGACTGGT TCGCGGGTAA GTTGATGTCG AATGCCGATT GGGGTATTGG CGACGGACCG AACCATCTGG TCTACCCGCA GTGGTTGGTG CCAATCGAAC CGGAACGCTG GGCGCCCCTC 
CACGGTCAGG GGTATCAGTT GCGCGGCACT GCCAGTGAAA AAGAGGAACT GGACAAAGAT CCCTGGCAGC GCGCGCCGGC TCGTATTGTG CCGTCCGACA AGGACTTCGA TCCGGTTATT GCCAAACTGC ATGAAATCTA CGACAAGACG AAGGTCGAGC CGGAGTTTCT CAAGCGCACC CAGATGGTCT GGGAAATGAT GAAGATCCAT GTCGAGAACG GACCGTTCGT ACAGGGGTCG GTCGCCAACT TCGACCGCGT GTTCATCGTG AAGAAGGGGT TGATGAACGT GCCGAAGAAG GAAGACCTGG CGCTTGGCGG ATTTACCGAT CCCTGGATCC ATCCCACGCC GGCGGTCTAT GACCCTGAAG CCTGGTATTG GGATGATCCG TCAAAGCATA CAAGTTGA
|
Protein sequence | MPVSKSRILS RRRFLTLSAM TAASAAIAAC GGGQPAQAPT TAPAPTAAPT TAQAPTVAIP PTTPPQATAV PQTVSKYKEA PELAKLVAEG KLPPVDERLP KNPYVVPHKW LTVGKYGGTI NFTNSWGPDG MATIVQESQY GHSILRWLDD GLKIGPGLAE SWEANADASE WTFKFREGLK WSDGHPWTVD DILYWWEYMV GGNGKEKEFP EGLKPIEPPP DEGRSGTGTL ATLIKVDDYT LTMKFDAPAP LTADRLAMWV NAGIGPRWMA PRHHMEQFNP VLNPDKYKDW DEHTKRIRFV TNPDCPTMTG WKCESFEEGV RGVYTRNPYY WCVDAEGNQL PYIDRIISTT FQNSEVEKLN VIQGKSDFTH HWVLSLDDVP NLRQNADAGG YEVRFWESGS GSGTSFFFSY DYNDPKWREL IRNPKFRQAL SLAFNRAELQ KTQYFGTGEL TTGTFSPKAI EYNIDEKGNP DRANARYREW RDSYVAFDQE RAKALLDEIG VKVGADGFRT FPDGSPLDIQ VIVASNTSKN TIAQNEQMIR DWGQIGIKAT LTPVPPQGRR DDWFAGKLMS NADWGIGDGP NHLVYPQWLV PIEPERWAPL HGQGYQLRGT ASEKEELDKD PWQRAPARIV PSDKDFDPVI AKLHEIYDKT KVEPEFLKRT QMVWEMMKIH VENGPFVQGS VANFDRVFIV KKGLMNVPKK EDLALGGFTD PWIHPTPAVY DPEAWYWDDP SKHTS
|
| |
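The relationship between the gene sequence and the protein sequence can be sketched as follows. This is a minimal illustration, not the tool that produced the record: it assumes the codon-to-amino-acid assignments of translation table 11 match the standard code, and that the first codon (GTG here, an alternative start codon in table 11) is read as methionine. The names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino acid
# string; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate_cds(dna):
    """Translate a CDS given as text, ignoring the spaces between blocks."""
    seq = "".join(dna.split()).upper()
    # Assumption: the initiator codon is always translated as Met,
    # whatever the codon itself would normally encode.
    protein = ["M"]
    for i in range(3, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in the record.
head = "GTGCCTGTGT CCAAGTCACG CATCCTGAGT"
print(translate_cds(head))
```

Applied to the first 30 bases, the sketch reproduces the first ten residues of the protein sequence listed above (MPVSKSRILS).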