Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_2454 |
Symbol | |
ID | 5539935 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 3152757 |
End bp | 3154805 |
Gene Length | 2049 bp |
Protein Length | 682 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640894584 |
Product | extracellular solute-binding protein |
Protein accession | YP_001432552 |
Protein GI | 156742423 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.39331 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAAAA ACGCCAGGCG ATCAATCAGT CGCCGCAATT TTCTGCGCCT GTCGGCTGTC CTCGGCGCAA GCGCCGGCCT TGCCGCCTGC GGCGGCGCTC CGACCGCTCC CGCTCCCACT GCCGCTCCAT CCGCTCCAAC TGCGCCACCG GCGCCAACGG TCGCTCCTGT TGCCACTGCT GCTCCGCCAC AGGCAGCAAC CGCCGTGCCT GTAGCGGTGT CGCAGTTCAA CGAAGCGCCC GTCCTGGCGG AACTGGTCCA ACAGGGCAAA CTGCCGCCGG TCGATCAGCG CCTGCCCAAG AATCCAGTGG TGCTGACCGG TCTCGATGGC GTTGGCAAAT ATGGCGGAAC CATCCGTCGC GGCTTCCGCG GCTTTTCGGA CCGCTGGGGA CCGACCAAGA TTCAGAACGA GGGGCTGACC TGGTATAACC CTGACTTAAG CCTGCGCGCG AACATCGTCG AGTCGTGGGA GGTCAGCAGC GATGCGCGGC AGTGGACGTT CAAGCTACGC GAGGGGATGA AGTGGTCCGA TGGTTCGGAC TTCACGACCG ACGACTGGAA ATGGTGGTAC GAGAACGTCC TGCTCAACGA TCAGATCACG ACGGCGCTTC CCGGTGTTTG GTCAACCGGC TCACCGCGCG TGGCGATGAA AGCGGAGTTT CCCGATCGCT ACACGGCGGT GTTCACGTTC CAGGACCCCA AGCCACTCTT CGCCTACAAC GTCACCCGCG AGCAACCGTT CGTTCATGCC GGTTTCATGA AGCAGTTCCA CGAAGGCTTC GCCGATAAAG CCAAACTTGA GGCGGATGCG AAGGCGGCAG GGTTCGAGAC GTGGGTGCAA CTGTTCAACG ACAAAAATTG GTGGTGGACG GTCGGGCGTC CATCGCTCGG TCCGTGGGTA GCCGTCAACA CCATGACCGA GCAGTTGTTC ATTATGGAGC GCAACCCTTA CTTCTGGCAG GTGGATGAAG ATGGCAATCA GTTGCCGTAC ATCGACAAGA TCACCCACCT GCTGTTCGAG ACGCCCGATG CCTTCAACAC GCGCATCATT GCGGGCGAGG TCGATTTTCA GGCGCGCCAC GTGAGCATTG GCGACTTCAC ACTCCTCAAA GAGAATGAGA GTCGCGGCGA TTATCAGGTA GTGCTGGGCG TCAGCGCCAA CCATATCGCT TTCCAGCCGA ATCACACGGC GAAGAACCCG AAGATCCGCG AGTTCTTCCA GAACCGCGAC GTGCGCATTG CACTCTCATT GGCGATCAAC CGCGACGAAA TCAACGAACT GGTCTTCAAT GGCACCGCTG TGCCGCGCCA GTACAGCCCG TTGAGCATGT CGCCGCAGTA CTACGAGAAA CTGACGAAAG CGTATATCGA GTACGATCCG GCGCGCGCCA ATCAACTGCT CGATCAGGCC GGCTACGACA AGAAGGATAC TCAGGGCTTC CGCCTCTGGA AGGATGGCAG CGGTCCGATC AGTTTCATCA TTGAGGGCAC GGCGCAGGCG GGTTCACCCG ATGAAGACGC CGTCCAGACG ATGATCAAGT ATTTTGCCGA TGTCGGCATT AAGGCGACGT ACAAAGCGCA GGAACGTTCG CTGTACACCG AACGCTATCA GGCGAACGAG GTCGAGGCGG CGTTCTGGGG TGGTGACCGG ACGCTGTTAC CCATCGTGGC GCCCTGGATT TTCCTGGGGA GCATGATCGA CCGTCCCTGG GCTGCCGCGT GGGGAATCTA CTACAACAAC CCGAATGACC CCAACGCTGA ACAACCGCCG GAGGGTCATT TCATCACCAA AATCTGGGAT ATTCAGAAGC AGATCGATGT CGAGCCGGAC GAGGAGAAGC GCAATGCGCT CTTCCGTCAG ATTCTCGACA TCTGGGCGGA GGAGTTGCCG ATGATCGGCA TCCTCGGTGA ACTGCCATCA CCGACGATTG TGAAGAACGG CTTCAAAGGC TTCCGCGCCG GATTCCCCAA CGACGATACA ACCGAGGACG AGAACCTGCT CAACACGCAG ACCTACTACT GGGATGATCC GGCAAAACAC ACCAATTAG
|
Protein sequence | MNKNARRSIS RRNFLRLSAV LGASAGLAAC GGAPTAPAPT AAPSAPTAPP APTVAPVATA APPQAATAVP VAVSQFNEAP VLAELVQQGK LPPVDQRLPK NPVVLTGLDG VGKYGGTIRR GFRGFSDRWG PTKIQNEGLT WYNPDLSLRA NIVESWEVSS DARQWTFKLR EGMKWSDGSD FTTDDWKWWY ENVLLNDQIT TALPGVWSTG SPRVAMKAEF PDRYTAVFTF QDPKPLFAYN VTREQPFVHA GFMKQFHEGF ADKAKLEADA KAAGFETWVQ LFNDKNWWWT VGRPSLGPWV AVNTMTEQLF IMERNPYFWQ VDEDGNQLPY IDKITHLLFE TPDAFNTRII AGEVDFQARH VSIGDFTLLK ENESRGDYQV VLGVSANHIA FQPNHTAKNP KIREFFQNRD VRIALSLAIN RDEINELVFN GTAVPRQYSP LSMSPQYYEK LTKAYIEYDP ARANQLLDQA GYDKKDTQGF RLWKDGSGPI SFIIEGTAQA GSPDEDAVQT MIKYFADVGI KATYKAQERS LYTERYQANE VEAAFWGGDR TLLPIVAPWI FLGSMIDRPW AAAWGIYYNN PNDPNAEQPP EGHFITKIWD IQKQIDVEPD EEKRNALFRQ ILDIWAEELP MIGILGELPS PTIVKNGFKG FRAGFPNDDT TEDENLLNTQ TYYWDDPAKH TN
|
| |