Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_2184 |
Symbol | |
ID | 5539665 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 2805618 |
End bp | 2806505 |
Gene Length | 888 bp |
Protein Length | 295 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640894318 |
Product | extracellular solute-binding protein |
Protein accession | YP_001432286 |
Protein GI | 156742157 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0092646 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAGAAAC AAATGCTACT TATCGTGACG CTGATCGCCG CCATCCTGGC GATTAGCGCC TGCGGCGGAG CGCCTGCCGC GCAGCCGACC CAACCCGCCG CGCAGCCGAC CCAACCCGCC GCGCAGCCGA CCCAACCCGC CGCGCAGCCG ACCCAACCCG CCACGCAACC AGAGGGGAAA CTGGCGCAGA TCCGGGCAGC CGGCAAACTC ATCGTCGGCA CGTCGGCAGA CTACCCGCCC TACGAGTCGA TCGACGCCAA TGGCAACTTC GTCGGTTTCG ACATGGACCT CATCCGCGCT GTCGGCGAAA AACTCGGCGT CCAGGTCGAG ATTCGCGATA TGCCGTTCGA CTCGTTGATC GCATCGCTCC AGGAAGGCAA GATCGATGCC GTGATCGCCG CCATGCAGGC GACCGCCGAG CGTGAAGAGA AGGTCGATTT CACCATTCCT TACCGCATGA CGAAAGATGC ATTTATCGGC GCCGGCGATA CAACAATTGT CATGAGCAAA CCGGAGGATG CGGCGGGAAT GACCATCGGC GCACAGACCG GTACGGTTCA GGAAGGGTGG ATTCAGAAGA ACCTGGTGGC CACCGGATTA ACGCCTGCCG ATAAGGTCTT CAGCTATGAG CGCGCCGATC AGGCAGCGCT CGACCTTGCC AGTGGACGGC TCCAACTGGT GCTGATGGAC GCCGAACCCG CGTTGGAACT TGCCCAAAAG AATAATCTGA AGGTGCTGCT CGTCACTGAG ACAACCGCCG AGGGCGGCAA GAGCATCGCC ATCCCTGAAG GCGCCGGTGA CCTCAAGGCG GAACTGGATC GGATCATTCA GGGATTGATC GATGACGGCA CTGTGAAAGC GCTCGAAGAA AAGCACGGAC TGCCATAA
|
Protein sequence | MKKQMLLIVT LIAAILAISA CGGAPAAQPT QPAAQPTQPA AQPTQPAAQP TQPATQPEGK LAQIRAAGKL IVGTSADYPP YESIDANGNF VGFDMDLIRA VGEKLGVQVE IRDMPFDSLI ASLQEGKIDA VIAAMQATAE REEKVDFTIP YRMTKDAFIG AGDTTIVMSK PEDAAGMTIG AQTGTVQEGW IQKNLVATGL TPADKVFSYE RADQAALDLA SGRLQLVLMD AEPALELAQK NNLKVLLVTE TTAEGGKSIA IPEGAGDLKA ELDRIIQGLI DDGTVKALEE KHGLP
|
| |