Gene Information |
Locus tag | Rcas_1695 |
Symbol | |
ID | 5539173 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 2187594 |
End bp | 2188829 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640893834 |
Product | extracellular solute-binding protein |
Protein accession | YP_001431805 |
Protein GI | 156741676 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | |
|
|
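The coordinates and lengths reported above are mutually consistent, which a little arithmetic confirms. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single untranslated stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2187594
end_bp = 2188829
reported_gene_length = 1236      # bp
reported_protein_length = 411    # aa

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted
# in the protein length.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, codons, protein_length)  # 1236 412 411
```

Under these assumptions the 1236 bp gene holds 412 codons, and dropping the stop codon gives the reported 411 aa.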
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0207768 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.000479364 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCACACGT CACCCAATGC TCGAACAATC GCACTTCTGT TGCTGACCCT CATTCTCGCC GGGTGCGGCG ATCTCGGCGG ATTGCTCGGC AACCAGCCCA CTCCTGCACC GATCATCCTC ATCGCAACCG CCACACCCGT ACCGCCGAGT CAGGCGACCC CGACGACAGA CATTGTTCCA TCGCCGACAG TTGCGCCACC CGACACTCCG GTGGCAACCT CCGTCCTACC GACTGCTGCG CCTCCCACAC CCACACCCGC ACCGCAAAAA ATCCTGGCGC GCGTCAAAGA GCGTGGCTAT CTGATCTGTG GAACGAACGC CGATCTGCCG GGGTTCGGCT TCTACGACAA CGTGCGCCAG GCGTGGAGCG GCTTCGATGT CGATTTCTGC CGCGCCGTCG CTGCTGCCAT CTTCGGTGAT GCCACAAAAG TGGAGTTCGT CGCGCTCGGC ACCGGACCAG GACCCAACAA CCGGTTCGAT GCTGTGCGTG AAGGGCGCGT CGATGTCCTG TTCCGTAACA CGACATGGAC ACTTGGACGC AACATCAGCG GGTTGGCATT CGGTCCCACC ACCTTCCACG ATGGTCAGAC CTTCATGGTG CGCATCAGGG ACCGGATCAC CAAACTGGAA GACCTCGCAG GCAAAGTCAT CTGTGTGGCG AAAGGCACCA CCAGCGAGCA AAACCTGAAC GACGACTTTG CCGCGCGCGG CATTCAGTTC ACTGCCCGCG TCCTCAATGG CGAAGACGAA CTCTACCCGG CGTATGATGA AGGGGAGTGC GACGCAGTGA CCAGCGACAG TTCACAACTG GCAGCCAAAC GTCAGCAACT CAAGAATCCC GCCGACCACA TCATCCTCGG CGACCGCATC TCACGCGAGC CGCTCGGTCC GGTCATCGCC CGCGACGACA ATCAGTGGCT CGACGTGATC AGCTGGACGG TCTTTGCCAC GATTTACGCC GAAGAACTGC GTGTTGATCA GCGCAATGTC GATCGTCTGC GCGCCAGCAC GACCGATCCG CGTATCAAAC GGCTGCTAGG GCTGGAAGGA AACTTCGGCG AGGGATTGGG GTTGCCGAAC GACTTCGCCT ATCAGATTAT CAAGCAGGTC GGCAACTACG GCGACATCTA CAACCGTAAC CTGGGACCGA ACACCGTCAT CAATCTGGAT CGCGGTCCGA ACAAAGTCTG GAACCTTGGC GCTGGCGGCG TGCTTGCCTC CCCGCCGTTT CGTTGA
|
Protein sequence | MHTSPNARTI ALLLLTLILA GCGDLGGLLG NQPTPAPIIL IATATPVPPS QATPTTDIVP SPTVAPPDTP VATSVLPTAA PPTPTPAPQK ILARVKERGY LICGTNADLP GFGFYDNVRQ AWSGFDVDFC RAVAAAIFGD ATKVEFVALG TGPGPNNRFD AVREGRVDVL FRNTTWTLGR NISGLAFGPT TFHDGQTFMV RIRDRITKLE DLAGKVICVA KGTTSEQNLN DDFAARGIQF TARVLNGEDE LYPAYDEGEC DAVTSDSSQL AAKRQQLKNP ADHIILGDRI SREPLGPVIA RDDNQWLDVI SWTVFATIYA EELRVDQRNV DRLRASTTDP RIKRLLGLEG NFGEGLGLPN DFAYQIIKQV GNYGDIYNRN LGPNTVINLD RGPNKVWNLG AGGVLASPPF R
|
| |
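The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method. It assumes the gene sequence is listed 5' to 3' on the coding strand (it begins with ATG and ends with TGA), and it uses the standard amino acid assignments, which translation table 11 shares with table 1. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical. Only the first 30 and last 9 bases of the record are used here to keep the example short.

```python
# Build a codon table from the conventional TCAG ordering.
# Translation table 11 uses the same amino acid assignments as the
# standard code; it differs only in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons in frame 0; '*' marks a stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 and last 9 bases, copied from the Gene sequence row.
first_30 = "ATGCACACGT CACCCAATGC TCGAACAATC"
last_9 = "TTT CGTTGA"

print(translate(first_30))  # MHTSPNARTI
print(translate(last_9))    # FR*
print(gc_fraction(first_30))
```

The first ten residues match the start of the protein sequence (MHTSPNARTI) and the final codons give F, R, and a stop, matching its end. Passing the full gene sequence to `gc_fraction` would allow the reported GC content to be checked in the same way.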