Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_2119 |
Symbol | |
ID | 5539599 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 2722187 |
End bp | 2723335 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640894253 |
Product | extracellular solute-binding protein |
Protein accession | YP_001432222 |
Protein GI | 156742093 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGTTC GCGTTGCGCT CATTGGCGGA CCAATGTACA ACGCACTCTA TACTCGCCTG GATCAGTTCA GTCAGCAGAG CGGCGTTCAG GTGGAAGTCG CTTTTGTCGG CGATCATCCC GCCCTTAACA CGTTCCTGGC GACAGATGCT GCGGCAGACT GTCATGTGGT ATCAACTCAC ACCAAGTACG CCCCATCGCA GCAACGTCTC CTGGCGCCCC TCGACGAACT TTTGACGCCT GCTGAGTGGA GCGACTTTAT GCCTTCACTC CTCGAATTGG CGCGCATTGA TGGTCGGCTC TACGGCATTC CTCGCAACAT CGATGTGCGC CTGCTGCACT ATCGCACCGA TCTGATTGAC CAACCGCCCA CCACGTGGGA CGAGCTGCTT GACCTGGCGC GCAGGGTCAA CCATCCGCCT GAATGGTATG GCTTTCTCTT TCCCGGCACA GAGTCAGGAC TCTTTGGCAC GTTCTACGAA CTGGTCGAGA GCGCGAATGC CAGGCTGTTT TCCCCTGATC TGACACCGAA TATCGAGAAT GACGGCGGAC GCTGGGCGCT AGGGTTTTTG CGCACCTGTT ATGCGGAAGG ACTGGTTCCG CCGGAGATTG TCACCTGGCA CTATGATGAG GTGCATCTCT GGTTCCGCGC TGGACGTGCC GCGATGGTAG GAGATTGGCC CGGCTATTAT GCCGATTATT GCGCCACCGA CTCGCAAGTG CGTGAACGCT TTGCGCTTGC ACTCTATCCT GCCGGACCGT CCGGGGGGGT GCGTGTGTAT GGCGGCAGCC ATACTTTTGC TCTGACCCAC CGCGGGGTGG AGCAGACCGA TGCTGTCGCG CTACTGCGCT TCCTCACCGC GCCCGAGCAG CAATTGCTGG AGGCGAAACA GGGTTCGACG CCGGTGCGCC ATTCCGTTAT GCAGCGGATC GAGCAGCATG CCACACCACA CGAGCGCCAG CGTTGGGCGA CCCTCGCCGC CGCTATTGAA CGGGTGGTCA TTCCCCCCAA ATTTGAGCGG TATCCGCTGG TTGAGCAGGC GCTCTGGACA ACTGTCCAGC AGGCGATGGT CGGCGCCATA GCAATTGACG AGGCCTTGCA TCGGTTGACA GACCGGATTA CCAGAATTGT GGCAGGCAAT GATGGGTGA
|
Protein sequence | MTVRVALIGG PMYNALYTRL DQFSQQSGVQ VEVAFVGDHP ALNTFLATDA AADCHVVSTH TKYAPSQQRL LAPLDELLTP AEWSDFMPSL LELARIDGRL YGIPRNIDVR LLHYRTDLID QPPTTWDELL DLARRVNHPP EWYGFLFPGT ESGLFGTFYE LVESANARLF SPDLTPNIEN DGGRWALGFL RTCYAEGLVP PEIVTWHYDE VHLWFRAGRA AMVGDWPGYY ADYCATDSQV RERFALALYP AGPSGGVRVY GGSHTFALTH RGVEQTDAVA LLRFLTAPEQ QLLEAKQGST PVRHSVMQRI EQHATPHERQ RWATLAAAIE RVVIPPKFER YPLVEQALWT TVQQAMVGAI AIDEALHRLT DRITRIVAGN DG
|
| |