Gene Information |
Locus tag | Rcas_0906 |
Symbol | |
ID | 5538372 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 1185887 |
End bp | 1187866 |
Gene Length | 1980 bp |
Protein Length | 659 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640893056 |
Product | extracellular solute-binding protein |
Protein accession | YP_001431039 |
Protein GI | 156740910 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
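The length fields above are consistent with the coordinates: with inclusive positions, End bp minus Start bp plus one gives the gene length, and dividing by three and dropping the terminal stop codon gives the protein length. The Python sketch below reproduces that check. The function names are illustrative and not part of the database. It assumes inclusive coordinates and an unspliced CDS that ends in a stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Residue count of an unspliced CDS; the stop codon encodes no residue."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of three")
    return codons - 1 if has_stop_codon else codons


if __name__ == "__main__":
    length = gene_length_bp(1185887, 1187866)   # values from the record above
    print(length)                               # 1980
    print(protein_length_aa(length))            # 659
```

Both results match the Gene Length and Protein Length fields of this record.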
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGAGACGGA CACGACTGCT TCTGACGGGG CTGCTGCTCG TGGTCAGCCT GATCATCGCC GCCTGTGGGG GAGGACCGGC GGCGCCAGCA GCAACACCAG CCGGTTCAGC GACGACGCCA GCCGCGCCGG CGCCGGCGGG CGGAACAGAC GCGACACTGA CGGAGTTTGG CGATCTGCCG CGCCACGAGA CGTTGATCGT CGATATTCTC ACCGGGCGGG TCGGCTCGCC GGACGACTTC AACAACTGGG TGGGCTGGAA GTGGCGTGAT CGCGGGATGC AGAACCTGGC GAACGAGCCG CTCTGGTCGG TTGATTTCGC CACCGGCAAG ATTATTCCGG GTCTGGCAGA GGGCGATCCG GTTTACAACG CAGATTTTAC CGCCCTCACG ATTCCGCTGC GCAAGGGGGT GACGTGGCAC GACGGACAGC CGTTCACAGC GGCAGACGTG GTCTTCACCG TCGAAACGCT GATGAAACAC GAGGGGTTCG GCGACAACAG TTTTTTCGTG GAGAATGTGA AATCGGTCTC CGCCGTTGAT GATCATACCG TCGCCTTCGA GTTGAACGCG CCGAACTCGC GCTTCCACAC CCGCTTCCTG GATCGCTGGG GCTGCACCTG GATTATGCCC AAGCATATCT GGGAGTCGGT GGAAGACCCG GTCACGTTTA AGTTTAACCC CTTCATCGGC ACCGGTCCGT ACAAACTGCA CAGTTTCGAC CCATCCGGGT TCTGGACGAT TTGGGAGAAA CGGGCTGACT GGGACAAGAG TCCGACCGGC ATGATGTACG GTGAACCAAA GCCCAAATAT GTCATCTTCC GGTACTTCGC CAACGAAGGC GCTAAGATTC TGGCGCAGTT AACCCATCAG GTCGATGTTG TCAACGTGTC GTCCGACGGG CTTAAGGCCG TGCTGACCCA GTGCGACTCC TGCCGCGCCT ATCAGTTGAA CTGGCCCTAC GTCGTCAACA ATGATCCGGC GCAGACCGGC ATCACGTTCA ACACCGCCAG GGCGCCGTAT GACAACCGCG ATGTGCGATG GGCGCTGCTG CTGGCAATTG ATATTGCCGA ATACATGGGC ATTGCCGTCG ATGGCACCGG CGCGCTCAGC CCGGTTCACA TTCCGTCGCT GTCGAACTAT CCCAAAGACT TCATCCAACC GATGCTGCCC TGGCTGGAAG AGTTCACCCT CGATCTCGGC AATGGCGAAA CCTTCAAGCC CTTCGACCGA AACGCCTCGC AGCGCATCGC CGAGTATGCT CGCTCGCGCG GCTACGCTGT GCCCGACGAT CCCGCCGAGC AGGCAAAACT GTTTGGCTAT GGCTGGTACA AGTATGCGCC CGATGTCGCC GAAAAGCTGC TGGTCAAGAA CGGCTTCACC AAGACGTCCG ACGGCAAATG GCTCTTGCCG GACGGCACGC CCTGGAAGAT TCGCTGCCTG ACCGGCACGC AACTGGCGAC CGGCATGGGT GAACGCAACT GCGTCGCCGC TGTGCAGCAG TGGAAGAGAT TCGGCATCGA CGCCGAGGTG TATTCCTCGG AAGCGGCAGC AAGCCTGAAT GCAACCGGCG ATTTCGACGT TTCCAGCAAC TGGCCCGCGC AGGAACCCTG GGGCGCCGGA CCAGACCTCT ACCGTGTGCT CGACTACTAC AACTCGGCGT ATGTGAAACC GGTCGGCGAG AATACCAGCG GTCACCCGTC GCGCTGGTCG AGTCCGGAGA TGGATGCGAC GATCGAGAAA TTGCGCCAGA CCGATCCCAC CAATTATCAG GCGGTCGTTG ATGTCGGCAT CGAAGGCTTG 
AAGATTGCCG TGCGTGAAAT GCCCGGCATT CCGACCTATG GCTATGTCGG GTTCATTGCA TGGGATCAGA CCTACTGGAC CAACTGGCCC GGCGCTGAGA ATCCCTACAC GCAACCGTAT ACGCACTGGG GTCCGTTCAA ATATATGACG CCGTTCCTTC AGCCAACCGG GACGCGGTAA
|
Protein sequence | MRRTRLLLTG LLLVVSLIIA ACGGGPAAPA ATPAGSATTP AAPAPAGGTD ATLTEFGDLP RHETLIVDIL TGRVGSPDDF NNWVGWKWRD RGMQNLANEP LWSVDFATGK IIPGLAEGDP VYNADFTALT IPLRKGVTWH DGQPFTAADV VFTVETLMKH EGFGDNSFFV ENVKSVSAVD DHTVAFELNA PNSRFHTRFL DRWGCTWIMP KHIWESVEDP VTFKFNPFIG TGPYKLHSFD PSGFWTIWEK RADWDKSPTG MMYGEPKPKY VIFRYFANEG AKILAQLTHQ VDVVNVSSDG LKAVLTQCDS CRAYQLNWPY VVNNDPAQTG ITFNTARAPY DNRDVRWALL LAIDIAEYMG IAVDGTGALS PVHIPSLSNY PKDFIQPMLP WLEEFTLDLG NGETFKPFDR NASQRIAEYA RSRGYAVPDD PAEQAKLFGY GWYKYAPDVA EKLLVKNGFT KTSDGKWLLP DGTPWKIRCL TGTQLATGMG ERNCVAAVQQ WKRFGIDAEV YSSEAAASLN ATGDFDVSSN WPAQEPWGAG PDLYRVLDYY NSAYVKPVGE NTSGHPSRWS SPEMDATIEK LRQTDPTNYQ AVVDVGIEGL KIAVREMPGI PTYGYVGFIA WDQTYWTNWP GAENPYTQPY THWGPFKYMT PFLQPTGTR
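The protein sequence can be re-derived from the gene sequence, which is listed in coding orientation as space-separated blocks of ten bases. The sketch below illustrates this under stated assumptions and is not the database's own pipeline. The helper names are hypothetical. The codon table is built from the standard NCBI amino-acid string, on the basis that translation table 11 uses the same codon assignments and differs only in the permitted start codons. The first codon is read as M. Only the first three blocks of the record are used as input.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G) paired with the standard amino-acid string.
# Assumption: table 11 shares these assignments; only start codons differ.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a coding sequence up to the first stop codon.

    The first codon is emitted as M (initiator), even when it is GTG or TTG.
    Ambiguous bases such as N are not handled and raise KeyError.
    """
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    prefix = "GTGAGACGGA CACGACTGCT TCTGACGGGG"  # first 30 bases of the record
    print(translate_cds(prefix))                 # MRRTRLLLTG
    print(f"{gc_fraction(prefix):.2f}")          # 0.63 for this prefix only
```

The translated prefix matches the first ten residues of the listed protein. Passing the full gene sequence to `gc_fraction` would be the way to check the 60% GC content field.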
|