Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_0519 |
Symbol | |
ID | 5537982 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 676418 |
End bp | 678277 |
Gene Length | 1860 bp |
Protein Length | 619 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640892681 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001430667 |
Protein GI | 156740538 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3525] N-acetyl-beta-hexosaminidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAATCGC TCTCGCTGCT GCCAATGCCG CAGTCGCTGA CCATGCTGCC GGGGAGGTAT GTGGCGACTT CGGCGCAGCG CATTCTGCTC CAGGGAACGG CGCCGGGCGA TCTGTGGTTC ACGGCGCGCC GCCTGCAAAA CGCGCTGCGC GCCTATGCCG GCGTCGAGTG GGAATTGACC GCAACGCCGG AAGGACCGCC AGGTGAGATC GGCGCCACGC TCCGGGTGGC TGCCGATAGC GTCATCCATC CGCAGGGGTA CGCGCTCACG ATCAAAGATG GCGGCATCAT TATTGAAGCG CCAGAACCCG CCGGCATATT CTATGGGGTC TGCACGCTGA TCCAGATTAT CGAGCAGAGC GGACGCTCTC TACCTTGCCT GCACATCAAC GACCACCCCG ATTTTGCCGC GCGCGGAGTC ATGATCGATA TCAGCCGCGA CAAGGTTCCA ACAATGGAGA CCCTGTTGAT GCTGACCGAT ATGCTGGCGG GGTGGAAGAT CAATCAGGTG CAGTTATACA CCGAGCATAC CTTCGCCTAT CGGAATCACC CCGACGTATG GGCGCGCGCG TCGCCGCTGA CCGGTGAAGA AGTCCTGGCG CTCGATGCAT ACTGCAAAGA ACGTTTCATT GAACTTGTGC CCAATCAGAA CTCGTTTGGG CACATGCACC GCTGGCTCAT CCATCCGCGC TATGCGGCGC TTGCCGAGAT CCACGGCGAA TTCCAGGCGC CGTGGGGAGT GATGCGGGGT CCGTTCAGTC TGGCGCCGGA AGACCCGGAC AGTCTCAAAC TGGTGCGCAG CCTGTATGAT GAATTGCTGC CGCACTTTTC GAGCCGGTTG TTCAACGTCG GTTGCGATGA GACGGTCGAT CTTGGGCAGG GGCGCAGTAG AGACGCCTGT GCGCAACGCG GCGTCGGGCG GGTGTACCTC GATTTCCTGC TTAAGCTCTA CGATGCTGTG AAACAACACG GGCGCACGAT GATGTTCTGG GGCGACATTG TCAACAATCA CCCGGAACTG ATCGACGAGT TGCCACGCGA TCTGATTGCG CTCGAATGGG GATACGAAGC CGACCATCCG TTCGACCGTC ATTGCGCTCG CTATGCCGCC GCAGGCATTC CGTTCTATGT CTGTCCTGGC ACATCGTCGT GGCAGAGTAT CGCCGGACGC ACCGACAATA CCCTCGGCAA TCTGCGGAAT GCTGCCGAGA ATGGATTGAA ACACGGCGCT ATCGGCTACC TGATCACCGA CTGGGGCGAT ATGGGGCACT GGCAGGTGTT GCCGATCAGT TTCCTGGGAT TTGCCGTCGG CGCAGCGTTC GCGTGGGCAT ACGCCGCCAA CCGCGATATG AATGTGCCGG CGGCTGTCAG CCGCCATGCG TTTACCGACC CGACCGGCGC GATGGGGCAG GTGGCATACG ACCTGGGGAA TGTGTACCGC GCCGTTGGGT ATGAGCCGCC CAATTCGTCG GTGCTGTTTT GGGTGTTGCA AGCGCCGGAC AGCGACGCGC GCGATCTGCC GCCGCTCGAT TTCGACCGCG CGCTCGACGC GATCAACGCC GCGATCCAAC CGATTGCCGT CGAGCGGATG ATCCGCCCGG ATGCGCCGCT GATATTGCAA GAGTTCGACA ATACGGTGCG CCTGCTGCGA CACGCCTGTC GTCTGGGGCA GTTGCTGACC CAACCCGACG GCGCCGGTGC GTTGCCGCGT CGCCGCTTGC TGAATGATGA TATGCGCGAG ATTATCCGCG AGTACGAACG TCTCTGGCTG GCGCGCAACC GCCTCGGCGG GCTGTCCGAC AGTGTCGCGC GGTTGGAGCG CGTGCGCGCC CGATACAATC CTCAGGAGCA GACGCCATGA
|
Protein sequence | MESLSLLPMP QSLTMLPGRY VATSAQRILL QGTAPGDLWF TARRLQNALR AYAGVEWELT ATPEGPPGEI GATLRVAADS VIHPQGYALT IKDGGIIIEA PEPAGIFYGV CTLIQIIEQS GRSLPCLHIN DHPDFAARGV MIDISRDKVP TMETLLMLTD MLAGWKINQV QLYTEHTFAY RNHPDVWARA SPLTGEEVLA LDAYCKERFI ELVPNQNSFG HMHRWLIHPR YAALAEIHGE FQAPWGVMRG PFSLAPEDPD SLKLVRSLYD ELLPHFSSRL FNVGCDETVD LGQGRSRDAC AQRGVGRVYL DFLLKLYDAV KQHGRTMMFW GDIVNNHPEL IDELPRDLIA LEWGYEADHP FDRHCARYAA AGIPFYVCPG TSSWQSIAGR TDNTLGNLRN AAENGLKHGA IGYLITDWGD MGHWQVLPIS FLGFAVGAAF AWAYAANRDM NVPAAVSRHA FTDPTGAMGQ VAYDLGNVYR AVGYEPPNSS VLFWVLQAPD SDARDLPPLD FDRALDAINA AIQPIAVERM IRPDAPLILQ EFDNTVRLLR HACRLGQLLT QPDGAGALPR RRLLNDDMRE IIREYERLWL ARNRLGGLSD SVARLERVRA RYNPQEQTP
|
| |