Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_3616 |
Symbol | |
ID | 5210594 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 4516585 |
End bp | 4518444 |
Gene Length | 1860 bp |
Protein Length | 619 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640597209 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001277921 |
Protein GI | 148657716 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3525] N-acetyl-beta-hexosaminidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.748752 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATCATC TCTCGCTTCT CCCGATGCCG CAGTCGCTGA CGTTCTTGCC GGGCGCCTAT GTTGCCAGTT CGGCGCGCCG CATCCTGTTG CAGGGAACAG CGCCGGGAGC GCTCCTCTTC GCAGCGCGGC GTTTGCAGAG CGCATTACGC GCCCACGCTG GCGTCGAGTG GGAACTGACC GCCACACCCG AAGGACCGCC GGGCGAAATC GGCGCCACGC TGCGGGTGAT CGCTGATAGC GCCAGCCATC CGCAGGGGTA CGTGCTTACG ATCAAGGACG GCGGCATCAT TATCGAAGCG TCAGCCCCCG CTGGCATCTT CTATGGCGTC TGCACGCTGA TCCAGATCGT CGAGCAGACC GGTCGCACCC TTCCCTGCCT GCACATCAGC GACTATCCCG ATTTTGCCGC ACGCGGAGTG ATGATCGATA TCAGCCGCGA TAAAGTCCCA ACAATGGAAA CCCTGTTCAT GCTGGTCGAT ATGCTGGCAG GGTGGAAGAT CAACCAGGTG CAACTCTACA CCGAGCATAC CTTCGCGTAT CGCAATCACC CCGATGTGTG GGCGCGCGCA TCGCCGATGA CCGGCGAGGA AATCCTGACG CTCGATGCGT ACTGCAAAGA ACGCCATATC GACCTGGTTC CCAACCAGAA CTCGTTTGGG CATATGCACC GCTGGTTGAT CCATCCGCGC TATGCCGCGC TCGCCGAGAC CCACGACGCA TTCCAGGCGC CCTGGGGAGT GATGCAGGGA CCGTTCAGCC TGGCGCCGGA CGATCCCGGC AGCCTGGAAC TGGTGCGCAG CCTGTATGAT GAACTGCTGC CGCACTTTTC GAGCCGGTTG TTCAACGTCG GCTGCGATGA GACGGTCGAT CTTGGACAGG GACGCAGTAA AGACATCTGT GCACAGCGTG GCGTCGGTCG GGTCTACCTC GATTTTCTGC TCAAACTCTA CGATGCTGTG AAGCAGCACG GGCGCACGAT GATGTTCTGG GGCGATATTG TCAACAATCA CCCGGAATTG ATCGGCGAAC TGCCACGCGA TGTAATTGCG CTCGAATGGG GATATGAAGC CGATCACCCC TTTGATCGCA ATTGCGCGCG CTATGCCGAA GCAGGGATCC CCTTCTATGT CTGCCCCGGC ACGTCGTCGT GGCAAAGCAT CGCCGGGCGC ACCGACAATG CGCTGGGCAA TCTGCACAAT GCAGCCGAGC ACGGGTTGAA GCACGGCGCT ATCGGCTATC TGATCACCGA TTGGGGCGAC ATGGGGCACT GGCAGGCGCT GCCGATCAGT TTTCCGGGAT TCGCGGTCGG CGCGGCGTTT GCCTGGGCAT ACGCTGCAAA CCGCACCATC AATGTTCCGG CAGCGGTCAG CCGCCATGCG TTCACCGACC CGACTGGCGC GATGGGACAG GTTGCATACG ACCTGGGGAA TGTGTATCGC GCCGTCGGCT ACGAGCCGCC CAACTCGTCG GTGCTGTTCT GGGTGTTGCA GGCGCCGGAC ACCGATGCGC GCAACCTGCC GCCGCTCGAC TTCGACCGCG CGCTCGATGC GATCGATGCT GCAATCCAGC CGATTGCCAC AGAACGCATG ACGCGCGCCG ATGCCCCGCT CATCCTGCAA GAGTTCGATA ACACGGTGCG ATTACTGCGC CACGCCTGCC GTCTGGGACA GTTGCTGGTT CAACCCGATG GACCCGGCGC GTTGCCGCGG CGCCGGTTGC TGAACAACGA CATGCGCGAG ATCATTCGCG AGTACGAACG TCTCTGGCTG GCGCGCAACC GCCTCGGCGG GCTGTCCGAC AGCGTCGCCC GGTTGGAGCG TGTGCGCGCC CGCTATACTT CTCAGGAGCA GCACGCATGA
|
Protein sequence | MDHLSLLPMP QSLTFLPGAY VASSARRILL QGTAPGALLF AARRLQSALR AHAGVEWELT ATPEGPPGEI GATLRVIADS ASHPQGYVLT IKDGGIIIEA SAPAGIFYGV CTLIQIVEQT GRTLPCLHIS DYPDFAARGV MIDISRDKVP TMETLFMLVD MLAGWKINQV QLYTEHTFAY RNHPDVWARA SPMTGEEILT LDAYCKERHI DLVPNQNSFG HMHRWLIHPR YAALAETHDA FQAPWGVMQG PFSLAPDDPG SLELVRSLYD ELLPHFSSRL FNVGCDETVD LGQGRSKDIC AQRGVGRVYL DFLLKLYDAV KQHGRTMMFW GDIVNNHPEL IGELPRDVIA LEWGYEADHP FDRNCARYAE AGIPFYVCPG TSSWQSIAGR TDNALGNLHN AAEHGLKHGA IGYLITDWGD MGHWQALPIS FPGFAVGAAF AWAYAANRTI NVPAAVSRHA FTDPTGAMGQ VAYDLGNVYR AVGYEPPNSS VLFWVLQAPD TDARNLPPLD FDRALDAIDA AIQPIATERM TRADAPLILQ EFDNTVRLLR HACRLGQLLV QPDGPGALPR RRLLNNDMRE IIREYERLWL ARNRLGGLSD SVARLERVRA RYTSQEQHA
|
| |