Gene RoseRS_3616 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3616 
Symbol 
ID5210594 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp4516585 
End bp4518444 
Gene Length1860 bp 
Protein Length619 aa 
Translation table11 
GC content62% 
IMG OID640597209 
Productglycoside hydrolase family protein 
Protein accessionYP_001277921 
Protein GI148657716 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3525] N-acetyl-beta-hexosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.748752 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCATC TCTCGCTTCT CCCGATGCCG CAGTCGCTGA CGTTCTTGCC GGGCGCCTAT 
GTTGCCAGTT CGGCGCGCCG CATCCTGTTG CAGGGAACAG CGCCGGGAGC GCTCCTCTTC
GCAGCGCGGC GTTTGCAGAG CGCATTACGC GCCCACGCTG GCGTCGAGTG GGAACTGACC
GCCACACCCG AAGGACCGCC GGGCGAAATC GGCGCCACGC TGCGGGTGAT CGCTGATAGC
GCCAGCCATC CGCAGGGGTA CGTGCTTACG ATCAAGGACG GCGGCATCAT TATCGAAGCG
TCAGCCCCCG CTGGCATCTT CTATGGCGTC TGCACGCTGA TCCAGATCGT CGAGCAGACC
GGTCGCACCC TTCCCTGCCT GCACATCAGC GACTATCCCG ATTTTGCCGC ACGCGGAGTG
ATGATCGATA TCAGCCGCGA TAAAGTCCCA ACAATGGAAA CCCTGTTCAT GCTGGTCGAT
ATGCTGGCAG GGTGGAAGAT CAACCAGGTG CAACTCTACA CCGAGCATAC CTTCGCGTAT
CGCAATCACC CCGATGTGTG GGCGCGCGCA TCGCCGATGA CCGGCGAGGA AATCCTGACG
CTCGATGCGT ACTGCAAAGA ACGCCATATC GACCTGGTTC CCAACCAGAA CTCGTTTGGG
CATATGCACC GCTGGTTGAT CCATCCGCGC TATGCCGCGC TCGCCGAGAC CCACGACGCA
TTCCAGGCGC CCTGGGGAGT GATGCAGGGA CCGTTCAGCC TGGCGCCGGA CGATCCCGGC
AGCCTGGAAC TGGTGCGCAG CCTGTATGAT GAACTGCTGC CGCACTTTTC GAGCCGGTTG
TTCAACGTCG GCTGCGATGA GACGGTCGAT CTTGGACAGG GACGCAGTAA AGACATCTGT
GCACAGCGTG GCGTCGGTCG GGTCTACCTC GATTTTCTGC TCAAACTCTA CGATGCTGTG
AAGCAGCACG GGCGCACGAT GATGTTCTGG GGCGATATTG TCAACAATCA CCCGGAATTG
ATCGGCGAAC TGCCACGCGA TGTAATTGCG CTCGAATGGG GATATGAAGC CGATCACCCC
TTTGATCGCA ATTGCGCGCG CTATGCCGAA GCAGGGATCC CCTTCTATGT CTGCCCCGGC
ACGTCGTCGT GGCAAAGCAT CGCCGGGCGC ACCGACAATG CGCTGGGCAA TCTGCACAAT
GCAGCCGAGC ACGGGTTGAA GCACGGCGCT ATCGGCTATC TGATCACCGA TTGGGGCGAC
ATGGGGCACT GGCAGGCGCT GCCGATCAGT TTTCCGGGAT TCGCGGTCGG CGCGGCGTTT
GCCTGGGCAT ACGCTGCAAA CCGCACCATC AATGTTCCGG CAGCGGTCAG CCGCCATGCG
TTCACCGACC CGACTGGCGC GATGGGACAG GTTGCATACG ACCTGGGGAA TGTGTATCGC
GCCGTCGGCT ACGAGCCGCC CAACTCGTCG GTGCTGTTCT GGGTGTTGCA GGCGCCGGAC
ACCGATGCGC GCAACCTGCC GCCGCTCGAC TTCGACCGCG CGCTCGATGC GATCGATGCT
GCAATCCAGC CGATTGCCAC AGAACGCATG ACGCGCGCCG ATGCCCCGCT CATCCTGCAA
GAGTTCGATA ACACGGTGCG ATTACTGCGC CACGCCTGCC GTCTGGGACA GTTGCTGGTT
CAACCCGATG GACCCGGCGC GTTGCCGCGG CGCCGGTTGC TGAACAACGA CATGCGCGAG
ATCATTCGCG AGTACGAACG TCTCTGGCTG GCGCGCAACC GCCTCGGCGG GCTGTCCGAC
AGCGTCGCCC GGTTGGAGCG TGTGCGCGCC CGCTATACTT CTCAGGAGCA GCACGCATGA
 
Protein sequence
MDHLSLLPMP QSLTFLPGAY VASSARRILL QGTAPGALLF AARRLQSALR AHAGVEWELT 
ATPEGPPGEI GATLRVIADS ASHPQGYVLT IKDGGIIIEA SAPAGIFYGV CTLIQIVEQT
GRTLPCLHIS DYPDFAARGV MIDISRDKVP TMETLFMLVD MLAGWKINQV QLYTEHTFAY
RNHPDVWARA SPMTGEEILT LDAYCKERHI DLVPNQNSFG HMHRWLIHPR YAALAETHDA
FQAPWGVMQG PFSLAPDDPG SLELVRSLYD ELLPHFSSRL FNVGCDETVD LGQGRSKDIC
AQRGVGRVYL DFLLKLYDAV KQHGRTMMFW GDIVNNHPEL IGELPRDVIA LEWGYEADHP
FDRNCARYAE AGIPFYVCPG TSSWQSIAGR TDNALGNLHN AAEHGLKHGA IGYLITDWGD
MGHWQALPIS FPGFAVGAAF AWAYAANRTI NVPAAVSRHA FTDPTGAMGQ VAYDLGNVYR
AVGYEPPNSS VLFWVLQAPD TDARNLPPLD FDRALDAIDA AIQPIATERM TRADAPLILQ
EFDNTVRLLR HACRLGQLLV QPDGPGALPR RRLLNNDMRE IIREYERLWL ARNRLGGLSD
SVARLERVRA RYTSQEQHA