Gene RoseRS_4113 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_4113 
Symbol 
ID5211096 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp5153447 
End bp5154721 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content61% 
IMG OID640597701 
Productglycoside hydrolase family protein 
Protein accessionYP_001278407 
Protein GI148658202 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATCG TTGTCATCGG CGGCGGCAGC ACCTACACCC CTGAACTGAT CAAAGGTCTG 
ATCGCCCGGA GTCCCATCCT GAATCTGCAC GAAGTGTGGT TGGTCGATCC TGATGAGGAA
CGCCTGCGGA TTGTCGGTTC GTTCGCACAA CGCATGGTCA GCCATGCAAA CGCCGGATTT
CGCGTCGAGT TGACCGCCGA CCGGCAGCTG GCGCTGGAAG ACGCCGATTA TGTGGTGACT
CAGTTTCGTG TTGGCGGGCA GCAGGCGCGT CGCAATGACG AACTGCTTGG ACGACGGCAT
CGTCTTGTCG GGCAGGAGAC GACCGGCGTC GGCGGGTTTG CCAAGGCGCT GCGCACCATT
CCGGTGGCGC TCGACATTGC GCGCGATATG CGCGCAATCG CACCGCAGGC GATCCTGCTC
AATTTCACAA ACCCGGCAGG TCTTGTCACC GAGGCGGTGG CGCGTCATGG CGGCGTGCCG
GTGATCGGGT TGTGCAACAA TGCGATCAAT GCGCAGCGCG CGATTGCCCG CATGGTCAAT
GTTCCACCCG AACAGGTGTT CATCGAGCAG GTCGGGCTGA ATCATCTGAA CTGGATCCGT
CGCGTGACGA TCAACGGCGA CGATGCGACT GATGCCGTAC TCGAGGCATA TGTCGAGCAT
CTGCGCCACG ACGAGGATCC GATCCATTTC CCACCCCGAT TGATCCAGAT GCTGCGCGCC
ATTCCTTCGT CGTACCTGCG CTATTTCTAC CTTACCCCGC AGATCATTGC GCAGCAGGAG
AGCGGTGCGC CAACCCGCGC CGACGTGGTG ATGGATGTCG AGCGACGGTT GCTCGCGCGC
TACGCCGACC CGACGCTGCG CGAGATGCCG CCGGAACTGA TGGAGCGCGG CGGAGCGTAC
TACTCCACAG CGGCTGCGGC GCTGATCGAA TCGCTCCACA CCGGCGACAA CGCCATTCAT
GTTGTGAATA CGCGCAACAA CGGCGCTATC CCCAACCTGG ACGATGATGT GGTCGTCGAG
ATGCCATGCA CGGTCGGGAA GCATGGCGCA ACGCCTATCC CCGTTGCGCC ACTAGAGCCG
ATCTTCCATG GTCTGACCTG TCAGGTGAAA GCGTATGAAC TGCTGACCGT GAAAGCGGCG
GTCGAGGGCG ACGAGGATGC AGCAATGCTG GCGCTGCTCA CCAACCCGCT CGGACCGGAT
GCAGCGCGCG TTGAGACGGT GTGGGAGGAT ATCAAACGAA CGAACCGGGG TTTGCTTCCG
ACCTTCGAGA GGTAA
 
Protein sequence
MKIVVIGGGS TYTPELIKGL IARSPILNLH EVWLVDPDEE RLRIVGSFAQ RMVSHANAGF 
RVELTADRQL ALEDADYVVT QFRVGGQQAR RNDELLGRRH RLVGQETTGV GGFAKALRTI
PVALDIARDM RAIAPQAILL NFTNPAGLVT EAVARHGGVP VIGLCNNAIN AQRAIARMVN
VPPEQVFIEQ VGLNHLNWIR RVTINGDDAT DAVLEAYVEH LRHDEDPIHF PPRLIQMLRA
IPSSYLRYFY LTPQIIAQQE SGAPTRADVV MDVERRLLAR YADPTLREMP PELMERGGAY
YSTAAAALIE SLHTGDNAIH VVNTRNNGAI PNLDDDVVVE MPCTVGKHGA TPIPVAPLEP
IFHGLTCQVK AYELLTVKAA VEGDEDAAML ALLTNPLGPD AARVETVWED IKRTNRGLLP
TFER