Gene RoseRS_3943 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3943 
Symbol 
ID5210927 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp4935030 
End bp4936319 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content59% 
IMG OID640597539 
ProductAlpha-L-fucosidase 
Protein accessionYP_001278245 
Protein GI148658040 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3669] Alpha-L-fucosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCACTC CAACGCCAAC GCGCGGCGAT ACGTCGTGGT TCGTCCGTGA TCGCTTCGGC 
ATGTTCATTC ACTGGGGATT GTATGCGCTT GCAGCACGCC ATGAGTGGGT CAAAAGCCGC
GAGAAGATCG ACGACGAAAC GTATCAGCGC TACTTCGACC ATTTCGATCC CGATCTCTAC
GATCCGCGCG TGTGGGCGCG CGCCGCGCGC GAGGCGGGGA TGAAGTATGT GGTGATCACG
ACCAGGCACC ACGAGGGGTT CTGCCTGTGG GATACGCACT ATACGGCGTA TAAGGCGCCC
AATACCCCGG CAAAACGCGA TCTGCTGAAA CCGTTCGTCG AGGCGTTCCG CGCCGAAGGA
TTGCGCATCG GCTTTTACTA CTCCCTCATC GACTGGCATC ATCCCGATTT TCCGATCGAT
ATCTACCATC CCCTCCGCGA CCACCCCAAT GTCGCCGAAT TGAATGCCGG TCGTGACATT
CGACGATATG CCGCATATAT GCGCAATCAG GTGCGCGAAC TTCTTACCGG CTACGGACCG
GTGGACATCA TCTGGTTCGA CTTCTCCTAC CCCAACCGCG CGTACAACGG TCTGCCGGGC
AAAGGACGCG CCGATTGGGA GAGTGAGGCG CTGTTGCGGC TGGTGCGCGA ACTGGCGCCG
GATATTATTG TCAATAATCG TCTCGATCTG CCAACCGAGT TCGCCGATGT GCATACCCCT
GAACAGTTTC AACCGCGTGA ATGGGTGCAT GTCAACGGCG AACCGGTGGT GTGGGAGACG
TGCCAGACAT TCAGCGGCTC GTGGGGCTAC CACCGCGACG AGATGACCTG GAAAAGCCCG
GAACAACTCA TTCAGATGCT GATCAACTCG GTGGCTTGCG GCGGAAACCT GTTGATGAAT
GTTGGTCCCA CCGCGCGCGG CACGTTCGAC GACCGGGCAA TGGCCGCGCT CAAGGTCTAT
GCCGACTGGA TGCGCCTGCA TAACCGCTCG ATCTATGGCT GCACGCAGAG CGAGTTCGCC
GCACCGACCG ACTGCCGCCT GACGCAAAAT GGGAAACGGC TCTACCTGCA CATCTTTTCC
TGGCCCTTCC GCCATGTGCA TCTTGACAGC ATGGCAGGCA GGGTGGAATA TGCGCAACTC
CTCCACGATG CCAGCGAGGT GAAACTGCTC GAGCCGGGCA GGCACAGTGA ATGGAGCATC
GCTGAAACTG CCGCCGATAC GCTGACACTG GAATTGCCGG TGGCCAAACC CAGGGTAACG
GTGCCGGTGG TAGAATTGTT CCTCCGTTGA
 
Protein sequence
MLTPTPTRGD TSWFVRDRFG MFIHWGLYAL AARHEWVKSR EKIDDETYQR YFDHFDPDLY 
DPRVWARAAR EAGMKYVVIT TRHHEGFCLW DTHYTAYKAP NTPAKRDLLK PFVEAFRAEG
LRIGFYYSLI DWHHPDFPID IYHPLRDHPN VAELNAGRDI RRYAAYMRNQ VRELLTGYGP
VDIIWFDFSY PNRAYNGLPG KGRADWESEA LLRLVRELAP DIIVNNRLDL PTEFADVHTP
EQFQPREWVH VNGEPVVWET CQTFSGSWGY HRDEMTWKSP EQLIQMLINS VACGGNLLMN
VGPTARGTFD DRAMAALKVY ADWMRLHNRS IYGCTQSEFA APTDCRLTQN GKRLYLHIFS
WPFRHVHLDS MAGRVEYAQL LHDASEVKLL EPGRHSEWSI AETAADTLTL ELPVAKPRVT
VPVVELFLR