Gene Rcas_3784 details

Gene Information

Locus tag: Rcas_3784
Symbol:
ID: 5541286
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 4955816
End bp: 4956994
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 59%
IMG OID: 640895894
Product: N-acylglucosamine 2-epimerase
Protein accession: YP_001433841
Protein GI: 156743712
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2942] N-acyl-D-glucosamine 2-epimerase
TIGRFAM ID:
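
The reported gene and protein lengths can be cross-checked against the coordinates. The Python sketch below assumes that Start bp and End bp are 1-based and inclusive (the record does not state the convention) and that the final codon of the CDS is a stop codon. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(4955816, 4956994)
print(length_bp)                  # 1179
print(protein_length(length_bp))  # 392
```

Under those assumptions, both values agree with the Gene Length and Protein Length fields above.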


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.039561
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.0465307
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCGCCT CGTATGCAGC ACGCTATCGC GCTCACCTGG AGGAGTCGGT CATTCCGTTC 
TGGTTGCAGC ACTCGCTGGA TCACGAGTAT GGCGGACAGT TTACCTGCCT TGATCGCGAC
GGCTCGATCT ACGATACAAA GAAGTACATG TGGTTGCAAG GGCGCGCGGT CTGGATGTTT
TCGCGGTTGT ACAACGAGTT CGACCATCGC CAGGAGTTCC TCGACGCCGC AACGCTCATC
TGCCGATTTA TGCGTCGTTA TGGGCGCGAC CCGCAAGGGC GCGTTTATTT TAGTCTGACG
CGCGAAGGAG AACCGTACTT TTATCAGCGC AAGCCGTATG CGGCGGTGTT CTACATGCTC
GGCTTGCTGG AGTATGCGCG CGCAACCGGC GATCAAGAGT GTCTTAATGA GGCTATCGAG
GTCTTCTGGC AGATCGACCG ATGGATCCGC AATCCGGCGC TGCTCGACCG CCCTGCGCTG
ACGGGCAATC CGCCGGTCAG CAGCCTGGCG AATGTGATGG TGTTGGCGAG CATGGCCATC
GAACTGGCGC GGGTCTCCGA TGATCTGCGC TATGTGCAGG TGATCCGCGA CGCTATGGAT
GGCGTGGTGC GCCATTACGA TCCACAGCGC CGCATTCTCA TTGAACATGT GGCTCTCGAT
AGGCGCGACC TGCGCGCCTG GCCCGAAGGT CGGCTGTTCA GCCCTGGTCA CTCGATTGAG
GTCGCCTGGT TTCTGCTCCA CATGCTCGAG TTTGTGCCGT CCGAAGACCA TCGGCGCCTG
GCGTTCGATG TGCTTGAAGG ATCGCTGGAG TATGGCTGGG ATCGGGAGTA TGGCGGGTTG
TACTACTTTA TGGACATCGA AGGCAAGCCG ACGCTTCAGC TCGAGTCAAC GATGAAGTTG
TGGTGGCCCC ACACGGAGGC GATCTATGCC CTGACGCTGG CATATACGCT GACCGGCGAG
GAACGCTGGA TGCGCTGGCT GGAACGAGTG GACAACTATG CGTTTCGCAC GTTTGTCGAT
CCCGTAGAGG GGGAATGGTT CGGCTATTGC GACCGGCGCG GCGATCTGGC GTTGACCAGC
AAGGGAGGCA ATTACAAAGG GTTCTTCCAT GTGCCGCGCG CGCTACTCTT CAGTGTGCAA
CGCATTGAAG GCGCGTCCTT CGCGCAGCAC CGTCCATAA
 
Protein sequence
MLASYAARYR AHLEESVIPF WLQHSLDHEY GGQFTCLDRD GSIYDTKKYM WLQGRAVWMF 
SRLYNEFDHR QEFLDAATLI CRFMRRYGRD PQGRVYFSLT REGEPYFYQR KPYAAVFYML
GLLEYARATG DQECLNEAIE VFWQIDRWIR NPALLDRPAL TGNPPVSSLA NVMVLASMAI
ELARVSDDLR YVQVIRDAMD GVVRHYDPQR RILIEHVALD RRDLRAWPEG RLFSPGHSIE
VAWFLLHMLE FVPSEDHRRL AFDVLEGSLE YGWDREYGGL YYFMDIEGKP TLQLESTMKL
WWPHTEAIYA LTLAYTLTGE ERWMRWLERV DNYAFRTFVD PVEGEWFGYC DRRGDLALTS
KGGNYKGFFH VPRALLFSVQ RIEGASFAQH RP
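
Both sequences above are printed in space-separated blocks of 10 characters. The Python sketch below shows one way to strip that spacing, compute GC content, and translate a coding sequence. The function names are hypothetical. The codon table uses the standard amino-acid assignments, which translation table 11 shares with table 1 (the two differ only in their permitted start codons). For brevity, only the first 30 bases of the gene sequence are used.

```python
from itertools import product

# Codons in TCAG order (third base varies fastest), paired with the
# standard amino-acid assignments; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence, truncated to 30 bases.
prefix = clean_sequence("ATGCTCGCCT CGTATGCAGC ACGCTATCGC")
print(translate(prefix))  # MLASYAARYR
```

The translated prefix matches the first 10 residues of the protein sequence. Passing the full cleaned gene sequence to `gc_content` would allow the same kind of check against the reported GC content.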