Gene Rcas_3971 details

Gene Information

Locus tag: Rcas_3971
Symbol:
ID: 5541477
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 5176937
End bp: 5178022
Gene Length: 1086 bp
Protein Length: 361 aa
Translation table: 11
GC content: 62%
IMG OID: 640896079
Product: UDP-N-acetylglucosamine 2-epimerase
Protein accession: YP_001434022
Protein GI: 156743893
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0381] UDP-N-acetylglucosamine 2-epimerase
TIGRFAM ID: [TIGR00236] UDP-N-acetylglucosamine 2-epimerase
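
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information fields above.
start_bp = 5176937
end_bp = 5178022
gene_length_bp = 1086
protein_length_aa = 361


def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons minus one stop codon.

    Assumes a complete, unspliced CDS whose final codon is a stop codon.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks should agree with the values reported in the record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions, 5178022 - 5176937 + 1 gives 1086 bp, and 1086 / 3 - 1 gives 361 aa, matching the reported gene and protein lengths.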


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.862921
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.0340868
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAAAACAG TGGTGACAGT TCTGGGCACC CGCCCGGAGA TTATCAAACT ATCGCCGCTC 
ATTCCACTGC TCCGTGAGCG GTTTCGCCAC ATCCTCGTGC ATTCCGGGCA GCACTATTCT
TTCGAGATGG ACGCGGTCTT TTTCGAGGAA TTGGGGCTGC CGGCGCCTGA TTACACGCTT
GGCGTCGGCT CAGCGTTGCA CGGCGAACAG ACAGCGCGCA TGCTGTCGCG GCTGGAGCCG
ATCCTGCTTG AAACAAAGCC CGACATGGTT CTGGTTCAGG GTGACACGAA TACGGCGATG
GCAGGCGGAT TGTGCGCGGC CAAACTCAAT ATTCCGGTCG CACATCTCGA GTCTGGTGGG
CGGTCCTTCA ATCGCCAGAT GCCCGAAGAA CTCAACCGCA TTATTCTCGA CCATATTGCG
ACGCTGTTGC TGGCTGCCGA TGAAACCGCC GAGCGCAATC TGCTGGCGGA AGGGTTGCCG
CCTGAGCGGA TCCGTATGGT TGGGTCGAGT GTGATCGATG CTGTCGCGCG GAACCGGCAG
CATGCCCGCC GCTCGACCAT CGTGCAGCGT CTGGAGGTGA CCCCCGGCGA CTACCTGGTG
CTGACCCTGC ACCGCAGCGA GAATACCACT CCTGCCGTGC TGCCCGGCAT GATCCGCGCC
CTCGGTGAGT TGGCGGAAGA GCACACAATC GTGTTTCTGC TGCATCCGCG CACTGCGGCG
GCGATGCGAT CCTATGGCAT TGTCATGCCG CGCAATATTC GCGTCAGTGA GCCGCTTGGC
TATCTCGACA CGCTCTGCCT CGTCGAGCAG GCGCGCGCGC TTCTCACCGA TTCTGGCGGC
TTGCAAGAGG AGGCGGGCGC ACTGGGAACG CCAACGCTCA TCCTGCGCAA CGAAACCGAG
TGGCGTTACC TGGTGGACGC CGGGATGCAC GTGCTGGTCG GTAACACGTA TGAGTCTATT
CTTAGTGGCG CTCGTCGATG GTTGCAACCC GCAGCGCTTG CCCGGTTGCG GTCCGCGCCG
GCGCCGATCC GCACCGGCGC CAGTGAACGC GCCGTCGCAG CGATGGTTGA CGTGTTATAC
CAATGA
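
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs to be cleaned before any computation. The following is a minimal sketch of that cleanup plus a GC percentage helper; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(text):
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied as displayed (six blocks of 10).
first_line = "ATGAAAACAG TGGTGACAGT TCTGGGCACC CGCCCGGAGA TTATCAAACT ATCGCCGCTC "

seq = clean_sequence(first_line)
print(len(seq))        # number of bases on the first line
print(seq[:3])         # first codon
print(gc_percent(seq)) # GC percentage of this fragment only
```

Passing the full multi-line sequence to `clean_sequence` works the same way, since `str.split()` with no arguments splits on any run of whitespace, including line breaks.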
 
Protein sequence
MKTVVTVLGT RPEIIKLSPL IPLLRERFRH ILVHSGQHYS FEMDAVFFEE LGLPAPDYTL 
GVGSALHGEQ TARMLSRLEP ILLETKPDMV LVQGDTNTAM AGGLCAAKLN IPVAHLESGG
RSFNRQMPEE LNRIILDHIA TLLLAADETA ERNLLAEGLP PERIRMVGSS VIDAVARNRQ
HARRSTIVQR LEVTPGDYLV LTLHRSENTT PAVLPGMIRA LGELAEEHTI VFLLHPRTAA
AMRSYGIVMP RNIRVSEPLG YLDTLCLVEQ ARALLTDSGG LQEEAGALGT PTLILRNETE
WRYLVDAGMH VLVGNTYESI LSGARRWLQP AALARLRSAP APIRTGASER AVAAMVDVLY
Q