Gene Rcas_0784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0784 
Symbol 
ID5538250 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp1025276 
End bp1026346 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content65% 
IMG OID640892936 
ProductUDP-N-acetylglucosamine 2-epimerase 
Protein accessionYP_001430919 
Protein GI156740790 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0381] UDP-N-acetylglucosamine 2-epimerase 
TIGRFAM ID[TIGR00236] UDP-N-acetylglucosamine 2-epimerase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.769724 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAAAC CCTTGACCAT CGTCACCATT GTCGGCGCGC GACCGCAGTT CATCAAGGCA 
GCCGCCGTGA GTCGCGTGTT GCGAGCGCGT CACCGCGAGG TGCTGGTACA CACCGGGCAG
CATTACGACG CCAATATGTC GGCGATCTTC TTTGACGAAC TAGGCATTCC GCCGCCCGAT
GTCAACCTGG CGGTCGGCTC CGCCGGCCAT GGCGCGCAGA CCGGCGCAAT GCTGGCGAAG
ATTGAAGAGG TATTGCTGGC GGAACACCCG GATTGGGTGT TGGTGTATGG CGACACCAAC
TCCACGCTGG CAGGCGCGCT GGCGGCGGCA AAATTGCGCA TCCCCGTCGC CCACGTCGAA
GCCGGGCTAC GCAGTTTCAA CCGCGCCATG CCAGAAGAGA TCAACCGGGT ATTGACCGAT
CACCTGTCCG ATCTGCTCCT TTGCCCAAGC CAAACCGCTA TCGATAACCT GGCGCGCGAA
GGAATCACCC GCAGCGTCAT ACTGGTCGGC GATGTGATGG CAGACGCGCT GCGGTTGGCT
GCCGAACGTG CCGACACCCC AGTTCTGGAG TCGTTCGGCG TCCAGCCAGG CGACTACGCG
CTTGCAACCG TCCATCGCGC CGAAAACACC GATGACCCGC TGCGTTTGCA GGGCATTCTG
ATCGGTCTGA CGCTGCTGGA CATGCCGGTC ATCTTTCCGG CGCACCCGCG CGCGCGCCGC
GCAATTGCCG CGCTCGAATG GACTCCGCCT GCCCACGTGC GCCTGGTCGA ACCGGTCGGT
TACCTGGGCA TGGTCGCCCT CATGCGCGGC GCGCGCGTTA TCCTGACCGA TTCAGGCGGG
GTGCAGAAGG AAGCGTACTG GCTTGGCGTC CCCTGCGTGA CGCTGCGCGA CGAGACCGAA
TGGGTCGAGA CAGTCGCCCA TGGATGGAAC ACCCTGGTCG GCGCCGATCC CGAACGGATC
GTAGCCGCTG CGCGCCAGCC CTATCCGACA ACGCCGCACC CGCCGCTCTA CGGCGATGGT
CACGCCGCCG AACGGTGCGT GGCAGCGCTT GAAAAAGGGG GGGAAGGTTG A
 
Protein sequence
MSKPLTIVTI VGARPQFIKA AAVSRVLRAR HREVLVHTGQ HYDANMSAIF FDELGIPPPD 
VNLAVGSAGH GAQTGAMLAK IEEVLLAEHP DWVLVYGDTN STLAGALAAA KLRIPVAHVE
AGLRSFNRAM PEEINRVLTD HLSDLLLCPS QTAIDNLARE GITRSVILVG DVMADALRLA
AERADTPVLE SFGVQPGDYA LATVHRAENT DDPLRLQGIL IGLTLLDMPV IFPAHPRARR
AIAALEWTPP AHVRLVEPVG YLGMVALMRG ARVILTDSGG VQKEAYWLGV PCVTLRDETE
WVETVAHGWN TLVGADPERI VAAARQPYPT TPHPPLYGDG HAAERCVAAL EKGGEG