Gene Rcas_3421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3421 
Symbol 
ID5540920 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4459336 
End bp4460748 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content57% 
IMG OID640895539 
ProductUDP-glucose/GDP-mannose dehydrogenase 
Protein accessionYP_001433489 
Protein GI156743360 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1004] Predicted UDP-glucose 6-dehydrogenase 
TIGRFAM ID[TIGR03026] nucleotide sugar dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.599581 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.35369 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATCA TCGTTGTCGG CACCGGTTTC GTCGGTCTGA CTCACGCAGC GGTCTGTTCG 
GAGTATGGCC ACGAAGTCTA TGCATACGAT ATTGATCACC GGCGTATCGA GGCGTATCGG
TCGTGTCGCC AGGATGCAAT TGAGCACTAT GTGCATGAAC CAGGGCTGAC GAGCATTATC
CTCGAAAACC ATGGACGGTA TCTGTTCTTC GTTGACGATG TCGAACCGGT AATCGAAGGC
GCTGATGCGC TCTTCCTGTG TTTGCCGACG CCGCCCAAGC CGGATGGCTC ATCGGACCTG
AGTTACTACT TTGCGGCAGT GCGCCATCTG GCGGAACTAC TGGCGCGCCG CCAGGATCAA
CGCCGCGTGG TGGTGATCAA CAAAAGCACA GTGCCGATTG GCACAGCGCG GCAGCTGGAA
CGGGTGCTGC GTGACTACCA CGTGCCCAAC GTGGGGGTTG CCTCAAACCC TGAATTTCTC
CCGGAGGGTG ATGCGGTCGA GAAGTCGCGC CGTCCGGATC GCGTGGTGGT CGGCGCCGAC
TGTGAGGAAG ATTTTCGTAT CATCCGCCGG ATTTACTCGC AATTCGTCAA CCACGTGCGC
ATTCGCTACA TCGAAACCAC GCCAGAAACC GCAGAAGCCA TCAAGTATGT GGCCAATACG
CTGCTGTTGA CCTACATCTC GTTCTGGAAC GGCGTTGGTG CGCGTCTTGC CGAAACCTTC
CCAAATATCA TCATGGAAGA TCTGAAGCGT GGCGTTACTG CCGACGCGCG CATTAGCACA
TGGGGGTCGT ATGTGTCGAA CGGCGCCGGC GGGTCGTGCT TCGGCAAGGA TATTCAGTCA
CTCATCTATC AATTGAAGAC GGCAGGTCAA TCGGTCGATA TTCTGCAATC GGTCTATGGC
ATCAACGAGT ATCAGAAAAC CTATCTGATT GATCGCGCGA TCCGGGAAGC CGGGGTCAGT
TTTAACAACA AAACGGTGGC GCTGCTGGGG CTGGCGTTCA AACAACGCAC CAACGATATG
CGTGACTCGT CGTCGCTCAA GGTGGTTGAG GCATTACTGG GCAGAGGTGT GCGCGCCATT
CGCGCCTACG ATCCAATGGC GCTGCGTGAA GCGCGCAAGT TCTTCGATCC AGAGAAGAAC
CATCTGTTCG AGCGCATTTC CTACCATACG TCGGTGCGCG AGGCGCTCGC CGGAACCGAT
ATGCTCTTCA TCTCCACCGA TTGGGAAGAG TTTCGCGGTC TGTCGCGCAC CATCGAAGAG
ACGGTCGCGC CGCCCTATCT GGTCATCGAT GGTCGTCGAA TGATTCCCGA TTATACGGAA
TTGGTTGCGG CTGGCTACGG GTACCTTGCC GTTGGTTCGC CGTACATGCC ACCCGGCGAT
GTTCCGCCGA ACCGCGTTGG ACGACGGGGC TGA
 
Protein sequence
MKIIVVGTGF VGLTHAAVCS EYGHEVYAYD IDHRRIEAYR SCRQDAIEHY VHEPGLTSII 
LENHGRYLFF VDDVEPVIEG ADALFLCLPT PPKPDGSSDL SYYFAAVRHL AELLARRQDQ
RRVVVINKST VPIGTARQLE RVLRDYHVPN VGVASNPEFL PEGDAVEKSR RPDRVVVGAD
CEEDFRIIRR IYSQFVNHVR IRYIETTPET AEAIKYVANT LLLTYISFWN GVGARLAETF
PNIIMEDLKR GVTADARIST WGSYVSNGAG GSCFGKDIQS LIYQLKTAGQ SVDILQSVYG
INEYQKTYLI DRAIREAGVS FNNKTVALLG LAFKQRTNDM RDSSSLKVVE ALLGRGVRAI
RAYDPMALRE ARKFFDPEKN HLFERISYHT SVREALAGTD MLFISTDWEE FRGLSRTIEE
TVAPPYLVID GRRMIPDYTE LVAAGYGYLA VGSPYMPPGD VPPNRVGRRG