Gene Rcas_0778 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0778 
Symbol 
ID5538244 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp1017886 
End bp1018863 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content45% 
IMG OID640892931 
Productpolysaccharide biosynthesis protein CapD 
Protein accessionYP_001430914 
Protein GI156740785 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1086] Predicted nucleoside-diphosphate sugar epimerases 
TIGRFAM ID[TIGR03589] UDP-N-acetylglucosamine 4,6-dehydratase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.909827 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACTGGC ATGAGGTAGT AGTACTTGTA ACGGGAGGCA CGGGCTCATT CGGTAAAAAG 
TTCGCCAAGA TAATGCTAGA GGATTATCAA CCTGAGAAGG TGATAATTTA CAGCCGAGAT
GAGCTAAAGC AACATGAGAT GCGAATAGCG GGGTTTGATC ATCCCTCTAT ACGTTATTTC
ATCGGTGATG TACGCGATTT AGCAAGACTA CGTCGTGCTA TGTATGGAGT GGACATCGTG
GTTCATGCGG CTGCATTGAA GCAAGTTCCA GCTTGCGAAT ATAATCCTAT CGAGGCGGTC
ATGACAAACA TTAACGGTGC AAGAAATGTT ATTGATGCTG CCATCGATAT GGGCGTAAAA
AAGGTTGTGG CTATAAGTAC AGATAAGGCG GTAAATCCCG TGAATCTTTA TGGTGCCACC
AAGCTTTGTG CTGAAAAACT GTTTATTCAA AGCAACTCCT ATTCAGGCAG TACCGGAACT
CGCTTCAGTT GTGTACGCTA TGGTAACGTA GTTGGAAGCA GTGGTAGCGT AATCCCTCTT
TTCCGAGAGC AACGGCGATC TGGTCGTATT ACCGTGACTG ATCCGAGAAT GACACGTTTT
TGGATTACAT TAGATCAAGG CGTACGATTT GTTATTCGTT GCATTGAGCA AATGCATGGA
GGGGAAGTGT TTGTTCCTAA GATTCCCAGT ATGAACATTA TGGACCTAGC AAAAGCAATA
GCACCGGATT GCGTGGTGGA GTCCATCGGG ATTAGGCCCG GCGAGAAACT CCACGAAGTA
TTAATTTCTG AAGATGAAGC ACGTCATACG TTAGAACTTG AAGATATGTA TGTTGTTCAG
CCAAGATATC CATGGTGGCA GGTTAAGGAC TGGGAAGGAG GAAAGCCACT CCCTGAGGGT
TTCCGGTATG CTAGTAACAC AAACAGTCAG TGGCTCTCGG TAAGTGAGCT ACGAGTATTA
GCAGAGGACT TAATATGA
 
Protein sequence
MNWHEVVVLV TGGTGSFGKK FAKIMLEDYQ PEKVIIYSRD ELKQHEMRIA GFDHPSIRYF 
IGDVRDLARL RRAMYGVDIV VHAAALKQVP ACEYNPIEAV MTNINGARNV IDAAIDMGVK
KVVAISTDKA VNPVNLYGAT KLCAEKLFIQ SNSYSGSTGT RFSCVRYGNV VGSSGSVIPL
FREQRRSGRI TVTDPRMTRF WITLDQGVRF VIRCIEQMHG GEVFVPKIPS MNIMDLAKAI
APDCVVESIG IRPGEKLHEV LISEDEARHT LELEDMYVVQ PRYPWWQVKD WEGGKPLPEG
FRYASNTNSQ WLSVSELRVL AEDLI