Gene Rcas_0803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0803 
Symbol 
ID5538269 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp1048926 
End bp1050326 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content58% 
IMG OID640892955 
ProductN-acetylglucosamine-1-phosphate uridyltransferase-like protein 
Protein accessionYP_001430938 
Protein GI156740809 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1207] N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.575 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGCA TTATCATCCG CGAGCGCACG CTCATTCCCC CCTTCGGCGA GCCGGCGCGC 
GATCTGCGTG TGCTGAACAA ACCGCTCTGG CTGCTGCAAC GTGACCTGCT CGCTCCCTAC
TGCCGCGGTG CGCTGGAAGT CGACTCGCTC GAAGAAGCGC CGCCGGTCAA CGAAGAATTG
CTCGTCCATC GTGACAATCT GTTCTTCAAC GAGTACCTGA TCGATGCCTT CATCCGCGAG
GCGCGCAAAA CTGGCCGCGC CTGTCGCATC GCTTTTGCGC GCAACGACGC GGCAATCGTC
ACGCACGCGC TGCACTTGCA AGAAGGCATT CGCCTCGATG AGCGCCATGG CGTGTACGTC
GCCGACCTGT TCTACTACCC ACGCGGTCCC GACGCCGATC CGGAGCCGCT GGTGATCGAT
ACGTTGCCGC GCGAAATGGG GTACTACCAC ATCCCCAGTT ATATGGCGCG TCACGGCGAC
CTGGTGTTTC AGGTGCCAAT GCGCGCGTTT CTGTCGATCG AAAACTGGGT GCATCTGTTT
CTGGCAAACT CGCCGTTTGG GGTCTTCTCG TGGGGGCGGA TCCACGAGCA ACTCGTCGAA
GAAAGTTGGA AAGAGAAAAT TGCGGTATCA TTGACCACGC TGGCGGAGCG ACTCAACCCG
TTTGCTCCGC CCTGGCGCAA CCATTTTCTG TCGTGCTCGC GTCTGGTGAA GGTCGGGAAA
AATTGCTCGA TCGATCCCAC AGCGGTTATT CATGGACCGA CCGTCATCGG CAACAATGTC
TATATCGGCG CTGGCGTCGT GATCACCAAT AGCCTGATTG GCGATAACGT CAATATCATG
CAGGGATCGC AGGTGATGCT GAGTGTGGTG AGCGACCGCT GCTACCTGCC GTTCAACGCT
GGCTTGTTTA TGACCACACT GATGGAAAAC TCGATGGTCG CCCAACTCAG TTGTTTGCAG
TTGTGTGTGG TAGGGCGAAA TACCTTCATC GGCGCGGGCA ACATTTTTAC CGATTTCCAC
CTGCTGAATC GGCCCATCCG CACCTTTCAC CGCTGGAAAG GCGCTGAAAA ACCGGAACTG
GCTGAGGTCG GATTGCCGGT CCTGGGATCT GCGGTCGGTC ACAATGTCAA GATCGGAAGT
GGTTTTGTCG TGTACCCGGC ACGTATGATA GAATCGAATA CTGTGCTGCT CTACTCGGCG
CCGGACACCG CCATCGGGCA CAATGTCGTG CATCTGGCAG GTGATGATGA GGACGATGCC
GACGCAAGCG GCGAGCCGCG CCGTACCGTG TATCGTTGGC CCCATCGGTA TGATCCCGCC
GTTCACGATG GCGACGAAGA GCCGCACGGT CCGCTGCTGA TATCGATGTC GCTTGCGCAA
TCCGTTTCTG CGCGACGCTA G
 
Protein sequence
MKRIIIRERT LIPPFGEPAR DLRVLNKPLW LLQRDLLAPY CRGALEVDSL EEAPPVNEEL 
LVHRDNLFFN EYLIDAFIRE ARKTGRACRI AFARNDAAIV THALHLQEGI RLDERHGVYV
ADLFYYPRGP DADPEPLVID TLPREMGYYH IPSYMARHGD LVFQVPMRAF LSIENWVHLF
LANSPFGVFS WGRIHEQLVE ESWKEKIAVS LTTLAERLNP FAPPWRNHFL SCSRLVKVGK
NCSIDPTAVI HGPTVIGNNV YIGAGVVITN SLIGDNVNIM QGSQVMLSVV SDRCYLPFNA
GLFMTTLMEN SMVAQLSCLQ LCVVGRNTFI GAGNIFTDFH LLNRPIRTFH RWKGAEKPEL
AEVGLPVLGS AVGHNVKIGS GFVVYPARMI ESNTVLLYSA PDTAIGHNVV HLAGDDEDDA
DASGEPRRTV YRWPHRYDPA VHDGDEEPHG PLLISMSLAQ SVSARR