Gene Rcas_1210 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1210 
Symbol 
ID5538676 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp1563647 
End bp1564837 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content64% 
IMG OID640893342 
Productmonogalactosyldiacylglycerol synthase 
Protein accessionYP_001431325 
Protein GI156741196 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0707] UDP-N-acetylglucosamine:LPS N-acetylglucosamine transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCATTG TGCGCATTCT GATACTGACA ACTGATGCAG GGAGCGGTCA TCGCAGCGCT 
GCGCAGGCGG TCGAGGCAGC ATTGTTGCAC GTCTATCGTC ACAACGTGCA GGTCACGATT
GCCAATCCGT TGCACGAACC GTCCAGCCCA TCATTGCTCC GTCACGCAGA AGCATTCTAT
CTTTCAACCA TCCAGCATGC GCCGGAACGC TATGATCGGG CGCACACGCT GACCGATGCA
GCCGCGTATG CTGCATTACT GCGCGGCGCA ATGCGTCTGG CGATCGGCGA TGCCTTGCAC
CGCCTGCTAG TGCGTCACGC GCCGGATGTC GTCATCAGTG TGTATCCACT CTTTACGGCG
CTGGTTGCCG ATGCCTATCG CGGCGCGCGC GGGCGCCCCG GGCTGATGAC GGTTGTCACC
GATCTGGGGC ATGTTCATCA CACCTGGTTT TCACCGGTCG ATGATCTGTG TATCGTTCCA
AACGCACAGG TGCGCACTCG CGCACTGAGT TGCGGGCTGA ACCCGCGGCA GGTGCAGATT
GTCGGCATCC CGGTGCATCC GCGGTTTGCT GCGCAGCGCG CCGATCCCGC CACGGTGCGG
CGTGATCTGG GGTGGCGCAC CGATCTGGTA ACCGTGCTGA TCTCCGGCGG GGGCGCGGGG
GTTGGTCCGC TGGCGGAACT GGCGATCGCC GCCGATGAAG CATGTCAGAA CCTTCAAATC
GCAGTGATCG CCGGTCGCAA TAGCGATCTT GCGGCGCGAC TGCGGGCGCG CGAGTGGAAG
AATCCAGTAC ACATCTACGG TTTTGTCCCG CTGGCGGATA TGATGTATGC GGCCGACATC
ATTGCCACCA AGGCCGGCGG GCTAAGCGTC AGCGAAGCGC TGGCGGTCGG GCGTCCGCTC
CTGATCTATG GCTCAGCGCC AGGGCAGGAG GCGGGCAACC TGGAGTATGT GATGCGGCGC
GGCGCGGCGC AGTACACACC GGATGCCGCG CAATTCGTCG CTGCGTTGCA GCGCTGGATT
GCCTGGCCCG CAGCGCGTCA GGCAGCAGCG GACGCTGCTC GATCTGCCGG GCGCCCGCAG
GCGGCATTCG AGATCGCCAG CATGGTGTGG GACCTTGCCA TGTCGCGTGC TGCGGCGCCA
CGTCTGGCGC CGCGCCTGTC GCTGAGCGAA TGGTTGAGTT CGTGGCGGTA G
 
Protein sequence
MRIVRILILT TDAGSGHRSA AQAVEAALLH VYRHNVQVTI ANPLHEPSSP SLLRHAEAFY 
LSTIQHAPER YDRAHTLTDA AAYAALLRGA MRLAIGDALH RLLVRHAPDV VISVYPLFTA
LVADAYRGAR GRPGLMTVVT DLGHVHHTWF SPVDDLCIVP NAQVRTRALS CGLNPRQVQI
VGIPVHPRFA AQRADPATVR RDLGWRTDLV TVLISGGGAG VGPLAELAIA ADEACQNLQI
AVIAGRNSDL AARLRAREWK NPVHIYGFVP LADMMYAADI IATKAGGLSV SEALAVGRPL
LIYGSAPGQE AGNLEYVMRR GAAQYTPDAA QFVAALQRWI AWPAARQAAA DAARSAGRPQ
AAFEIASMVW DLAMSRAAAP RLAPRLSLSE WLSSWR