Gene Rcas_4271 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4271 
Symbol 
ID5541782 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5513061 
End bp5514263 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content63% 
IMG OID640896378 
Productglycosyl transferase group 1 
Protein accessionYP_001434316 
Protein GI156744187 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.614702 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACGTCG GGATCGATTT TACGGCTGGC GCCTGGCAGG GCGCCGGCAT TGGGCGTTAC 
ACGCGCGAAC TGATCGGCGC CATTCTTGCT CAAAGCCCCG ATCTTCGTTT CACTCTGTTC
TACGCTGCTG GCTTTCCGGG CGCCGCTCCT CCGCCCTATC TGCCTGAGGT GCGCCGTCTC
TGCGCATCGC ATCCGCGTAC CCGCGCTGTG CCCATTCCGC TGCCGCCCCG CCGCCTGACG
CAACTCTGGC ATCGGTTGCG CGTCCCGCTG CCGATCGAAT GGCTTACCGG TCCACTCGAT
ATTCTGCACG CGCCCGATTT CGTGGCGCCG CCGACGCGCG CCCGCACTCT TGTGACCATC
CACGATCTTT CGTATATGGT GCATCCCGAG TGCGCCGTTC CCGGCGTTGC CGCCTTTCTG
CGCGACGCCG TGCCGCGTAC ACTGCGACGC GCTGATGTCA TCGTCGCCGA TTCGGAGTCG
ACCCGGCGCG ATCTCCAGCG CCTGTTGCAC ATCGCTCCTG AGCGTGTGTC GGTGGTCTAT
CCTGCGGTAG ACGAACGGTT TTGTCCCTTG CCGCCGGAGA TGTGCGAGCC GGTGCGCCGG
CGGTTGCGCC TGCCATCGCG CTTCATCCTG TTCGTCGGAA CAATCGAGCC ACGCAAGAAT
CTGGCGCGTC TGCTGGAAGC CTTTGCGCGG ATCGATCCGG CAACACGGCG ACAAGGCAAC
GGTGATCTCT TCCTGGTCAT CGCCGGTCGG CGGGGATGGA TGTACCAACC GGTGTTCGAG
ACTCTTGATC GCCTGGGACT GCGAGATCGG GTGCAGATAC TCGATTTTGT GGCGGATTCT
GACCTGCCAG TGGTGTATAA TCTTGCACAG GCATTCGTGT ATCCTTCGAT CTACGAAGGA
TTCGGCTTGC CGCCGCTGGA AGCGCTGGCA TGCGGAACAC CGGTAGTTAC ATCAGACAAT
TCGAGCCTTC CAGAGGTAGT GGGCAGCGCC GCCCTTCTTG TGCCTGCCGA CGATGTGGCG
GCGCTCACAC AGGGGATGAG CCGCCTGTTG AACGATGACG CCCTGCGTGC TCAACTGCGT
CAGGCGGGTC TGGAGCAGGC GCGACGATTT CGCTGGGAAG CGTCTGCCCG GCAGATGATC
GAACACTATC ACTCGTTGTC AACGGGAGCA TCGCATGAGG CCACAACCAG AGCTCTCCGG
TGA
 
Protein sequence
MHVGIDFTAG AWQGAGIGRY TRELIGAILA QSPDLRFTLF YAAGFPGAAP PPYLPEVRRL 
CASHPRTRAV PIPLPPRRLT QLWHRLRVPL PIEWLTGPLD ILHAPDFVAP PTRARTLVTI
HDLSYMVHPE CAVPGVAAFL RDAVPRTLRR ADVIVADSES TRRDLQRLLH IAPERVSVVY
PAVDERFCPL PPEMCEPVRR RLRLPSRFIL FVGTIEPRKN LARLLEAFAR IDPATRRQGN
GDLFLVIAGR RGWMYQPVFE TLDRLGLRDR VQILDFVADS DLPVVYNLAQ AFVYPSIYEG
FGLPPLEALA CGTPVVTSDN SSLPEVVGSA ALLVPADDVA ALTQGMSRLL NDDALRAQLR
QAGLEQARRF RWEASARQMI EHYHSLSTGA SHEATTRALR