Gene Rcas_3884 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3884 
Symbol 
ID5541390 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5081313 
End bp5082392 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content64% 
IMG OID640895995 
Productglycosyl transferase group 1 
Protein accessionYP_001433938 
Protein GI156743809 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.242445 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACTCG TCTGGCATTC ATCATTCGCA TCACTCACCG GCTACAGCGG CTCATCGCTT 
GCTTTCGTCC TGGGGCTTGA TGCGCGCGGC GTGGCGGTTC GTCCGCTCTA CCTGTACGGC
GCCGACCGCG ATGAGCATGT GATGATGGGG CGCATCCATC CGCGCATTGC CGAGTTACAG
CGCGCTCCGG TGCGTTTTGA TGCGCCGCAG GTCGTGTATG CGCCGGGTGA CCGATTCTCG
AAGAACAGCG GGCGCTACCG CATCGGCTTC ACCATGCTCG AATTTGATCG TCTGCCGCAG
GAATGGGTGC AGCAGGCCAA TCAGATGGAC GAAGTCTGGA CGCCGACTGC CTGGGGCGCC
GACGTGTTTG CGGCAAGCGG CGTCACCCGC CCGATCTTCG TTGTGCCGCT TGGCGTGGAT
TCAGGACGCT TTGAACCGGG AGAACCGCGC GCGCATTTGA CCGACCGCAC GGTATTCCTC
TCGGTCTTTG AGTGGGGACC GCGCAAAGGA TGGGATATTC TGCTGCGCGC CTACCGCGCA
GCCTTTCGTG CCGGTGATCC GGTTGTGCTG GTGTTGAAGA TCGACTGCCG CGCCCCTGGC
GAGAATCCTG TGCGTGAACT GGCGACGCTG TTGCCCATGC CATCACCGCC GGTTGTGCTT
CTCTACAACC GTTCCCTGGA CGCGCAGCGT ATGGCGGAAC TCTACCGCAG CGCCGACTGC
TTCGTGTTGC CGACGCGCGG GGAGGGGTGG GGCATGCCGA TCCTCGAAGC GATGGCGTGC
GGCATCCCTG CAATTGCAAC CGACTGGAGC GGACCGACGG CGTTTCTCAG CCGCGAGAAT
GGCTATCCAC TGCCGATTCG CGGTCTCGTT CCCGCCGATG CTGGCGGCGC CTACGGTATT
GGCGCGCAAT GGGCAGAGCC GGACGCCGAT GCCCTGGTTG ATCTGCTGCG TCAGGCGGTG
CAACACCCCG ATGAGCGCCG CCGCAAAGGG CTGCGCGCTG CCGCCGACGC CAACCGCTGG
ACGTGGGATC GCGCAGTGGA ACGGGTCTGT GCGCGTTTGA AGGAAACCGG AATCTGGTGA
 
Protein sequence
MELVWHSSFA SLTGYSGSSL AFVLGLDARG VAVRPLYLYG ADRDEHVMMG RIHPRIAELQ 
RAPVRFDAPQ VVYAPGDRFS KNSGRYRIGF TMLEFDRLPQ EWVQQANQMD EVWTPTAWGA
DVFAASGVTR PIFVVPLGVD SGRFEPGEPR AHLTDRTVFL SVFEWGPRKG WDILLRAYRA
AFRAGDPVVL VLKIDCRAPG ENPVRELATL LPMPSPPVVL LYNRSLDAQR MAELYRSADC
FVLPTRGEGW GMPILEAMAC GIPAIATDWS GPTAFLSREN GYPLPIRGLV PADAGGAYGI
GAQWAEPDAD ALVDLLRQAV QHPDERRRKG LRAAADANRW TWDRAVERVC ARLKETGIW