Gene Rcas_1068 details


Gene Information

Locus tag: Rcas_1068
Symbol:
ID: 5538534
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 1386147
End bp: 1387262
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 62%
IMG OID: 640893204
Product: glycosyl transferase group 1
Protein accession: YP_001431187
Protein GI: 156741058
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
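
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch and not part of the original record. The function names are hypothetical, and it assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1386147, 1387262)
print(length)                  # 1116
print(protein_length(length))  # 371
```

Both values agree with the Gene Length and Protein Length fields of the record.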


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.378739
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATATTG CGCTCGATCT GCGCTTCGTC AATGATCATT TTCCAGGCAT TGGGCGCTAT 
GCCTTCAGTC TGGCATCGGC ATTCGCCATG CTCGACGGGC CGCACCGGTT CATTTTCATT
GTCCTCCGGG ATGCTCAGGT CCGCCGCTCT GTGCCGAATA CGCGCTACAA TCTGACGAGT
CTGCTTCGTT CACCGAAGGT GCGGGCGCTC GCCGCGCCTT CACCGTTCAG CGTCGCCGGG
CAGATTGCCC TACCCGCCCT GCTGCGCCTG GCGCGCGCCG ATGTGTATCA TACACCCTAC
CATGCGTTTC CCTACACCGG ACTTCCCTGC CCCGTCGTTG TGACGCTGTA CGACATTATC
CCACGCCTCT TCCCGCGTGA GTCGTCGCTG CGCGCGCGCC TGTTCTTTGA TCTGTCGGTC
AGAATGGCGC TACACGCGGC GCAGCGGATC GTGACATTGT CACAACATTC ACGTGCTGAT
CTTGCGCGGG TCTACCATGT GCCCGCTGCG CGGATCGATG TGGTGTACGG CGCTGCTGAC
GAACGCTTCC GTCCTACAGA ATCGAAGCAT GCGGCGGCAA TCCGGGCGCG GCATCAACTG
CCAGAGACGT TTGCGCTCTG TGTGACATCC GACAAACCGC ACAAGAACGT TGATACTCTG
GTGAAGGCGT GGCGACTGTC GGCGACCGCG CAACAGAGTG ATGCGCCCTG GCTGATCCTG
GCGGGCCACC GTTATCGTGG GGCGCTGGCG TTCGATCACC CACGAATCCG GGACCTTGGA
CCGGTTGCTG AAGCCGATCT GCCCGCGCTC TACGGCAGCG CCACTCTTTT TGTCTATCCG
TCGCGCTACG AAGGGTTTGG CTTGACGCCG CTGGAAGCGA TGGCATGCGG CACACCGGTG
ATCTGTAGCC ACGCCGGAAG TCTGACCGAG GTCGTTGGCG ACGCGGCGCT GCTCGTGGAT
CCCGACGATC CGCAGGCGCT GGCGGAAGCC ATCGATCGTG CGTTTGCCGA TCCGACACTG
CGGGCTTCCC TGCGCGCTGC CGGTCTCGCA CGCGCTGCAT CATTCTCCTG GCATCGCGCA
GCACAGGAAA TACTGGCGGT CTATGAGCGT GTGTAA
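
The gene sequence is printed in blocks of 10 bases, so the whitespace has to be removed before any computation. As a hedged illustration, the Python sketch below shows one way to compute a GC fraction from such blocked text. The function name is hypothetical, and the short inputs are only the first block of the sequence and a toy string. Applying it to the full sequence is one way to check the GC content reported in the record.

```python
def gc_fraction(blocked_dna: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    dna = "".join(blocked_dna.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


print(gc_fraction("ATGAATATTG"))  # 0.2 (first block of the gene sequence)
print(gc_fraction("ATGC GGCC"))   # 0.75 (toy input with a block separator)
```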
 
Protein sequence
MNIALDLRFV NDHFPGIGRY AFSLASAFAM LDGPHRFIFI VLRDAQVRRS VPNTRYNLTS 
LLRSPKVRAL AAPSPFSVAG QIALPALLRL ARADVYHTPY HAFPYTGLPC PVVVTLYDII
PRLFPRESSL RARLFFDLSV RMALHAAQRI VTLSQHSRAD LARVYHVPAA RIDVVYGAAD
ERFRPTESKH AAAIRARHQL PETFALCVTS DKPHKNVDTL VKAWRLSATA QQSDAPWLIL
AGHRYRGALA FDHPRIRDLG PVAEADLPAL YGSATLFVYP SRYEGFGLTP LEAMACGTPV
ICSHAGSLTE VVGDAALLVD PDDPQALAEA IDRAFADPTL RASLRAAGLA RAASFSWHRA
AQEILAVYER V
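
The protein sequence can be related back to the gene sequence by translating codon by codon. The sketch below is a minimal Python illustration and not the tool used to produce this record. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in the start codons it permits, so the standard table is used here. The names `CODON_TABLE` and `translate` are hypothetical, and alternative start codons are not handled.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 uses the same
# codon-to-amino-acid assignments (only the allowed start codons differ).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(blocked_dna: str) -> str:
    """Translate blocked DNA text in frame 1, stopping at the first stop codon."""
    dna = "".join(blocked_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence
print(translate("ATGAATATTG CGCTCGATCT GCGCTTCGTC"))  # MNIALDLRFV
```

The output matches the first 10 residues of the protein sequence listed above.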