Gene Rcas_3098 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3098 
Symbol 
ID5540594 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4015214 
End bp4016389 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content64% 
IMG OID640895217 
Productglycosyl transferase group 1 
Protein accessionYP_001433170 
Protein GI156743041 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.1957 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGGCT TCAACGGCGT CCGGTGTCTG GAAGGGAAGT CTGTGACCGA CCGTATTCGC 
ATCTTGCTGC TCATCGAAAC ACTCTGGCTC GGCGGGGCGC AGCGCTTGCT TCCCGGATTG
GTGACCGGCC TCGATCCGCT GCGTTTTGAA GGGCATATCG TGGCGTTGCA CGATGGACCG
CTGCGCCAGG AGTTCGAAGC CGCGCGCCTG CCGCTGACGG TGCTGGGTGC GCGTCGTTTC
TATGAGCCGC GCGTCGTGTC TGCTATCGCT CGCCTGGTGC GCGCGCAGCA GATCGATGTG
ATCCATACCC ATCTGACCGG CGCCGATATT GTGGGGCGAA TGGTTGGCGC ACTCACCAGC
GTGCCGGTCG TTTCGACGAT GCACAATGTT CCGCACGATT ACGATCACCA GAAATGGCAC
CGTCGCGCGC TGCAACGCCT GACTGCGCGG ACGCTCGCTG CGCGCCTGGT GATGGTCGCG
CCGGGGATTG GCGTCGAGTA TATGCGCCGG TGGGGTATTC CGGCTTCGCG CGTGGTTACG
ATCAATAATT CTGTTCCTAT GGAACCGTAT CTGGCGATTG CCGAAGGGGT TGCTCCTCAT
GTTCCGCCGA CGGTGACGAC GATTGGGAGG TTGACCGAGC AGAAGGCGTA CCATCTGCTC
CTCGATGCTG CGCGTCTGGT GGTGCGCATA CGCCCCGACA CCCGCTTCCG CATGGTCGGC
GAGGGGCGCC TGGAGGCGGC GCTGCGTCGG CAGGCGCAGG ACCTCGGCAT TGCGCGTGCC
GTATCGTTCG ATGGATTGCG GCACGACATC CCGGACATTC TGGCGGAAAC GCACGTTTTT
GTGCTCTCGT CGCTCTGGGA AGGGTTGCCG GTGACGGCAG TCGAAGCCAT GGCAGCGGCG
CGTCCGGTTG TACTGACCGA TGTTGGCGGC TGTCGCACCC TGGTGACGCC GGGAGTCGAG
GGGTGGCTTG TGCCGCCGGG GAATGTCGAA GCGCTGGCAG CAGCGCTCCT CGATGCGCTC
GACAATCCTG AGCGCCAGCG CCTGTTTGGT CGGCGTGGAC GCGAGAAAGT GCGCCGCGCA
TTCGGATTGG AACAGTATGT GCGTGGGCAC GAACAGTTGT ATGAATCGCT GGCAGTGGTG
CGCGCTCAGG CGCGCGTCGC GCGCCTGCAC CGCTGA
 
Protein sequence
MPGFNGVRCL EGKSVTDRIR ILLLIETLWL GGAQRLLPGL VTGLDPLRFE GHIVALHDGP 
LRQEFEAARL PLTVLGARRF YEPRVVSAIA RLVRAQQIDV IHTHLTGADI VGRMVGALTS
VPVVSTMHNV PHDYDHQKWH RRALQRLTAR TLAARLVMVA PGIGVEYMRR WGIPASRVVT
INNSVPMEPY LAIAEGVAPH VPPTVTTIGR LTEQKAYHLL LDAARLVVRI RPDTRFRMVG
EGRLEAALRR QAQDLGIARA VSFDGLRHDI PDILAETHVF VLSSLWEGLP VTAVEAMAAA
RPVVLTDVGG CRTLVTPGVE GWLVPPGNVE ALAAALLDAL DNPERQRLFG RRGREKVRRA
FGLEQYVRGH EQLYESLAVV RAQARVARLH R