Gene Rcas_2472 details

Gene Information

Locus tag: Rcas_2472
Symbol:
ID: 5539953
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 3178177
End bp: 3179436
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 62%
IMG OID: 640894602
Product: glycosyl transferase group 1
Protein accession: YP_001432570
Protein GI: 156742441
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
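
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TAA). The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon yields one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3178177, 3179436)
print(length)                     # 1260
print(protein_length_aa(length))  # 419
```

Both values agree with the Gene Length and Protein Length fields of the record.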


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGATTC TCTACTTCAT ACCACGTTAC GATCCGGCAC TGATGGGCAA TCGCATTCAC 
GCCGAGGTAA TCGATGCCTG GCGTGACCAT GGGATCGATG CCGAGGTGAT CACGCTCGCC
GCAGGCATTG CTCGCCTGAG CAGCGAGGTC CAGGAAGGCA TCGTCGTTCA TCGGCTGCCG
GTCAGTTCCG CTATGGCGCT GAAGGGTCTC AACCGCGCGC TGGCGTTTCC AACCGGATAC
CCCTATCTGG CGGGCGCACT TGTCCACTAC CGGCGCTTCA TTGCAACGCG CCGCTACGAT
CTCGTGCATG TCGAGACGGC ATTTCCGCTT GGTCTGGTCG CTGCACTGAC ACTGCGCTCG
ATCCATCCGC CGCTCGCCGT GACGCTGCCC GGCGCCGACA TTATGAGCGT GCCGGAGTTC
GATTATGGGT ACGCCCGATT TCTTGCGGTG CGTGTGCTGC TTCCTTTTGT GTTTCGACGC
AGCGCTACGC TGCGCGCCGA TTCGCCGCAG ATCCGCTCGC TGGCAGTGCG ATTGGGCGCG
CCACCCGCGA AAGTGGTCGC TATTCCCTAC AATATTACTG CTGACAGTTA TCCCCCTGCT
GGCGTCGAGA TCGAAACACT TCGCCGGCAG AGCAGAAACG ACATCTGCGC ACGCTACAAC
CTCGATCCCT CCCGTCCAAT TATTGTCAGC CTGAACCGCC TGCATCCGTT CAAGGGGATT
GAATATCTGG TGGAGTCCGT TCCTCATATC CGCGCTGGCG GCATCGATCC ACAGGTGCTG
ATTGTGGGTC CGAATCGCAG CACGCAACGG TTTGGCGACT ATGGCGCATA CCTGCGTCGT
CGTGCAGAGG ATCTGGGAGT CGCATCGGCG GTGATCTTCA CCGGCGGTAT TCCGCACGAT
CAGGCGATGG CGCATCTGGC TGCCGCCGAT GTTGTTGTTG TCCCGTCGGT CTCCGAGTCG
TTCAGCCGGG TAGTGGTCGA GGCGGCGGCG GTCGGCACGC CGCCGATTGT CACATCGACA
ACCGGCGTCA GTGAGTACGT CGCTGCGTCC GAATGCGGGA TCGTTGTGCC GCCACGCAGC
GGGGAAGCGA TCGGCGCGGC ATTGGTGCGC CTGCTGCGCG ATCGATCGCT GTGGGAAACG
TATGCGCGTC GGGGACCCAC AATGGCGGCG GCGTTCAATT CGCGCACGAT CGCCGAACAA
CTGTTGTGCC TCTATGCGCC ATTCCTGTCG GGCAAAGGAG AGACGGCGCC TGCCGGCTAA
 
Protein sequence
MTILYFIPRY DPALMGNRIH AEVIDAWRDH GIDAEVITLA AGIARLSSEV QEGIVVHRLP 
VSSAMALKGL NRALAFPTGY PYLAGALVHY RRFIATRRYD LVHVETAFPL GLVAALTLRS
IHPPLAVTLP GADIMSVPEF DYGYARFLAV RVLLPFVFRR SATLRADSPQ IRSLAVRLGA
PPAKVVAIPY NITADSYPPA GVEIETLRRQ SRNDICARYN LDPSRPIIVS LNRLHPFKGI
EYLVESVPHI RAGGIDPQVL IVGPNRSTQR FGDYGAYLRR RAEDLGVASA VIFTGGIPHD
QAMAHLAAAD VVVVPSVSES FSRVVVEAAA VGTPPIVTST TGVSEYVAAS ECGIVVPPRS
GEAIGAALVR LLRDRSLWET YARRGPTMAA AFNSRTIAEQ LLCLYAPFLS GKGETAPAG
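
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the method used to produce the record: it builds the codon-to-amino-acid mapping from the standard NCBI ordering (translation table 11 assigns the same amino acids as the standard code and differs only in which codons may act as starts), strips the 10-base spacing used above, and stops at the first stop codon. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G) paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for 10-base blocks and line breaks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGACGATTC TCTACTTCAT ACCACGTTAC"
print(translate(first_blocks))  # MTILYFIPRY
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` allows the same comparison against the complete protein sequence and the GC content field.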