Gene Rcas_1489 details

Gene Information

Locus tag: Rcas_1489
Symbol:
ID: 5538964
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 1904023
End bp: 1905210
Gene Length: 1188 bp
Protein Length: 395 aa
Translation table: 11
GC content: 61%
IMG OID: 640893627
Product: glycosyl transferase group 1
Protein accession: YP_001431601
Protein GI: 156741472
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
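
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Codons in the CDS minus one terminal stop codon (assumed present)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1904023, 1905210)
print(length)                     # 1188, matching "Gene Length"
print(protein_length_aa(length))  # 395, matching "Protein Length"
```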


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.113053
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.618426
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAGATTC TCTACATCGC AAGCGGCATT CCTGTTCCCG GCACGCTTGG CGGCTCAATC 
CATACGTTTG AAGTCGCGCG CGGGCTTGCG CGGCGCGGGC ACGTGGTCGA TATCGTCGCT
GCATCTCAAC CCGGCGTGTT CGAGATTGCA TCGCTGGCGC GCCCGGTGTC GTCTCGTCTC
GATTATGGGC GTCTGCACCA CGTCGATATT CCCAAGACTC TGAGCCTTCT CGCCGCTCCG
GTGATCATGC GATTAGCGCG ATCCCTGAAA CCTGATGTCA TTATGGAGCG GTACTATAAT
TTCGCCGGAG CAGGCATCCT TGCCGCTCGA CGCCTGGGCA TCCCTTCGAT CCTCGAAGTC
AACGCGCTGA TCGTCGATCC GCCGACAGTG CTGAAACGAC GCCTCGATGA TATGCTCGGC
GGTCCCATGC GCCGCTGGGC AGTGGCGCAG TGCCGCTTCG CAGATCGGAT TGTGACGCCA
TTGCACACAA CCGTTCCGCC CGACATTCCG CGCACCAGGA TTGTTGAGTT GCCATGGGGC
GCCAATGTGG AACATTTCTC TCTTGACCGA CGAGCCGCCA GTACTGCGCC GATGCAGCCA
ACCGTCGTGT TTCTCGGCTC ATTCCGCGCC TGGCACGGCG TGCTCGATGC GGTGCGCGCC
GGCGGACTGC TCGTGGAGCA AGGGCGCGTA TGTCGTTTCC TGTTTATTGG CGATGGTCCG
CAACGCGCTA TCGCAGAACG ATTGTCGGCG CACTGGGGCG AACAGTTCAC TTTTACCGGC
GCTGTACCAT ACGATGACGT CCCTTCGCTG CTGGCGCAGG CGTCGGTTGC GGTTGCGCCA
TTCAACACGG CAGCCCATCC TGCGCTCCGC GCTGCTGGAT TCTTCTGGTC GCCGTTGAAA
GTGTTCGAGT ATATGGCTGC GGCGCTCCCG GTTGTGACGA TCAACCTTCC ACCGCTCAAC
CAGATCGTGC GCCACGGCAT CGAGGGGTTG CTCTATCCCG AAGGTGATAC CGGTGCACTG
GCGGCAGCAA TTGCCTACCT GCTCGACCAC CCCGACGAAG CGCGCGCAAT GGGCGCCCGC
GGGCGCGAGC GCGTCATGAT GCACTTCTCG TGGGCGCGGC ACTGCGAGCA ACTGGAACGT
GTTATGAGAG AGATTGTGGA AGAGCGTTGC CCGTCGCACA TCGTATAA
 
Protein sequence
MKILYIASGI PVPGTLGGSI HTFEVARGLA RRGHVVDIVA ASQPGVFEIA SLARPVSSRL 
DYGRLHHVDI PKTLSLLAAP VIMRLARSLK PDVIMERYYN FAGAGILAAR RLGIPSILEV
NALIVDPPTV LKRRLDDMLG GPMRRWAVAQ CRFADRIVTP LHTTVPPDIP RTRIVELPWG
ANVEHFSLDR RAASTAPMQP TVVFLGSFRA WHGVLDAVRA GGLLVEQGRV CRFLFIGDGP
QRAIAERLSA HWGEQFTFTG AVPYDDVPSL LAQASVAVAP FNTAAHPALR AAGFFWSPLK
VFEYMAAALP VVTINLPPLN QIVRHGIEGL LYPEGDTGAL AAAIAYLLDH PDEARAMGAR
GRERVMMHFS WARHCEQLER VMREIVEERC PSHIV
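
The gene and protein sequences above are related through translation table 11 (the bacterial code), under which an alternative start codon such as GTG is read as methionine at the first position. The following is a minimal sketch of that relationship, not the database's own pipeline. The helper names (`clean_sequence`, `gc_content`, `translate_table11`) are hypothetical, and the start-codon set is taken from the NCBI definition of table 11 as an assumption.

```python
import re

BASES = "TCAG"
# Amino acids in the conventional NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

# Start codons assumed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean_sequence(text: str) -> str:
    """Strip the spaces and line breaks used for 10-letter blocks."""
    return re.sub(r"\s+", "", text).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a cleaned sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_table11(dna: str) -> str:
    """Translate a CDS; a recognized start codon at position 0 becomes M."""
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, as listed in this record.
print(translate_table11("GTGAAGATTC TCTACATCGC AAGCGGCATT"))  # MKILYIASGI
```

Passing the full gene sequence through `clean_sequence` and `translate_table11` should allow the result to be compared directly with the listed protein sequence once its block spacing is removed in the same way.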