Gene Rcas_0468 details


Gene Information

Locus tag: Rcas_0468
Symbol:
ID: 5537931
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 600017
End bp: 601567
Gene Length: 1551 bp
Protein Length: 516 aa
Translation table: 11
GC content: 60%
IMG OID: 640892631
Product: hypothetical protein
Protein accession: YP_001430617
Protein GI: 156740488
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family
TIGRFAM ID:
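
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS is complete and ends in a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a complete CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(600017, 601567)
print(length_bp)                  # 1551
print(protein_length(length_bp))  # 516
```

Both results match the Gene Length and Protein Length fields listed in the record.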


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.633748
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCTCG TCTCTCGCTT CCGGGTTGAA ATCCTGCTCT TCCTCCTTCT GCTGGCGTGC 
TACGCTTACT TTCCGCCGCG CTGGGCGGAC TGGAATCAGA ACTCGCGGCT GAACCTGACA
CTGGCGATTG TTGATGATGG TTCCTTCCAG ATCGACCGGT TCGTCGCCAA TACCGGCGAT
TATGCGAAGT ACAACGGGCA CTACTACAGC GACAAAGCGC CGGGTACGTC ATTCCTCGCT
GTTCCGGTGT ACGCCGCCGT TCGCCCGTTG CTCCAGACGG CGCCGGTACA GCGGATGATC
GAGCGCGTTG GTTCGTCGGC GGCGTTTGGC GAGACGCTGC GACCCGATGG AAGCGGCCTG
GCGATGGAGA AGGTTTACTT CGCGCTGGTG TTAATGATCG TGTCGTTCGT GACGGTAGCC
GTTCCATCGG CGCTGCTTGG CGTTCTGTTG TATCGTTTTC TCGAACTGTT CGACCTGGGG
GCGAGTTGGC GCGTTGCGCT CGCGCTGATC TATGGGCTGG CGACGCCTGC CTTTCCCTAC
TCAAATGCGT TCGTAGGGCA TCAGCAGGTG GCGGCGATGC TTTTCGTCTC TTTCTGGATG
GCGTTCCTCA TCGGGCAACG ACGCCTGGCG CCGCACTGGT CGCTGCTCAT CGGTGTGTTG
CTGGGATGGA CACTGATCAC CGAGTATCCG GCGGCGCTGA TTGTGGCGGG TGTCGGATTG
TATCTGCTCG TCGTTCTGCC CGACCGTCGC TGGATTGTGG GTGCGGCGCT GGCGGGTGTG
CCGCCGCTAG CGTTGATGAT GGCGTACAAC GACGCAATCT TTGGCACAGT GATGCCGGTC
GGTTACAAGT ACTCGGAGTT GTGGCAGGCT GAGCATCAAT CCGGTTTTAT GAGCCTTGCA
GGACCGAACC GCGAGGCGCT GTGGGGCATC ACCTTTGGCG TACACCGCGG TCTCTTCCTC
CTGGCGCCGG TGTTACTGAT CGGTCTGGTG GGGTTCGTCG CCTGGTGGCG CAGTGGAACG
CACCGCCGCG AACTGGCGGT TTGCGTCTGG GCAGTCGTCA GTTTCCTGCT GTTCAACGGG
TCGTCCGTCA TGTGGAGCGG TGGTTTTGGC GTCGGACCAC GCTACCTTGT GCCGATGTTG
CCGTTTCTGG CATTCGGCAT CGGCGCCTTC GTTGCGACGT GGGGTGCGCA ATGGCGGGTG
CGCGCAGTGT TGGGCGTCAC AGGCGTCTGG TCGTTCCTGA ATGTGTGGGC GCAGACGATT
GGCGGGCAGA GTTTTCCGCA GTATCAGCCT AACCCATTAC TGGACTATTC GCTGCCGGAA
CTGGTTGCGG GCAATGTGGC GCGCAACCTG GGCATGGCGC TGGACCTCGG CGGTTGGGCG
AGTCTGCTGC CGCTGGCGCT GGTGGTGCTG CCGGGGTTGG TCATGCTCTT CCGGTCGGTA
GAAGAAAAAC GATCATTGCA GGTAGAAGGG CGTTGGATTG AGAACCGAGA ACTGAGAACT
GAGAACCGAG AACTGAGAAC TGAGAACCGA GAACTGAGAA CCGAGAATTG A
 
Protein sequence
MSLVSRFRVE ILLFLLLLAC YAYFPPRWAD WNQNSRLNLT LAIVDDGSFQ IDRFVANTGD 
YAKYNGHYYS DKAPGTSFLA VPVYAAVRPL LQTAPVQRMI ERVGSSAAFG ETLRPDGSGL
AMEKVYFALV LMIVSFVTVA VPSALLGVLL YRFLELFDLG ASWRVALALI YGLATPAFPY
SNAFVGHQQV AAMLFVSFWM AFLIGQRRLA PHWSLLIGVL LGWTLITEYP AALIVAGVGL
YLLVVLPDRR WIVGAALAGV PPLALMMAYN DAIFGTVMPV GYKYSELWQA EHQSGFMSLA
GPNREALWGI TFGVHRGLFL LAPVLLIGLV GFVAWWRSGT HRRELAVCVW AVVSFLLFNG
SSVMWSGGFG VGPRYLVPML PFLAFGIGAF VATWGAQWRV RAVLGVTGVW SFLNVWAQTI
GGQSFPQYQP NPLLDYSLPE LVAGNVARNL GMALDLGGWA SLLPLALVVL PGLVMLFRSV
EEKRSLQVEG RWIENRELRT ENRELRTENR ELRTEN