Gene Rcas_1047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1047 
Symbol 
ID5538513 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp1362723 
End bp1364984 
Gene Length2262 bp 
Protein Length753 aa 
Translation table11 
GC content65% 
IMG OID640893184 
Productglycosyl transferase family protein 
Protein accessionYP_001431167 
Protein GI156741038 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.172621 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGGCA TGGCGAATGT TCCGGTCGAA CCGCGTGCTG CCGCGCAAGC GCAGCGAATG 
AGGACACACT GGCTACGGAT CGCGCCAGGG GCGCTGGCGC TGGCGTTGTT TCTGGCGCTG
GCGCTGGGGC GCGGCGTGCT CGGCGCCGTT GCGTCGCCGT TGCAGGGCGC GCTGGCGTTG
GCGTTGCTGG CATCGATCAT CGTGCAGCCG AGTGCGCTGG CGCGTATGCG TGTCACGCAA
ACCGCCGCGG CGTTGGCGGG CATCCTGACG GTCGCCTTGC TGCTGCGGGT TTGGGGCTTG
CGCTTTGGGT TGCCTTACTT CGAGCACCCC GATGAGTGGG CGGTTGCTGA GGAGGCGCTG
CGCATGCTGC GCACCGGTGA CTATACCCCC TTTTCGTATA CATACCCGAC GCTCTATACC
TATATGCAGG TCGGCGTGGC TGCCGCGCAC TTCATGTGGG GCGCAGGCGC CGGTCTCTAC
CGCTCCCCTG CCGACATCGA CCCGGTACCG TTCTATGTCT GGGCGCGCGC GCTGACGGCG
ATGCTCGGCG CGGGCGCTGT GGCGCTGACG TTCCTGGCGG GGCGGATGCT CTACGGCAAC
GCCGCCGGTC TGCTGGCAAC CGCGTTGATC GCCGTAATGC CCGCCGTCAC CGGCGATTCG
CACTATGTGA CAACCGATAC GCCGGCGATG TTTTTTACGC TGCTGGCATT TCTTGCGATT
GCCCGCCTTA CCGGTGTTGC AGCAGAGCGC ACTCCCTCAG ATGATGCGCC TCCCGCTACT
GCTCCCCCGA TCCTTACGCA GGCTTTCCTG GCAGGCGTCG GCGTTGGTCT GGCGACGGCA
ACCAAATGGA ATGCCGGCTC GCTCGTGATC GCGCTATTGG TCGCTATCGT CTTTGCGGCG
CGACGTTCAA CGCGCAATGG CCATGCTGAT GCTCTCGACG GCGACGTTGC ACTGCGACGT
CGCAACCTTC AACCCTTAAC CGTCAACCTT CTCACCGTTG CTCTTTCCGG TATTTTGTTG
GGTTTCACCC TCGGCGTGCC GTTCTGGTTG CGCGATCTGC CGCGCATTCT GACTGACCTT
GCCGGAATTA TCGCGCATTA TCGGTTTGAA GGGCATCCCG GCGCAGAATC GGATCAACCG
GCGCTGTTCT ACTGGTGGGC ATTGACCCGT GAAGGGACGC TGCTGGCATG GGTGTGCCTG
GGTGGTGTGG CGCTGGCGTT CCTGCGGCGT AAACCTGCCG ATGTGCTCGT GCTCGCCGTC
GTGGTTCCTG CGGTGCTGCA ACTGACTGGC GTCAAGGTGG TGTTCTTCCG CAATGCCATG
CCGCTGCTGC CGTTTCTCTG CATCCTGGCG GCGGCGCTGG TTGTCGTTGC CGTCGAATGG
GTGACTGAGC GACCTGGAGA AACAGGGGAT CGACTGACAG GCGGGCGGGC GTCTGTTGCG
TTCATGCGAC GGCTGGCGGG CAACCGCACG GTGCTGCTGC TGGCGGCAAC CGTCTTGCTG
GCGGCAGAAC CGCTGGCGCA GGCGATCCAC GATGAGGCGC TGCGCGCCCG CCCGACGACG
CGCATACTGG CGGGGGAATG GCTGGAAACG CGCGCGCAGG ATGGTGAGCG CATCTGGCTG
GAGGATAATA CGCTCATCCT TGTACCGCGT TTGCGCGCTG TTGGCGGCGA ACCGGCGGTA
CATCACGACC TGGCCTGGTA CCGCGAGCAG GGCATTCGTT TTGTAGTGGT ACACCTCGAC
CGCGAGACAG GCGCAGCGGC GCTGGCAGCC TTCGGTGAAC CGGCGGCGCG GTTTCTGCGC
GCTGGCGAGC GTCATGGTCC CGAACTGGCG ATTTTCGACA CTGGTGCACC GGATGCTGCT
GCTGAACCGC GCACGCCGTC GGGAGCGACG CTCGGCGCAG GCGCGATTGT GCTGGAGGGA
TACCGCCATC CGGGGCGCGC GCAGGCAGGC GGGGTGTTGT CGCTGGCGCT CTTCTGGCGC
GCGATGCGCC CGCCGCCGCT GGATTATACC GTGTATGTGC ACCTGGTTGA TGAAGCGGGT
GCGAAAGTCG CGCAGCGCGA CGTGCCGCCG CTCGAAGGAC GCCGCCCGAC CAGTCGCTGG
ATGCCTGGCG ATCGTGTGCG CGATGATCAG GACCTCTTTA TTCCTGAAAC CGTTCCGCCC
GGAACATACC GCCTGTTGAC AGGCATGTAT GACGCTGCAA CCATGACGCC GATCAACGAT
GCGGGACCGA TCGATCTGGG GGTGGTTGTG GTTGAACGTT GA
 
Protein sequence
MPGMANVPVE PRAAAQAQRM RTHWLRIAPG ALALALFLAL ALGRGVLGAV ASPLQGALAL 
ALLASIIVQP SALARMRVTQ TAAALAGILT VALLLRVWGL RFGLPYFEHP DEWAVAEEAL
RMLRTGDYTP FSYTYPTLYT YMQVGVAAAH FMWGAGAGLY RSPADIDPVP FYVWARALTA
MLGAGAVALT FLAGRMLYGN AAGLLATALI AVMPAVTGDS HYVTTDTPAM FFTLLAFLAI
ARLTGVAAER TPSDDAPPAT APPILTQAFL AGVGVGLATA TKWNAGSLVI ALLVAIVFAA
RRSTRNGHAD ALDGDVALRR RNLQPLTVNL LTVALSGILL GFTLGVPFWL RDLPRILTDL
AGIIAHYRFE GHPGAESDQP ALFYWWALTR EGTLLAWVCL GGVALAFLRR KPADVLVLAV
VVPAVLQLTG VKVVFFRNAM PLLPFLCILA AALVVVAVEW VTERPGETGD RLTGGRASVA
FMRRLAGNRT VLLLAATVLL AAEPLAQAIH DEALRARPTT RILAGEWLET RAQDGERIWL
EDNTLILVPR LRAVGGEPAV HHDLAWYREQ GIRFVVVHLD RETGAAALAA FGEPAARFLR
AGERHGPELA IFDTGAPDAA AEPRTPSGAT LGAGAIVLEG YRHPGRAQAG GVLSLALFWR
AMRPPPLDYT VYVHLVDEAG AKVAQRDVPP LEGRRPTSRW MPGDRVRDDQ DLFIPETVPP
GTYRLLTGMY DAATMTPIND AGPIDLGVVV VER