Gene Information |
Locus tag | Rcas_1047 |
Symbol | |
ID | 5538513 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 1362723 |
End bp | 1364984 |
Gene Length | 2262 bp |
Protein Length | 753 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640893184 |
Product | glycosyl transferase family protein |
Protein accession | YP_001431167 |
Protein GI | 156741038 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
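
The coordinate and length fields above are mutually consistent, and a minimal sketch can make the arithmetic explicit. It assumes that Start bp and End bp are 1-based inclusive positions and that the gene length counts the stop codon while the protein length does not; the record does not state either convention, but its numbers agree with both. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Rcas_1047 record above.
length = gene_length(1362723, 1364984)
print(length)                  # 2262
print(protein_length(length))  # 753
```

Both results match the Gene Length and Protein Length fields, which is a quick sanity check to run when copying coordinates between records.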
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.172621 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCCGGGCA TGGCGAATGT TCCGGTCGAA CCGCGTGCTG CCGCGCAAGC GCAGCGAATG AGGACACACT GGCTACGGAT CGCGCCAGGG GCGCTGGCGC TGGCGTTGTT TCTGGCGCTG GCGCTGGGGC GCGGCGTGCT CGGCGCCGTT GCGTCGCCGT TGCAGGGCGC GCTGGCGTTG GCGTTGCTGG CATCGATCAT CGTGCAGCCG AGTGCGCTGG CGCGTATGCG TGTCACGCAA ACCGCCGCGG CGTTGGCGGG CATCCTGACG GTCGCCTTGC TGCTGCGGGT TTGGGGCTTG CGCTTTGGGT TGCCTTACTT CGAGCACCCC GATGAGTGGG CGGTTGCTGA GGAGGCGCTG CGCATGCTGC GCACCGGTGA CTATACCCCC TTTTCGTATA CATACCCGAC GCTCTATACC TATATGCAGG TCGGCGTGGC TGCCGCGCAC TTCATGTGGG GCGCAGGCGC CGGTCTCTAC CGCTCCCCTG CCGACATCGA CCCGGTACCG TTCTATGTCT GGGCGCGCGC GCTGACGGCG ATGCTCGGCG CGGGCGCTGT GGCGCTGACG TTCCTGGCGG GGCGGATGCT CTACGGCAAC GCCGCCGGTC TGCTGGCAAC CGCGTTGATC GCCGTAATGC CCGCCGTCAC CGGCGATTCG CACTATGTGA CAACCGATAC GCCGGCGATG TTTTTTACGC TGCTGGCATT TCTTGCGATT GCCCGCCTTA CCGGTGTTGC AGCAGAGCGC ACTCCCTCAG ATGATGCGCC TCCCGCTACT GCTCCCCCGA TCCTTACGCA GGCTTTCCTG GCAGGCGTCG GCGTTGGTCT GGCGACGGCA ACCAAATGGA ATGCCGGCTC GCTCGTGATC GCGCTATTGG TCGCTATCGT CTTTGCGGCG CGACGTTCAA CGCGCAATGG CCATGCTGAT GCTCTCGACG GCGACGTTGC ACTGCGACGT CGCAACCTTC AACCCTTAAC CGTCAACCTT CTCACCGTTG CTCTTTCCGG TATTTTGTTG GGTTTCACCC TCGGCGTGCC GTTCTGGTTG CGCGATCTGC CGCGCATTCT GACTGACCTT GCCGGAATTA TCGCGCATTA TCGGTTTGAA GGGCATCCCG GCGCAGAATC GGATCAACCG GCGCTGTTCT ACTGGTGGGC ATTGACCCGT GAAGGGACGC TGCTGGCATG GGTGTGCCTG GGTGGTGTGG CGCTGGCGTT CCTGCGGCGT AAACCTGCCG ATGTGCTCGT GCTCGCCGTC GTGGTTCCTG CGGTGCTGCA ACTGACTGGC GTCAAGGTGG TGTTCTTCCG CAATGCCATG CCGCTGCTGC CGTTTCTCTG CATCCTGGCG GCGGCGCTGG TTGTCGTTGC CGTCGAATGG GTGACTGAGC GACCTGGAGA AACAGGGGAT CGACTGACAG GCGGGCGGGC GTCTGTTGCG TTCATGCGAC GGCTGGCGGG CAACCGCACG GTGCTGCTGC TGGCGGCAAC CGTCTTGCTG GCGGCAGAAC CGCTGGCGCA GGCGATCCAC GATGAGGCGC TGCGCGCCCG CCCGACGACG CGCATACTGG CGGGGGAATG GCTGGAAACG CGCGCGCAGG ATGGTGAGCG CATCTGGCTG GAGGATAATA CGCTCATCCT TGTACCGCGT TTGCGCGCTG TTGGCGGCGA ACCGGCGGTA CATCACGACC TGGCCTGGTA CCGCGAGCAG GGCATTCGTT TTGTAGTGGT ACACCTCGAC CGCGAGACAG GCGCAGCGGC GCTGGCAGCC TTCGGTGAAC CGGCGGCGCG GTTTCTGCGC 
GCTGGCGAGC GTCATGGTCC CGAACTGGCG ATTTTCGACA CTGGTGCACC GGATGCTGCT GCTGAACCGC GCACGCCGTC GGGAGCGACG CTCGGCGCAG GCGCGATTGT GCTGGAGGGA TACCGCCATC CGGGGCGCGC GCAGGCAGGC GGGGTGTTGT CGCTGGCGCT CTTCTGGCGC GCGATGCGCC CGCCGCCGCT GGATTATACC GTGTATGTGC ACCTGGTTGA TGAAGCGGGT GCGAAAGTCG CGCAGCGCGA CGTGCCGCCG CTCGAAGGAC GCCGCCCGAC CAGTCGCTGG ATGCCTGGCG ATCGTGTGCG CGATGATCAG GACCTCTTTA TTCCTGAAAC CGTTCCGCCC GGAACATACC GCCTGTTGAC AGGCATGTAT GACGCTGCAA CCATGACGCC GATCAACGAT GCGGGACCGA TCGATCTGGG GGTGGTTGTG GTTGAACGTT GA
Protein sequence | MPGMANVPVE PRAAAQAQRM RTHWLRIAPG ALALALFLAL ALGRGVLGAV ASPLQGALAL ALLASIIVQP SALARMRVTQ TAAALAGILT VALLLRVWGL RFGLPYFEHP DEWAVAEEAL RMLRTGDYTP FSYTYPTLYT YMQVGVAAAH FMWGAGAGLY RSPADIDPVP FYVWARALTA MLGAGAVALT FLAGRMLYGN AAGLLATALI AVMPAVTGDS HYVTTDTPAM FFTLLAFLAI ARLTGVAAER TPSDDAPPAT APPILTQAFL AGVGVGLATA TKWNAGSLVI ALLVAIVFAA RRSTRNGHAD ALDGDVALRR RNLQPLTVNL LTVALSGILL GFTLGVPFWL RDLPRILTDL AGIIAHYRFE GHPGAESDQP ALFYWWALTR EGTLLAWVCL GGVALAFLRR KPADVLVLAV VVPAVLQLTG VKVVFFRNAM PLLPFLCILA AALVVVAVEW VTERPGETGD RLTGGRASVA FMRRLAGNRT VLLLAATVLL AAEPLAQAIH DEALRARPTT RILAGEWLET RAQDGERIWL EDNTLILVPR LRAVGGEPAV HHDLAWYREQ GIRFVVVHLD RETGAAALAA FGEPAARFLR AGERHGPELA IFDTGAPDAA AEPRTPSGAT LGAGAIVLEG YRHPGRAQAG GVLSLALFWR AMRPPPLDYT VYVHLVDEAG AKVAQRDVPP LEGRRPTSRW MPGDRVRDDQ DLFIPETVPP GTYRLLTGMY DAATMTPIND AGPIDLGVVV VER
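
The sequences above are printed in space-separated blocks of ten characters, so they need normalizing before any computation. The following is a minimal sketch, not the database's own method: it strips the whitespace, computes GC content as a percentage, and checks whether a nucleotide string looks like a complete CDS. The helper names (`normalize`, `gc_percent`, `looks_like_complete_cds`) are hypothetical, and the start codon set is an assumed common subset of those allowed by translation table 11; the stop codons TAA, TAG, and TGA are the standard ones for that table.

```python
def normalize(seq: str) -> str:
    """Remove all whitespace (block separators, line breaks) and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = normalize(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_complete_cds(
    seq: str,
    starts=("ATG", "GTG", "TTG"),   # assumed subset of table 11 start codons
    stops=("TAA", "TAG", "TGA"),
) -> bool:
    """True if the length is a multiple of 3 and the sequence begins
    with a start codon and ends with a stop codon."""
    seq = normalize(seq)
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in starts
        and seq[-3:] in stops
    )


# Toy input in the same ten-character block layout as the record.
toy = "ATGGTTGAAC GTTGA"
print(normalize(toy))
print(gc_percent(toy))
print(looks_like_complete_cds(toy))
```

Passing the full Gene sequence field (both lines joined) to `gc_percent` should give a value comparable to the GC content field, and `looks_like_complete_cds` can confirm that the sequence starts with ATG and ends with TGA as shown in the record.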