Gene Information |
Locus tag | Rcas_2727 |
Symbol | |
ID | 5540213 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 3525212 |
End bp | 3527401 |
Gene Length | 2190 bp |
Protein Length | 729 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640894853 |
Product | glycosyl transferase family protein |
Protein accession | YP_001432816 |
Protein GI | 156742687 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
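The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a stop codon that is not counted in the protein length; both assumptions match the values in this record but are not stated by it. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3525212, 3527401)
print(length)                     # 2190
print(protein_length_aa(length))  # 729
```

Both results agree with the Gene Length (2190 bp) and Protein Length (729 aa) fields of the record.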
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGCGCG GATCGTGGAT GCGCCATAAA CGCAGGATTT CCTGTTCAAT GAACGCTTCG TCGAAGGAAC GCCTGGTCCT GCTACTCATT CTCGCATATG CGCTGGTTCT GCGCCTCCTG CTCTGGAGCC AACCGCTCCA TGAACCAGCC AATGATGAGG TCGAGTACAT TACCGTAGCG CGCGATCTGC TGGACGGGCG CGGCTGGTCG TTCTACGACC GATACCACTG GCTACGCGCG CCACTCTATC CGCTCTTTCT GGCAGCGTCC TGGGGGCTGG CAGGCGATGA CGGCTGGCCC CGCGCCACGC GCGCCCTGCA TTTGGCGGCG CTGCCCAATA TCCTCTTGAG CGTTCTGAGC GTCTCCCTGG CATATGCGCT GACTGCCCGG TTGGTGAACC GACGGGCGGC GCTCCTGGCG GCGCTGATCA CTGCGACGCT CTGGACACAC GCTACATTTG CCAGCCTGTA CATGTCGGAA ACGCTCTTTA CCGTGCTGTT CCAGATCGGA CTACTCGGTC TGGTTCAGGC TGGTGACCGG CAACCAACGG CGAAACGCTG GTTGTTGATC ATTGCGGCAG GCGTGGCGCT AGGGCTTGCG GCGCTCACCC GATCGCTGGC GCTGCTGTTT CTGCCAATTG CCGCAGGCTG GTTGGCAGTT CACCTGCACC GGCAGCGGTC AGGTCTTCCC CATCGACCGG CGACGCCGTT GCTGGCGGCT GTCGTCCTTC TGGCGAGCGC AGGGGCGGTT ATCGCACCCT GGACAATCCG CAACTACCAG GCGTATGGCG GTTTTATTCT CATCGAAACG GGTCTCTCGT ACAACCTATG GGTATTCAAC GAACCACACG AGGATCGTGA GACTATCCAT CGGATCCTTG AAAACATCAG CAACCCGGTG GAGCGGGCGA ACGATGCCAC TGAACGCGGG CTGGAGCGTC TGCGCGAAGA CCCCATGATT ATCGCGCGCA AACTCTGGCC CAACTGGGTC TTTCTGGCGC GCATCAAGCC CATCCAGGAT CGCTTTCTCA TGGAAAGTTA CTACGCGGAT GTCGATCTGC CACTGTTCGT TGCAGCGTTG ATCTTCGATG ATCTGCTGTA CGTCCTGATC GCCTTTGGAG CAATCGTCGG TCTGACGCAC GCGGTGCGCC TGCGGCGACA GGCATTGCAC ACGCATCAAT GCTGTCGTCT GCCGGTGGCT CCTGCCATCC TCTGCCTGCT CTGGATTGGC TACGCAATCG CCACGATGCT GCTGACGCAT GGTGAAGCGC GCTACCGGCA TTTTTTGTTT CCGGCGCTCA TTCCATATGC AGCATGGACG TTCACGGCAC TGAAGCGCGG CGCCGCCTGT TTCGCGCGTG GGCGGGTGAT TGCAATCGCA CCGTTGATGG GGGTATTTCT CTGGACGGTG TTGACCGCTT ACCCATGGGG GTGGGCAGTC GAAAACCTGA CGCGCGGCAG CCGCGCGCTG GCAGGGGATG TCTGGATGGC GCTTGGCGCG CCGGATCGAG CGGCAGACGC CTACCTGAGC GCGCTGGCAG CGAAAGCCAC CCCCGACGGT TGGTTGCGTT TGGGTGCGGC GCACCTGGCG CGCGGCGACA TCGAGCGGGC GCGGGTGGCT TACCGCGCGG CGTGGGATGC GAGCCGCGCC TACTATATTG CCAGCGCGCA CCTTGGCGAT CTGGAGCGTT CGCTTGGCAA CCTGGACGAG GCGCGGCGCG CCTTTGCCGG AGCATATGCC GACGAGCAGC GGGTGCTCGA CTGGTCGTGG CGCATGCTTG GGCGGAATCC GCCAACATCG 
CTGGACGTTG GCGACGGTCT CGATTTCGGG TACGTTGGAG GCGTGTACCC TGCCGAAGAA CTTTTGAGCA GACAGGCGCG CTGGAGCGCC GGGCGGGCAT TGCTGCGACT GGGAGGACTC GACGCGCGCG CCCCGGTAGT GTTGACGCTC CGGCTGGCGG CGCCCCATCC GAACCGACCA AGTATTCCTG TGCGCATCTG TATCACAGGG ACGTGCCGAC AAATTGAAGC GCATGCCGGG TGGCGCACGT ACACCCTGAT GGTCGAGCGA CGCAACACAC CGGCGCTTGT CGAAGTGCAT AGCCCGACAT TCATGGCTGC CGACGGACGG CGACTGGGGG TGATCATCGA TTGGGCGGCG ATGATTCCAC TTACCGGGAG AGAGCGCTGA
|
Protein sequence | MRRGSWMRHK RRISCSMNAS SKERLVLLLI LAYALVLRLL LWSQPLHEPA NDEVEYITVA RDLLDGRGWS FYDRYHWLRA PLYPLFLAAS WGLAGDDGWP RATRALHLAA LPNILLSVLS VSLAYALTAR LVNRRAALLA ALITATLWTH ATFASLYMSE TLFTVLFQIG LLGLVQAGDR QPTAKRWLLI IAAGVALGLA ALTRSLALLF LPIAAGWLAV HLHRQRSGLP HRPATPLLAA VVLLASAGAV IAPWTIRNYQ AYGGFILIET GLSYNLWVFN EPHEDRETIH RILENISNPV ERANDATERG LERLREDPMI IARKLWPNWV FLARIKPIQD RFLMESYYAD VDLPLFVAAL IFDDLLYVLI AFGAIVGLTH AVRLRRQALH THQCCRLPVA PAILCLLWIG YAIATMLLTH GEARYRHFLF PALIPYAAWT FTALKRGAAC FARGRVIAIA PLMGVFLWTV LTAYPWGWAV ENLTRGSRAL AGDVWMALGA PDRAADAYLS ALAAKATPDG WLRLGAAHLA RGDIERARVA YRAAWDASRA YYIASAHLGD LERSLGNLDE ARRAFAGAYA DEQRVLDWSW RMLGRNPPTS LDVGDGLDFG YVGGVYPAEE LLSRQARWSA GRALLRLGGL DARAPVVLTL RLAAPHPNRP SIPVRICITG TCRQIEAHAG WRTYTLMVER RNTPALVEVH SPTFMAADGR RLGVIIDWAA MIPLTGRER