Gene Information |
Locus tag | Rcas_0469 |
Symbol | |
ID | 5537932 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 601840 |
End bp | 603414 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640892632 |
Product | glycosyltransferase |
Protein accession | YP_001430618 |
Protein GI | 156740489 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
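The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based inclusive coordinates (an assumption, though it is consistent with the stated gene length) and that the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 601840
END_BP = 603414
STATED_GENE_LENGTH = 1575      # bp
STATED_PROTEIN_LENGTH = 524    # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; whitespace in the sequence is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


length = gene_length(START_BP, END_BP)
print(length == STATED_GENE_LENGTH)                      # True
print(protein_length(length) == STATED_PROTEIN_LENGTH)   # True
```

Passing the gene sequence from the Sequence section to `gc_fraction` gives a value that can be compared with the GC content field.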
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.171688 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.657168 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTCGTC CAGAAGCGCA CAGCGCTCCA CATCTTCAAA CAACGCCAAT GACCGTTGGC GCCTCTCGGA TTCTTGTGGC AACGCTGTTT ATCGTGGCAG TGTTGCTGGC GACGATCAAT CTGCCATATG CGCCGCGGAC CTGGTTTGAT GAGGGATCGC ACCTGCACGT GCCGAAAGCA TTGGTGCAGT ACGGCAAGTA CGCCGACATC AGCGCCATCC CTGATGGACG CATCGAGTTT CGCTACCACG GACCCACGAT TGGTATCGGT CCGACCATTA TGCTGCCGGT TGCGGCGGTC TACCAGGTGT TCGGTGTCGG TCTGACGCAA GGGCGACTGG TAATTGTGAT CTATTTTGCC ATTGCGCTTG TTGCCGGATA TGCGCTTGCG CGTCGTCTGT ATGATCGCCA GACTGCACTG ATCGCGCTGG CGCTCCTGCT GGCGTCACGC ACGGTCAATT ATGAGGGGCT GATCGAGTAT GGGAGGCAGG TGCTCGGCGA GGCGCCCGGC GTGGCATTCG TCTTTCTGGG AATGCTGGCG TGGCTGACGG CGTTGAAGAC AGCAGCGCAA CCGGCGCTGC GGCATGCGCA CCGGACGTGG AGCATACTGG CGGGGCTGGG GTTTGGTCTG GCGTTGGTCA CCAAGAACCA ATTTGTGCTG ATTATCCCGC CGGCGCTGGC GCTGACAGCG TTGCTCGACT GGCGCTACTA TCGGGCGGGA ACCTGGACGC TGCGCCTGAT TCCGCCGATT GTTGCTGTTG GTTGTTTTGC GCTCTGGACG GTCGTGCAGT TTGCGCTGCT CGGTCCTGGC ACATTCTTTG AAAATCTTCA GCAAACCCGG CAGGCTGCTG GCGGGGCGAT TTTCGTTTTC AACCTGCGTT CGACCCTGCG CGCCGGGTAT TACCTGTTGC GTCCCGATCT GTTCGGCGGG CTGGTTGTGC CGGCGCTGGC ATACACTATC TGGCGCGCGC GCCGCCGCAC ATCGCAGGGG TTGAACGAAG CGCTGCTGGC ACTGATCATT GGTCTCTGGC TGGCGTGGTT CGTCGGCGTT TCCCTCGGCT GGCCCCGCTA CGCCTTTCCG GCAGTCGCAC TGAGCGCTCT GACCGTCGCG CGGCTGGCAT TCGATACGAT TGTCTGGCTG CGCCGCGTGT TGCCGGCGGC AGCAACAATT GCCGCCATCT ACCTGGTTGT CATCATCGTG CTGCCGATGG CACTAACAGT GCGCGTGGTG TTCACGCCCG ATGATAGTGC ACAGCGGTTC GCCGCATATC TGAATGCGAA TGTGCCTGAA TCGGCGATCA TTGCTACCTG GGAGCCGGAA TTGGGGGTGC TGACCGATCA CCGCTATCTC TACCCGCCCC AACCGACCCT GGATCAGGCA GTGCGGCACA CCTGGCTGGG AGGTGATCCG GTGCGCTACG ACTGGTACGC AGATCGACCG GAGTATGTTG TGGTCGGCAG TTTCGGCGGT TATACCGGCG TGTACCATAC GCCCGAACTG GAACGCCACT ATATTCGTGT GGCGCAGATG GGTACGTATG CGTTGTACCA GGTTCGGGCG GGGAGTGGGG AGTAG
|
Protein sequence | MIRPEAHSAP HLQTTPMTVG ASRILVATLF IVAVLLATIN LPYAPRTWFD EGSHLHVPKA LVQYGKYADI SAIPDGRIEF RYHGPTIGIG PTIMLPVAAV YQVFGVGLTQ GRLVIVIYFA IALVAGYALA RRLYDRQTAL IALALLLASR TVNYEGLIEY GRQVLGEAPG VAFVFLGMLA WLTALKTAAQ PALRHAHRTW SILAGLGFGL ALVTKNQFVL IIPPALALTA LLDWRYYRAG TWTLRLIPPI VAVGCFALWT VVQFALLGPG TFFENLQQTR QAAGGAIFVF NLRSTLRAGY YLLRPDLFGG LVVPALAYTI WRARRRTSQG LNEALLALII GLWLAWFVGV SLGWPRYAFP AVALSALTVA RLAFDTIVWL RRVLPAAATI AAIYLVVIIV LPMALTVRVV FTPDDSAQRF AAYLNANVPE SAIIATWEPE LGVLTDHRYL YPPQPTLDQA VRHTWLGGDP VRYDWYADRP EYVVVGSFGG YTGVYHTPEL ERHYIRVAQM GTYALYQVRA GSGE
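The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. This is a minimal illustration, not the pipeline that produced this record. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs only in which codons may act as start codons; because this gene begins with ATG, no special start-codon handling is included. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments; it differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 18 bases of the gene sequence in this record.
print(translate("ATGATTCGTC CAGAAGCG"))  # MIRPEA
```

Applied to the full gene sequence, the result can be compared with the protein sequence listed in this record.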