Gene Information |
Locus tag | RoseRS_0334 |
Symbol | |
ID | 5207269 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 427570 |
End bp | 429510 |
Gene Length | 1941 bp |
Protein Length | 646 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640593960 |
Product | glycosyltransferase |
Protein accession | YP_001274716 |
Protein GI | 148654511 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
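
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both are assumptions about the record's conventions, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(427570, 429510)
print(length)                  # 1941
print(protein_length(length))  # 646
```

Under these assumptions the computed values match the Gene Length (1941 bp) and Protein Length (646 aa) fields listed above.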
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.184181 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.020995 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGATAG CGGAGCAGAA CGCTGCGAGC GATGCAAGCG TATGTGCAGA ACACCGGGCT GCTCGTGATG TTTTGAAGGC GCCGATCATC GCGCTGATGC TTGGTGTGCT TGCACTTGCG CCGCGTGTCA TCGGGCTTGC CGATTTTCTC ACCACCGACG AAGCGTACCA CTGGATCCGT TTTACCGAAC GTTTCGATGC AGCAATTTCC GAGGGGCGCT GGGCTGATAC CATTTTCGTC GGGCATCCCG CCATCACGAT GTTCTGGTTG GGGCGCGCAG GATTGGTGCT CGAGCGCGCT GCGCGCGATT TGGGCTGGAT AGGCGCCCCT TCGATGATCG AACATCTGGC CTGGCTGCGG CTGCCGGGGG TGTTCCTGCA GGTGGTGTTT GGGGTAACCA CCTGGATGGT GTTGCGTCGC CTCGTTGATC CGATGGTCGC GCTGGTTGCG GGATTCCTGT GGTCTACATC GCCATATCTG ATTGCGCACG GGCGGGTGCT GCATCTCGAT GCACTGCTGA CGGGGTTGCT CACACTGAGC CTGTTGCTCC TGCTGGTTTC CTTGCGGCAA CAGCAGGCAG GCGCAGGCGG ATGGACAGCG CTGCTCGGTT CCGGTGCGTT GACCGGACTG GCGCTCCTGA CCAAAGGACC GGCGATCATT TTTTTGCCAT TTGCCGGTCT GATGCTGTTC GCTCTTGCGC CTGCGAAAGA CGCCTCGAAC CGACGTGTTT CCGGCGTGGT GTCGGATGTA TTTCGTCGCC TGAGGTATGC GATCGTGCGT TATGGCGTAT GGCTGGGGGT TGCGCTCGGT GTTGCATTCG CCGGATGGCC CGCGCTGTGG GTGACGCCGG AAGCGGCACT GCAAGCCTAT GTGGGTGAAA TTATCTTCAA CGGCGGACGT CCCAACGGCG ATGGGCAGTT CTTCAACGGT CAGGCAGTTG GTGATCCTGG CGTGTGGTTC TATCCGGTCG CCAGTCTGTT CCGCACGACG TCGGTGATGT TCATTGGTTT GGTCGCTTTT GTGGTCTTTG CGGTGATCGA TGGCCGCCGC TTCTTCACGC AACGCGATGC CGTCATTCCT GTCCTGATCG CTTTTGCCGC CTTCTGGACA CTGGTCATGA CGCTGGGTCC AAAGAAGTTC GACCGATATG TCCTTCCGAT CTGGCCGGTG TTGCTCGTGC TGGCGGCAAC CGGAATCGTG CGCGGGTACA ATGCTGCGCG GGCATGGTGC ATCCGGCGTG CGATTGTCGT GCCCCGGGGC GGTGATTTTC TCAAACGCGC GCCTCTGGCG GGGTTGCTGA TAATGGGCGC AATAGAGATC GGTCAGGTCG TCTGGTACCA TCCCTACTAT CTGAGTTACT ACAATCCCTT GTTCGGCGGC GGTGCGGCAG CGCAGCGCAT GTTTCTGATC GGATGGGGAG AGGGTATGGA TCAGGTCGGC GCATGGTTGA GTTCACGCCC TGATATCGGG TACGGACCGG TTATCTCGGC GCTCAGACCA ACGTTGCAAC CGTTCGTTCC GGTCGATGTT CGTGACATCA CCGATCTGGG GAAACTGCCG GTCAACTATG CCGTCGTCTA TCTGGAGTCG ATCCAGCGCG GCGCGCATCC TGATATCTAT CGCCAGTTCG AGCCGATGAC TCCCATCCAT ACAATCACCA TTCATGGCAT CGAATATGCA AAGATCTACC AGTTGCCGCG CCCATACCGG CAGCCGGTCG GCGCGCGCTT CGGCGATGCA ATCATGCTCC ACGGCGTCTC AGTCGAATAT GATCAGAACC ATCTGACGGT CACGCCTTCG 
TGGGGGGCGC TGGCGCCTCC GCAGGGCGAT TACGTCGTAT TCCTTCAGGT GATCGATGCA CAGGGACAGC GGGTTGCCGG TGTGGACGTA CCGCCATCCG GCGTTGGGGG GATGCCGACC GGCGCCTGGC TGCCGGGGTA G
|
Protein sequence | MQIAEQNAAS DASVCAEHRA ARDVLKAPII ALMLGVLALA PRVIGLADFL TTDEAYHWIR FTERFDAAIS EGRWADTIFV GHPAITMFWL GRAGLVLERA ARDLGWIGAP SMIEHLAWLR LPGVFLQVVF GVTTWMVLRR LVDPMVALVA GFLWSTSPYL IAHGRVLHLD ALLTGLLTLS LLLLLVSLRQ QQAGAGGWTA LLGSGALTGL ALLTKGPAII FLPFAGLMLF ALAPAKDASN RRVSGVVSDV FRRLRYAIVR YGVWLGVALG VAFAGWPALW VTPEAALQAY VGEIIFNGGR PNGDGQFFNG QAVGDPGVWF YPVASLFRTT SVMFIGLVAF VVFAVIDGRR FFTQRDAVIP VLIAFAAFWT LVMTLGPKKF DRYVLPIWPV LLVLAATGIV RGYNAARAWC IRRAIVVPRG GDFLKRAPLA GLLIMGAIEI GQVVWYHPYY LSYYNPLFGG GAAAQRMFLI GWGEGMDQVG AWLSSRPDIG YGPVISALRP TLQPFVPVDV RDITDLGKLP VNYAVVYLES IQRGAHPDIY RQFEPMTPIH TITIHGIEYA KIYQLPRPYR QPVGARFGDA IMLHGVSVEY DQNHLTVTPS WGALAPPQGD YVVFLQVIDA QGQRVAGVDV PPSGVGGMPT GAWLPG
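
The sequences above are printed in space-separated blocks of ten residues. A minimal sketch of how such a block-formatted string could be cleaned and its GC fraction computed is given below. The helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two ten-base blocks of the gene sequence, used as a small example.
example = "ATGCAGATAG CGGAGCAGAA"
seq = clean_sequence(example)
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.5
```

Applying the same two helpers to the full gene sequence would give a value that can be compared with the GC content field of the record.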