Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_1232 |
Symbol | |
ID | 4446261 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 1352896 |
End bp | 1354341 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639689040 |
Product | glycosyl transferase family protein |
Protein accession | YP_830726 |
Protein GI | 116669793 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGATAGCCG TATACATCGT CCTTGTCCTT GGAATCAGCA CGATCTTCTG GTCGTTTGTG GGCTTGTTGC GCCTGGCCAA TGAGCAGTAC ACCCGGCGGA TCACGCTGGG CCCGCCGCTG AGCGGGCGCA TGCTGGGGCT CCTGCAGCGC TGGCCGCTGG CCAACCGGCT CGGCCCCGGA GTCCATGGCG CACATACGTT GCCGGCGCCC AGGGCACACC GGGGCCAGAT CCAGCGGGAG CATGGCCGCC ATCGCGCCAG GGACCCCAGG ATCCTGCCCG CAAATGTCGC GGTGCTTATT GCCGCCCACA ACGAAGCACT TGTCATCAGG GAGACCATCC GGGCCGCCTC GGCGCTGGTG CCGCTCAGGA ACATCCACGT GATTTCGGAC ATGTCCACGG ACGATACCTC GGCGATTGCC CGGGCAGCGG GGGTCAAAGT GCTGGACCTC GAACCGAACA GGGGAAAGGC CGGCGCATTG GCCGCCGGAA TCGCTTATTT TGAGCTGTGC CGGAAGTTCA AAGTGGTGAT GCTGCTTGAC GCGGATACCC GGCCCACCGC GGATTACCTG GAGACCGGGC TGCCGCTCTT CCTCGATCAC TCTGTGGTGG CTGTGGCCGG AAGGGCAAAG TCCATCATGA CCCCGCCTCC ACCCACCGCG CTGGGCCGGT TCCTGGTGGC ATATCGGGAA CGGCTGTACA TCGTGGTGCA ATTGCTTCTC AAATACGGCC AGGCCGCCAG GGGCGCAAAC GTTGTCTCGA TCGTGCCGGG CTTTGCCAGC ATGTACCGGA CGAGCGCACT CGCGAAAATC AAGGTGCTGG CCCCGGGCCT GGTCATTGAG GACTTCAACA TGACCTTCGA GATCCATGCC AAGAAATTAG GGCGGATTGC GTTCCACCCC TCAGCCGCCG TGGCATACAC CCAGGATCCG GACAACCTGC AGGACTATAC GCGGCAAGTC CGGCGTTGGA TCCTCGGTTT CTGGCAGACC GTCCGGCGGC ACCGCCTGCA GTCCGGAAAG TTCTGGTTTG TTCTGGTGTT CTACATCATC GAACTCGTTA TCAGCTGCCT GTTCTTCGTC TTGCTGATCC CGGTCTTCCT GCTCTCGCTG GTGGCGTCCA TGCAGCTCCA GGCCTTCGGT GACAACGGCG AATCCTTCCT TTACCTGTCC GGTCTCATGC GCCCGCAGGA CGTCCTCTTG GGTGTCCTCA TTCCGGACTT CCTGCTCACC GTCATCGCGG CCCTTGCCCT CCGCCGGCCT GGCATCCTGC TGATGGCGCC ACTCTTCCCG CTCATGCGGA TCCTGGACTC GGTGATCTGC CTGATGGTGC TGCCCCGGGC CTTCTCTGCG GCCTCATCGG GCGTGTGGGT CAGTCCGATG CGGCGCATCC AGGGCGAAGG CCTCGAGCTG GAAGAAGCCA CGGCGGGTGC CGGCATCACC CGTTAG
|
Protein sequence | MIAVYIVLVL GISTIFWSFV GLLRLANEQY TRRITLGPPL SGRMLGLLQR WPLANRLGPG VHGAHTLPAP RAHRGQIQRE HGRHRARDPR ILPANVAVLI AAHNEALVIR ETIRAASALV PLRNIHVISD MSTDDTSAIA RAAGVKVLDL EPNRGKAGAL AAGIAYFELC RKFKVVMLLD ADTRPTADYL ETGLPLFLDH SVVAVAGRAK SIMTPPPPTA LGRFLVAYRE RLYIVVQLLL KYGQAARGAN VVSIVPGFAS MYRTSALAKI KVLAPGLVIE DFNMTFEIHA KKLGRIAFHP SAAVAYTQDP DNLQDYTRQV RRWILGFWQT VRRHRLQSGK FWFVLVFYII ELVISCLFFV LLIPVFLLSL VASMQLQAFG DNGESFLYLS GLMRPQDVLL GVLIPDFLLT VIAALALRRP GILLMAPLFP LMRILDSVIC LMVLPRAFSA ASSGVWVSPM RRIQGEGLEL EEATAGAGIT R
|
| |