Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_2693 |
Symbol | |
ID | 4444781 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 3020678 |
End bp | 3022693 |
Gene Length | 2016 bp |
Protein Length | 671 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639690513 |
Product | glycosyl transferase family protein |
Protein accession | YP_832172 |
Protein GI | 116671239 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.316585 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGTCT CAACCGAAAG CCCGAAGGGC ACCGACAGCC AGGAGTCCAG TGAACAGCGG TGGGACACGC TCCAGCGCGT AATCCTGCCG AGTTCCAGCC AGATGGACAC GGTCCCGTTG TACATGGACA TGGGAACGGC CACGGGTGTC CAGCTGCCCA CGGTCGGGGA TCGTGACGGC AAGGCAAGCA AGCCGCAGGC CTTCAGCAGT CCCACGAAGG AAGCCCACGT GGAGGACTTC CTGTCCCGTT TCTCAACGTC CGTGCGGTCC GGGGAACGTG TCTCCTTCGG CAGTTACTTC AACGCTTTTC CTGCCAGCTA CTGGCGGCGT TGGACCAATG TCGAGAAAAT CCGGCTTCAT GTCCGCACCC AGGGTGCCGG GTCCGTCATC GTCTATAAGT CCAACGCCCG CGGATCACTC CAGCGTGTGG ACACCCGCAG GGTTGAGGGA ATCGCGGAGA ACTTCTTCGA TCTATCCCTG GCACCGTTTG GTGACGGCGG CTGGTACTGG TTCGACCTCG TGGCCGGCTC GGAACCCCTC GTCATGCTGG ACGCGGAATG GCAGGGTCCT GCAGCGGACA CCCAGCCTGG TTCGGTGACG CTGCAGATCA CCACCCTGAA CAAGACTGAT TTCTGCCTCA ACAACCTGCG GCTCCTCGCT GAGAACGCCG AGGCGCTGGA GCACGTCAAG GAAATCCTGA TCGTGGACCA GGGTTCGCAG AAGGTTGCGG AAGCGGAAGG CTTCGCGGAG GTCCGTGACT CCCTGCAGGG CAAGCTGCGG ATCATCAACC AGTCCAACCT CGGCGGCTCG GGCGGTTTTG CGCGCGGCAT GTTCGAAGCC GTGGAAAACG GCAGCGATTA CGTGCTGCTA ATGGACGACG ACATTGTCGT GGAACCGGAA AGCATCATCC GCCTGCTGAC GTTTGCGGAC CGCTGCAAGA CGCCGACCAT CGTCGGCGGA CACATGTTCG ACCTGTACAA CCGGACCGTG CTGCATACTT TTGGCGAGAT TGTGAACCCC TACCGGTTCC AGCCGTCGCT ACAGAGCGAA GAGATGATCC TCGGGCACGA TTTCATGTCT TCAAACCTCC GGCAGACGTC CTGGCTGCAC CGCCGCTGCG ATGTGGACTA CAACGGATGG TGGATGTGCC TCATTCCCAC GAAGGTGATT CGCGAAATCG GGCTTTCACT CCCGCTGTTC ATCAAATGGG ACGACTCCGA ATACGGTCTT CGGGCGAAGG CCCACGGCTT CCCAACGGTC TCGCTGCCCG GCTCCGCAGT CTGGCACGTG TCCTGGATCG ATAAGGACGA CCTTGTGGGC TGGCAGGCGT ACTTCCATGC ACGCAACCGT GTCATTGCTG CGCTGCTGCA CAGCCCCTAT GAACATGGCG GACGCGTGGT CCGGGAATCC CAGTACATCG ACGTCAAGCA CTTGGTATCG ATGCAGTACG CCACAGCGCA CGGCCGCGGC TGGGCGCTCG AAGACATCCT GAAGGGCCCG GAGGCCCTGC GGGAGCTGCT CCCGTCCAAG CTGCCGCAGA TCCGGGAAAT GATGTCGGGT TACTCGGACT CCGTCGTGCG CCCGGACCCG GATGACTTCC CTGCACCGAA GATGGACAAG CCCCCGCGCC GGGGTCACGG AATCTCGCAG CCGTCCAAGG TATCGCTGCT GCCGTGGGCC GCCAAGACTG TCATCCGGCA GCTTGCCGCT CCGGTGAGCG GTTCAAGCGC GGAGCGGCCG CAGGCCACCG TGGCCCACCA GGACAACCGC TGGTGGCGGA TGGCTCAGTA CGACAGCGCA ATAGTGTCCA ACGCTGAAGG AACGGGCGCA TCGTGGTACC GGCGGGATCC GAAACAGCTT CGAACGATGC TGGCTGAAAG CGCGCGCCTC CACTCCCAGC TCCTGCAGAA CTGGCCGGCA CTCAGCAAGA AGTACAAGGC CGCAATGAAC GACCTCACGT CGATTGAGTC CTGGAGGAAA ACGTTCGAGC AGCACACTCA GAACGAGATC AAGTGA
|
Protein sequence | MSVSTESPKG TDSQESSEQR WDTLQRVILP SSSQMDTVPL YMDMGTATGV QLPTVGDRDG KASKPQAFSS PTKEAHVEDF LSRFSTSVRS GERVSFGSYF NAFPASYWRR WTNVEKIRLH VRTQGAGSVI VYKSNARGSL QRVDTRRVEG IAENFFDLSL APFGDGGWYW FDLVAGSEPL VMLDAEWQGP AADTQPGSVT LQITTLNKTD FCLNNLRLLA ENAEALEHVK EILIVDQGSQ KVAEAEGFAE VRDSLQGKLR IINQSNLGGS GGFARGMFEA VENGSDYVLL MDDDIVVEPE SIIRLLTFAD RCKTPTIVGG HMFDLYNRTV LHTFGEIVNP YRFQPSLQSE EMILGHDFMS SNLRQTSWLH RRCDVDYNGW WMCLIPTKVI REIGLSLPLF IKWDDSEYGL RAKAHGFPTV SLPGSAVWHV SWIDKDDLVG WQAYFHARNR VIAALLHSPY EHGGRVVRES QYIDVKHLVS MQYATAHGRG WALEDILKGP EALRELLPSK LPQIREMMSG YSDSVVRPDP DDFPAPKMDK PPRRGHGISQ PSKVSLLPWA AKTVIRQLAA PVSGSSAERP QATVAHQDNR WWRMAQYDSA IVSNAEGTGA SWYRRDPKQL RTMLAESARL HSQLLQNWPA LSKKYKAAMN DLTSIESWRK TFEQHTQNEI K
|
| |