Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_1834 |
Symbol | |
ID | 8447439 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 2013191 |
End bp | 2014966 |
Gene Length | 1776 bp |
Protein Length | 591 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 645040963 |
Product | glycosyl transferase family 2 |
Protein accession | YP_003201213 |
Protein GI | 258652057 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.000172396 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.276871 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCTGA ACCAAGAGAA CGCTGATCTG ATCCCGATGG AACCCGACAC CGGCGTGCGC GAGTCCGATG ACCGCCGCCG CGGTGCCCGG GATTCACCCT CCCTGATGAT GCTGGTGCTC GTCGCCACCA TCGGTGTGCT GCTGTACACC ACGTTCCTGT TCGACTTCTC GAACCGGGGC AACTGGCTTC CGTACCTGAT GGTGCTGTCC GCCGAGTCGG TCATCATCTT CCAGGCCCTG ATCGCGCTGT GGACCATCCT GTCCAGCGGT CACAACCCGC GGGGCTACCG CTTCCACAAC GCGCAGAACC GGATCTACGG ACCGAATCAC AAGACTCTGG ATCCGGATCT GGACCTGACC ACGCTGCCGA TGCACCTGCA CGATTCGCCG GTCGAGCTGG ACGTCTACAT CACCACCTAC GGTGAGGACC TCGCGACCAT CCGCCGGACG ATTACCGCTG CGCTGGCCAT GCACGGCAAG CACACCACCT ACGTGCTCGA TGACGGCAAG TCCGACGACG TCCGGGCGCT GGCCGCCGAG CTGGGCGCCG AGTACATCGT CCGTGAGGGC AACGCCGGCG CGAAGGCCGG CAACATCAAC AACGCGCTGA GCGTCACCAC CGGCGAGTTC TACGTCGTGC TGGACGCCGA TTTCGTGCCC AAGGAAGACT TCCTGTACCA GACCGTGCCC TTCTTCGCGG AGACCAATGT GGCCTTCGTG CAGACCCCGC AGGCCTACGG CAACCTGGAC AACCTGATCT CCCGTGGCGC CGGCTACATG CAGTCCGTGT TCTACCGGTT CATCCAGCCG GGCAAGAACC GCTTCAACGC CGCGTTCTGC GTGGGCACCA ACGTGATCTT CCGCCGCAAG GCGATCGAGT CCATCGGTGG CATGTACACC GAGTCCAAGT CCGAGGACGT GTGGACCTCG CTCAAGCTGC ACGAGAACGG CTGGAAGTCG GTCTACATCT CCACCGTGCT GGCCGTCGGC GACACCCCCG AGACCATCGA GGCCTACACC AAGCAGCAGC AGCGCTGGGC GACCGGCGGG TTCGAGATCC TGCTCAAGGC CAACCCGTTC TCCCGCAAGC GCAAGCTGAC CCTGGACCAG CGCCTGCAGT ACTTCGGCAC CGCCACGTTC TACCTGATCG GCATCGCCCC CGGCGTCCTG TTGCTGGTGC CGCCGCTGCA GATCTACTTC GGTCTGGCCC CGATCAACAC CGGCGTCAGC TTCGGCCAGT GGCTGCTGTA CTACGCGGGC TTCTACTTCA TGCAGATCAT CGTCGCGCTG TACACCATCG GGTCCTTCCG CTGGGAAACC CTGATGCTGG CCACCGCCTC GTTCCCGATC TACGGCAAGG CCCTGGTCAA CGCGGTGTTC AAGAAGGACA CCAAGTGGCA CGTGACCGGT GCCCAGCGGC GCAAGGCCTC CCCGTTCAAC TTCATCACCC AGCAGCTGAT GGCCTTCGTC TTCCTGGCCA TCACCTCCGT GGTCGGCATC TGGCAGGCCA TGACGGTCAG CGCCTTCACC CTGGCGCTGT TCTGGAACCT GCTGAACACC TTCATCCTCG GCGCGTTCGT GATCACCGCG TTCCGCGAGA GCCGGCACAA CCGCCGCGAG GAGAAGGGCC TGCCGCCCAA GGGCAGCGCG AAGGTGGCCG CCGAGGCCGC TGCGGCCCGG GCCCTGGCCC AGGACAAGAT CGGTTCCGGC CTGTCTGAGC GGACCTACGA GGGACTGCCC CCGGTGCGGA TCGACACCGC CGCCGGCGCC CGCTGA
|
Protein sequence | MSLNQENADL IPMEPDTGVR ESDDRRRGAR DSPSLMMLVL VATIGVLLYT TFLFDFSNRG NWLPYLMVLS AESVIIFQAL IALWTILSSG HNPRGYRFHN AQNRIYGPNH KTLDPDLDLT TLPMHLHDSP VELDVYITTY GEDLATIRRT ITAALAMHGK HTTYVLDDGK SDDVRALAAE LGAEYIVREG NAGAKAGNIN NALSVTTGEF YVVLDADFVP KEDFLYQTVP FFAETNVAFV QTPQAYGNLD NLISRGAGYM QSVFYRFIQP GKNRFNAAFC VGTNVIFRRK AIESIGGMYT ESKSEDVWTS LKLHENGWKS VYISTVLAVG DTPETIEAYT KQQQRWATGG FEILLKANPF SRKRKLTLDQ RLQYFGTATF YLIGIAPGVL LLVPPLQIYF GLAPINTGVS FGQWLLYYAG FYFMQIIVAL YTIGSFRWET LMLATASFPI YGKALVNAVF KKDTKWHVTG AQRRKASPFN FITQQLMAFV FLAITSVVGI WQAMTVSAFT LALFWNLLNT FILGAFVITA FRESRHNRRE EKGLPPKGSA KVAAEAAAAR ALAQDKIGSG LSERTYEGLP PVRIDTAAGA R
|
| |