Gene Information |
Locus tag | Namu_4203 |
Symbol | |
ID | 8449829 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 4644356 |
End bp | 4646881 |
Gene Length | 2526 bp |
Protein Length | 841 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645043252 |
Product | glycosyl transferase family 2 |
Protein accession | YP_003203481 |
Protein GI | 258654325 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
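The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length counts the terminal stop codon; neither convention is stated in the record, but both are consistent with the values it reports. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4644356
end_bp = 4646881

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes one stop codon, which encodes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 2526 841
```

Both results agree with the Gene Length (2526 bp) and Protein Length (841 aa) fields.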
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.138699 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGAGCAAGC GCACATCCGA GATGGTGTCC ATCGTCCTGG TCAACTTCCG CGGCGCCGAT GACACGATCA CCTGCATCAG GTCGCTGCGC AAGGTCGACT GGCCGGCCGA GAAGCTGGAA ATCGTCTGCG TGGAGAACGG GTCGGGCGAC GACAGCGCCG CGCGAATCGC CGCGGCCGAT CCTGGCGTGA CGCTGGTGAA GTCCGACGAC AACCTTGGCT TCGCCGGAGG CTGCAACCTG GGCGTCCGGC ATGCGCGGGG CGAGTACGTG GCCTTCTTGA ACAACGACGC GCGGCCCGAC CCGGGCTGGG TGCGGGCGGC GGTCGACGCG TTCAAGACCT CGCCCAACGT GGGTTCGGTC GCCTCCAAGG TGCTCGACTG GGACGGGCAG AAGATCGACT TCGTCGAGGC GGCGATCACC TGGTTCGGCA TGGGGTACAA GCCGTTCTGC GAGTCCCCGG ACACCGGCGC CTTCGACGAG CCGCGGGACG TCCTGTTCGC TACGGGCGCG GCGATGTTCG TGCGGGCCGA CGTGTTCGAC AGCGTCGGCG GCTTCGACGA GCGCTACTTC ATGTTCTATG AGGACGTGGA TCTGGGCTGG CGCCTGAACC TGCTGGGCTG GAAGGTCCGC TACGAGCCGC GCTCGCTGGC TTTCCACAAG CATCACGCAT CAATGAACAA GTTCGGTGCG TTCCGGGAGA CCTACCTGCT GGAGCGCAAC GCGCTGTACA CGATGTACAA GAATCTGGAC GACCGTTCCC TGGCTCAGTT CCTGCCCGGT GCGCTGCTGC TGGCCGTTCG TCGCGCGGTG GCCCGGGGCG AATTGGACAG CACCGAGCTG GACATTCGGC GACCGGGAGA TGACGCCACG CCGGATCGCC CGGTGGCCAA GCAAGCCATG GCCGGGATCT ACGCCATCGA TCAGCTGGTG GAGAACATCA CCTCGTTCAC GGAGACCCGG CAGCTGCTCC AGCAGCGGCG CCGGGTCGGC GACTCCGAGC TGCGGCCGTT GTTCGGCAAG CTGATGGAGC CGGCCTACCC GCTGCCGACT TACCTGGAGG CGCACGAGGA ACTGGTGTCC GCGCTGGGCA TCGACGCGGC CGGACGCAAG AAGCGTGTCG TCATCATCAC CGGTGAGCCG GTCTCCGCGG TGATGGCCGG GCCGGCGATC CGTTCCTGGA ACATGGCGCA GTATCTGAGC CGTGAGCACG AGGTGCGCCT GTTGACGTTC GGTACCGCCG GGGTTCGGCC GGATAAGTTC GAGGTTCTCT CGGTCTCGCC GCGGGACGCG CACGCGGCCG ACGTGCACAT CGACTGGGCC GACGTGATCA TCTTTCAGGG ACACGCGATG GCCGTGTTCC CCGCTCTCTA CGAGACCGAC AAGGTCGTGG TCGTCGACCT GTACGACCCA ATGCATCTGG AGCAGTTGGA GCAGGCGAAG GAGAAGGGGC CCAAGGCCTG GGCCTTCGAG GTGAACTCGG CCACCGAGGT CCTCAATCAG CAGTTGGCCC GCGGCGACTT CTTTTTGTGC GCGAGTGAAC GTCAGCGTCA CTTCTGGCTC GGCCAGTTGG CCGGTGAGGG ACGCCTCAAC CCCCTCACCT ATGCCCAGGA CAATTCGCTG GGCAGTCTGC TGGCCCTGGT CCCGTTCGGA CTTCCGGCCG CGGAACCGGT TCGGACTGCT CCGGCCCTGC GGGGGGTCGT CGACGGCATC GGCGCCGACG ACAAGATCGT GATCTGGGGA GGCGGCATCT ACAACTGGTT CGACCCGCTG AGTCTGATCC AGGCGATTTC CGGGCTCGCT 
CGAACCCATC AGGACATCCG GCTGTTCTTT CTGGGCATGC AGCATCCCAA CCCGGCGGTC CCGGAGATGC AGATGGCGGT GCGCGCCCGG CAGCTGTCGG AGGAGCTCGG CCTCACCGGA CGCCATGTGT TCTTCAACGA GGAGTGGGTG GCCTACAACG CCCGGCAGAA CTACCTGCTG GATGCCGATG TCGGGGTCAG CACGCATTTC GAACACATCG AAACCACCTT CTCTTTCCGT ACCAGGATCC TGGACTACCT GTGGACCAGG CTGCCGATCG TGACGACTCG CGGCGATGGG TTCGGTGATC TGGTCGCGGC CGAGGGTCTG GGCGTCGCCG TGCGCGAGAA CGACCCGCAG GCCCTGGCCG ACGCCCTCGA AATCATGCTG TACGACGACG TCGAACGAGG CCGGGTGATC CGCAATCTGG ATCGGGTCCG GGCCGAGTTC ACCTGGGACA AAACGCTGGC CCCGTTGCTG GAGTTCTGCC GCGATCCGCA CCCGGCCGCC GACCGGGTCT TGCCGGAAAC GATGACGCCG GTGCGAACGG CCGGGCCGAC ACGGGTTCTG GCCGAGCGAG TCCGCTCCGA CCTGGCGATC GCCGGGCGAC ACCTGCGCAG CGGCGGCGTC CGCGAGATGG TGGGCGCGGC GGCCGGCCGC GTGAAGCGGC AGCTGACCGC CCGCAAACGT AAAGCCGAGC GGGCACGGGC GGCGAGGGCG CAATGA
Protein sequence | MSKRTSEMVS IVLVNFRGAD DTITCIRSLR KVDWPAEKLE IVCVENGSGD DSAARIAAAD PGVTLVKSDD NLGFAGGCNL GVRHARGEYV AFLNNDARPD PGWVRAAVDA FKTSPNVGSV ASKVLDWDGQ KIDFVEAAIT WFGMGYKPFC ESPDTGAFDE PRDVLFATGA AMFVRADVFD SVGGFDERYF MFYEDVDLGW RLNLLGWKVR YEPRSLAFHK HHASMNKFGA FRETYLLERN ALYTMYKNLD DRSLAQFLPG ALLLAVRRAV ARGELDSTEL DIRRPGDDAT PDRPVAKQAM AGIYAIDQLV ENITSFTETR QLLQQRRRVG DSELRPLFGK LMEPAYPLPT YLEAHEELVS ALGIDAAGRK KRVVIITGEP VSAVMAGPAI RSWNMAQYLS REHEVRLLTF GTAGVRPDKF EVLSVSPRDA HAADVHIDWA DVIIFQGHAM AVFPALYETD KVVVVDLYDP MHLEQLEQAK EKGPKAWAFE VNSATEVLNQ QLARGDFFLC ASERQRHFWL GQLAGEGRLN PLTYAQDNSL GSLLALVPFG LPAAEPVRTA PALRGVVDGI GADDKIVIWG GGIYNWFDPL SLIQAISGLA RTHQDIRLFF LGMQHPNPAV PEMQMAVRAR QLSEELGLTG RHVFFNEEWV AYNARQNYLL DADVGVSTHF EHIETTFSFR TRILDYLWTR LPIVTTRGDG FGDLVAAEGL GVAVRENDPQ ALADALEIML YDDVERGRVI RNLDRVRAEF TWDKTLAPLL EFCRDPHPAA DRVLPETMTP VRTAGPTRVL AERVRSDLAI AGRHLRSGGV REMVGAAAGR VKRQLTARKR KAERARAARA Q
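The gene sequence starts with GTG while the protein starts with M. Under translation table 11 (the bacterial, archaeal and plant plastid code listed in the record), GTG is an accepted alternative start codon and is read as methionine at the first position. The following is a minimal sketch of that behaviour on the first 30 bases only; the codon table is deliberately partial, and `translate_prefix` is a hypothetical helper, not part of any library.

```python
# First 30 bases of the gene sequence in the record (spaces removed).
prefix = "GTGAGCAAGCGCACATCCGAGATGGTGTCC"

# Partial codon table: only the codons that occur in this prefix.
CODONS = {
    "GTG": "V", "AGC": "S", "AAG": "K", "CGC": "R",
    "ACA": "T", "TCC": "S", "GAG": "E", "ATG": "M",
}

# Start codons accepted by NCBI translation table 11.
TABLE11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_prefix(dna):
    """Translate complete codons; read a table 11 start codon as Met."""
    usable = len(dna) - len(dna) % 3
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    residues = [CODONS[c] for c in codons]
    if codons and codons[0] in TABLE11_STARTS:
        residues[0] = "M"  # initiator codon is read as methionine
    return "".join(residues)


print(translate_prefix(prefix))  # MSKRTSEMVS
```

The output matches the first ten residues of the protein sequence above. Note that the internal GTG (ninth codon) is still read as valine; only the initiating codon is treated as methionine.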