Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_5534 |
Symbol | |
ID | 9249437 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | + |
Start bp | 725990 |
End bp | 728251 |
Gene Length | 2262 bp |
Protein Length | 753 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | |
Product | transglutaminase domain protein |
Protein accession | YP_003683419 |
Protein GI | 297564446 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0283272 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTCACGC TGACACCGCC ACCCCGCGCC GAGGAGGCTC CGGTCGCTCC CGCGCCGCCG GACCCGCCCC CGGAGGGCAT CCCCGTTGCC GCGGGGCTGG TCATGGCCGC CGCCGCGGGC GGTATCGCCG TCGCCCCCGG ATACGCCGAC CCCGCCCCGG TCCACGTCCT GCTACCCGCC TGCGCCCTGC TCAGCGTCGT GGTGACCCTG CTGCTGCGGC GCCGGACGAG CGCGCTCCGT ACCGCCCTGG CCGGGCTGCC GCTCCCGCTG GCGGTGGTGG CGGCGTGCTC GGTGTGGGTG CCCGGAGAGG GGACCGGGGT GCTCGGCTCC ACCGTGGAGG CGGTCCTGCA CTCGGGGGCG CGGATCCTCA CCAGCGCCGC GCCGACCCCG CTGAACGCGG ACACCCTCGC CCTGCCGGTA CTGGCCACCT GGCTGACCGG CGCCGCCTCC GCTCTGGCCT GGCGCGGACG CCGCCGGGCG CTCGCCCTGC TGCCGGGGCT GCTGCTGCTC GTCGGCGCGG TCGTGCTCAA CGGTCCCGTC GCCCCGCCCG GCTTCCCCGC CATCGGACTG CTCACGGTGT CCTCGGTCAT CCTCATGTCC GCGTCCCGGG AGGAGCACGA CCGCGGCGCC CCGGCGGGCC GGACGGCGCT GAGCGTCGAG GTCGACACCC CCGGCGGCAC CCCGCCGGGA AGACCGCGCC GGGTACTGGT GACCGGGGTC ATCGCCGTGC TGACCGCCAC GGCCACCGTG TACGGCGGAC CCGTCCTGCT CGCCGGATGG GACGCGGAGC CGGGCGACCC CCGCACGGTC GTGAGCCCGC CGATGGACCC CCAGGCGGCG CTCAACCCGC TCGTCTACCT GTCCGGCTGG GCCGCCGACC CCGACGAGCC GCTGCTCACC GTCGCCGCCG ACGAACCCGT CAGCCTGCGC TGGGTCACCC TCTCCGACTT CACCGGCACC ACCTGGCTGC CCGAGGGCGG CTACCGGGCG GCGGGTCAGG AGCTTCCCGA GCCCGTGCCG CCCCTGCCCC ACGCCACCGG GGTCAGCGCC GAGATCACCG TCGGGGAGGA CCTGCCCGGC AGCTGGGCCC CGGTCGTCGG CGCCCCCCGG CGGATCGGCC TTCCCTCCCC GGGCTACAGC GCGCTCTCGG GAACCGTGGT GAACATGGAC GGCGCCGTCG CGGGCGCCCG GTACCAGGTC ACCGGCGACG TCGCCGACTG GCGTCCGGGC GAGCTGGCCG GGGCCTCCAC GCCCGCGGAC GAGATCTTCG ACCGCTACCG CGAACTGCCC GCCGGGGCGC CGTCCGTGCT CAACGACGTC GTCGCCGCGG TCGCCAGCGA GGGTTCCCCG TACCAGCGCG CCAGCGCCCT GGCGGAGTAT CTGAGGCGGT CGCACCGGTT CGACCCCGAG ACCCCCGGCG GCCACGGCTA CGCCAACGTC GCCGCGGTCC TCGCCCCGCC CGGGGCGGAG GGGGGCGCGG GCACCTCCGA GCAGTTCGCG AGCGCCTTCG CCATGCTGGC CCGCGCGGCC GGACTGCCCA GCCGCGTGGC CGTCGGCTTC GGGGCGGGCA CGGAGGACGC CGGCGGTACC CGCACCGTCC GCACGGGCGA CGCCGTGGCC TGGGGCGAGG TCTACTTCGA CGGCGTCGGA TGGGTGCCGT TCGCCGTCAC CCCCGGGGAG GAGGGCGGGG ACGACGCGAG CGGGTCCGCG CGGAGCGAGC GGACCCCGGA GGGAGCCGAT TCCGCGCAGG ACCAGGTCCT GCCCGAGGAC GACGCCCACG ACGTCGCCGC GCCCCGGCGC GAGCCGGAGC GGTTTCCGGT GTGGGGGGTG CCCGCTATCG CGGGCGGTCT GCTCGCCGCC GTGCTCGCCG TCCCCGCGCT GCGCCTGGCC CGCAGCGGGC GGCGGCTGCG CGCGGGCACG CCCGAGCGCC GCGTCCTGGG CGCGTGGCAC GAGCTGCGCG ACGGCCTGCG CCTGTGCGCG TCGGAGCCGC CGCCGGGGCA CACGGTCAGC GACACCGTCG CGCTCGCCCG GAGCCTGCTG CCCGAGACCG CCCAGGCCTG GGCCCACGTG GACCTGGGCA GGCTGGGCCG CACGGTCAAC GGGATCGGGT TCGCCGCGGG GCCGGGCGTC ACCGGGGAGC AGGCGGCCTC CATCGCCGAG GGCGTGCGCC GCCAGCTGCG CGCGCTGCGT CTCAGCAGGT CCCGTACCAG GAGGCTCACC TGGTGGTTCG ACCCCCGTCC GCTGCTGTGG CGGGAGCGGT GA
|
Protein sequence | MVTLTPPPRA EEAPVAPAPP DPPPEGIPVA AGLVMAAAAG GIAVAPGYAD PAPVHVLLPA CALLSVVVTL LLRRRTSALR TALAGLPLPL AVVAACSVWV PGEGTGVLGS TVEAVLHSGA RILTSAAPTP LNADTLALPV LATWLTGAAS ALAWRGRRRA LALLPGLLLL VGAVVLNGPV APPGFPAIGL LTVSSVILMS ASREEHDRGA PAGRTALSVE VDTPGGTPPG RPRRVLVTGV IAVLTATATV YGGPVLLAGW DAEPGDPRTV VSPPMDPQAA LNPLVYLSGW AADPDEPLLT VAADEPVSLR WVTLSDFTGT TWLPEGGYRA AGQELPEPVP PLPHATGVSA EITVGEDLPG SWAPVVGAPR RIGLPSPGYS ALSGTVVNMD GAVAGARYQV TGDVADWRPG ELAGASTPAD EIFDRYRELP AGAPSVLNDV VAAVASEGSP YQRASALAEY LRRSHRFDPE TPGGHGYANV AAVLAPPGAE GGAGTSEQFA SAFAMLARAA GLPSRVAVGF GAGTEDAGGT RTVRTGDAVA WGEVYFDGVG WVPFAVTPGE EGGDDASGSA RSERTPEGAD SAQDQVLPED DAHDVAAPRR EPERFPVWGV PAIAGGLLAA VLAVPALRLA RSGRRLRAGT PERRVLGAWH ELRDGLRLCA SEPPPGHTVS DTVALARSLL PETAQAWAHV DLGRLGRTVN GIGFAAGPGV TGEQAASIAE GVRRQLRALR LSRSRTRRLT WWFDPRPLLW RER
|
| |