Gene Information |
Locus tag | Ndas_0884 |
Symbol | |
ID | 9244729 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 1081028 |
End bp | 1083430 |
Gene Length | 2403 bp |
Protein Length | 800 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | transglutaminase domain protein |
Protein accession | YP_003678834 |
Protein GI | 297559860 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
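The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive, and that the CDS span includes the stop codon. Neither convention is stated in the record. The function names are hypothetical and used for illustration only.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp, includes_stop=True):
    """Residue count for an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons; the stop codon,
    if part of the span, does not encode a residue.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if includes_stop else codons


# Values copied from the record (Start bp, End bp).
length = gene_length_bp(1081028, 1083430)
protein = protein_length_aa(length)
print(length, "bp ->", protein, "aa")
```

Under these assumptions the result agrees with the Gene Length (2403 bp) and Protein Length (800 aa) fields.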
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCTGG CCACCCTGGC CTGCCTGTTG ATGGCGATGC CACTGCTGGA CGGCCTCGTA CGGGGCGAGG CCTGGTGGGT CCCGGCCCTC GTGATGACGG TGGCGGTGGG CGGGGTGTCC GCGCTGTACC GCCTCAGCGG CTGGAGCTCC TTCCCGGTGC CGTTCCTCCA GGTGCTGGCG GCGGCGCTGC TGATGACGCC GCTGTTCGCC GGACACGTCG CGCCGCTGGG GCTGCTGCCC TCCGGTGACA CGGCGCTGCA CCTGCTGCGG GTGTTCGACG AGGGCCTGGA GACGATCGAC ACCAGCACGC CCCCGGTCTC CTCCACCGCC GGGGTCATGC TGATCATCGC GCTGGTGTTC GTCCTGTTCG CGATCATCGC GGACTTCCTG GCGGTGACCG CGCGCTGCCC CGGGATGGTC GGCGCCCTGG TGGCCGTGCT GATGGCGGTC CCGCTCATCG TGGACGACGC CGGACTGGGC TGGCCGGCCG CGACCAGCGG CGCCGTCGGC TTCCTGCTGC TGCTCGCGGT GGACGTGTGG GTGCGCGGCC GCGAGTGGGG CGTGCGCGTC CCCGACGGCC ACGACTCCTC GGCCCGTGTG CTGGGCGCGG TGGGCAGGGC GACGACGGTG GCGGTGGCCT CGGCCGCCGC GGTGCTGCTG GCGCTGACGG TGCCGCTCGC GGTGCCGTCG CTGCGCACCG ACGTGCTCCA CACGATGGCC GACGGCACCT ACATCGGCAC CGGCGGGGAC CGCATCACCA CCACGCACCC GCTGGTCTCG CTGCGGCGCG AGCTGGCCTC CTCCTCCGAC CGCACCGTGC TGACCTACCG GACCGACGCC GAGCGGCCCG ACTACCTGCG CACGTACGTG CTGGACGAGT TCGACGGCGC GAACTGGACG ATGACGCCGG TCAGCGCCTC GGGCGACACC CTGGTCGACG GGGAGCTGCC GCTGCCCACC GGGTGGGGCT CGGAGCCCTC AGGCGACGCG GTGACCACGC GGATCTCGAT GGACTCGGAC GCGCCCGGGA TCGACTTCCT GCCGTTGCCG TACTGGGCGC GGACGGTGGA CGTGCCCGGG GAGTGGTACG CCGACCCGGC GAGCCACGTG GTGTTCACGA CCGAGAGCGC GCCCACGGGG CTGAACTTCA CCGTGCAGAC CCTGGGCCGG GAGCCGTCCG CCGAGGAGCT GGCCGACAGC GGCAGCCCGC GGTCGCTGCC GGGCGACTTC CGCGACCTGC CCGGCGACCT GGACCCGCGG GTCCGGGAGC TGGCCGAGGA CCTGACCGAG GACGCGCGGA GCCCCTACGA GCGGGCGGTG GCCATCCAGG ACCACTTCAC GGGGGGCGCG TTCACCTACG ACCTGTCCCC GCCCGCGGTG CCCGACGGCG CCGACCCGCT CGCCCACTTC CTGTTCGGGG ACCGGGTGGG CTACTGCGAA CAGTTCGCCG GGGCGATGGC GGTCATGGCC CGACAGGTGG ACATCCCGGC GCGGGTGGCG GTGGGCTACA CCGCGGGCGA GCAGCAGGGC GACGGCCGCT GGGCGGTGTC CGTGGGCGAC GCCCACGCCT GGCCGGAGCT GTACTTCGAG GGCGCGGGCT GGGTGCGGTT CGAGCCGACC CCCTCCTCCG CCGGGGGGCA GGGCTCGGCG TCGGTGCCCG ACTACTCCCG GGGCGGGCAG GGGGGCGTCG ACGACCCGGA CGGGGCCGCG GAGGAGACGC GGGAGCCCTC GCCCGAGGAG ACGGAGGCCG CGCGGGACGA GGCCACCGCG GAGGCCGAGG AGCCCAGCGG GACCCCCGAG 
GCGGAGCCCG AGGACGAGCC CTCCACGTCG GTGGCCGCCC CGGGCGACGA CGCGCGGGGC GGCCCGGACC TGTCGTGGCT GCCCGCGGCG GGCGCCGCCA CGGGTGTGCT GCTGCTGTTG GCCCTGCCCG CGCTGGTGCG CGCCCTGACG CGCTGGTCCC GGACGGCGTC GCTGACCGGC GGCGCGGGAG CTGCGGGCGC GCACACCGCG TGGCGGGAGC TGCGGGACAC CTGTCTGGAC CTGGGCGGCG CGTGGTCGCT GGCGGAGAGC CCGCGTGCCA CGGCCGAACG CCTGGCCGGT TCCGGTCCGG TGCCCCAGCC CGACGCGGCC CCGGCGGGGC TGATGTCGGG GACCGTTCCG GGCCCGGTGC CGCCGGAGGC CGCGGCGGCG CTGCGGCGGC TCGCCCTGGC CGAGGAGGAG TCGCGCTACG CGCCCTCGCC CCGGACCCCG GAAGGGCTGC GCGAGGACCT GCGGACGGCT CTGGCGGGGC TGACCGGTGT GGTCGGCGCG GGAACGCGCG TCCGCGCGGT GCTGCTGCCC CGGTCGCTGG CGCCGTGGCA CCGTCCGCGC CGCGCGGCCG ACCCCGAGCC CGTGACGCCC TAG
|
Protein sequence | MPLATLACLL MAMPLLDGLV RGEAWWVPAL VMTVAVGGVS ALYRLSGWSS FPVPFLQVLA AALLMTPLFA GHVAPLGLLP SGDTALHLLR VFDEGLETID TSTPPVSSTA GVMLIIALVF VLFAIIADFL AVTARCPGMV GALVAVLMAV PLIVDDAGLG WPAATSGAVG FLLLLAVDVW VRGREWGVRV PDGHDSSARV LGAVGRATTV AVASAAAVLL ALTVPLAVPS LRTDVLHTMA DGTYIGTGGD RITTTHPLVS LRRELASSSD RTVLTYRTDA ERPDYLRTYV LDEFDGANWT MTPVSASGDT LVDGELPLPT GWGSEPSGDA VTTRISMDSD APGIDFLPLP YWARTVDVPG EWYADPASHV VFTTESAPTG LNFTVQTLGR EPSAEELADS GSPRSLPGDF RDLPGDLDPR VRELAEDLTE DARSPYERAV AIQDHFTGGA FTYDLSPPAV PDGADPLAHF LFGDRVGYCE QFAGAMAVMA RQVDIPARVA VGYTAGEQQG DGRWAVSVGD AHAWPELYFE GAGWVRFEPT PSSAGGQGSA SVPDYSRGGQ GGVDDPDGAA EETREPSPEE TEAARDEATA EAEEPSGTPE AEPEDEPSTS VAAPGDDARG GPDLSWLPAA GAATGVLLLL ALPALVRALT RWSRTASLTG GAGAAGAHTA WRELRDTCLD LGGAWSLAES PRATAERLAG SGPVPQPDAA PAGLMSGTVP GPVPPEAAAA LRRLALAEEE SRYAPSPRTP EGLREDLRTA LAGLTGVVGA GTRVRAVLLP RSLAPWHRPR RAADPEPVTP
|
| |
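The relationship between the gene sequence and the protein sequence can be checked with a short translation routine. This is a sketch only, assuming the gene sequence is listed in coding orientation (it begins with ATG and ends with TAG even though the gene is on the minus strand). It uses the standard amino acid assignments, which translation table 11 shares with the standard code; table 11 differs in its permitted start codons, which this sketch does not model. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and final 12 bases of the gene sequence in this record.
head = translate("ATGCCCCTGG CCACCCTGGC CTGCCTGTTG")
tail = translate("GTGACGCCCTAG")
print(head, tail)
```

The two excerpts correspond to the first ten residues (MPLATLACLL) and the last three residues (VTP) of the protein sequence listed above.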