Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0391 |
Symbol | |
ID | 9244229 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 481257 |
End bp | 482630 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | glycosyl transferase group 1 |
Protein accession | YP_003678345 |
Protein GI | 297559371 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.380998 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGACCCAGT CGGGCACCAC ACCGAGCCCA CGGCCCCGTG CCGAGCGCCC CCTGCGCGTC GCCCTGCTGT CCTACCGCAG CAAGCAGCAC GTCGGCGGAC AGGGCGTGTA CGTCCGCCAC CTCTCGCGCG AACTGGCCGC CCTGGGCCAC GAGGTCACCG TCCTCTCCGG CCAGCCCTAC CCCGTCCTGG ACGAGGGCGT GACCCTGGAG AAGGTCCCCT CCCTGGACCT GTACAACGAC GCCCACCCCT TCAAGGCCCC GCCCGTACGC GAGTGGCGCG ACTGGATCGA CGCCCTGGAG GTCGCCACCA TGTGGACCGC CGGGTTCCCC GAGCCCCTCA CCTTCTCCCT GCGCGCCAAC CGCGAACTGC GCCGCCGCCT GGACGACTTC GACGTCGTCC ACGACAACCA GACACTGGGC TGGGGACTGC TCGGCATCAG GTCCGCCGGG CTGCCGCTGG TCACCACCAT CCACCACCCC ATCAGCGTGG ACCGCAGGAT CGAACTGGCC GAGGCCCGGG GCCTGCACAG GCTCACCAAG CGCCGCTGGT ACGGGTTCGT GGGCATGCAG GCCAGGGTCG CCCGCCGGCT CGACCCGATC CTGGTGCCCT CCCAGTCCTC CGCCGACGAC ATCGCCCGCG AGTTCGGCGT CGCCCCCTCC GCCATGGAGG TCACCCCGCT GGGCGTGGAC ACCCGCCACT TCCACCCGCG CCCCGCACTG GAGCGCGTAC CGGGACGCAT CGTGTGCACC GCCAGCGCCG ACAGCCCCCT CAAGGGTGTG GCGGTCCTGC TGCGCGCCGC GGCCAAACTC GCCACCGAAC GCGACATCAC CCTGACCGTC GTCAGCAGAC CCAAGCCCGG CGGCCCCACC GACCAGCTGG TGGACGAGCT GAGCCTGCGC GACCGCGTCG AGTTCGTCAG CGGCATCGAC GACACCGCCC TGGCCGAACT CATCGCCAGC GCGCAGGTGG CGGTCGTCCC GTCCTTCTAC GAGGGGTTCT CCCTGCCCGC GGTGGAGGCC ATGGCCTGCG CCACGCCGCT GGTGGCCAGC CACGCCGGAG CGCTGCCCGA GGTCGTGGGC ACCGACGGGG ACGCCGGGCG CCTGGTGCCC CCGGGCGACC CCGAGGTGCT CGCCGAGGCG CTCGCCGCCC TGCTCGACGA CGACGCCGAA CGCGAGCGCA TGGGCGCCGC CGCCTGGCGC CGGGTCCAGG AACGCTTCAC GTGGAGAGCC GTCGCCGAGC TGACCGCGCG CCGCTACGCC TCCACCATCG ACGCCGTGAG GGGCAACCGC CCCGTCGACG GCGACACGCG CCCCGGGCCC GCCGAGCCCG CGTCCCCGCC AGCACCCGAG CGGGCCCGCG CGAACGGCGC CTGA
|
Protein sequence | MTQSGTTPSP RPRAERPLRV ALLSYRSKQH VGGQGVYVRH LSRELAALGH EVTVLSGQPY PVLDEGVTLE KVPSLDLYND AHPFKAPPVR EWRDWIDALE VATMWTAGFP EPLTFSLRAN RELRRRLDDF DVVHDNQTLG WGLLGIRSAG LPLVTTIHHP ISVDRRIELA EARGLHRLTK RRWYGFVGMQ ARVARRLDPI LVPSQSSADD IAREFGVAPS AMEVTPLGVD TRHFHPRPAL ERVPGRIVCT ASADSPLKGV AVLLRAAAKL ATERDITLTV VSRPKPGGPT DQLVDELSLR DRVEFVSGID DTALAELIAS AQVAVVPSFY EGFSLPAVEA MACATPLVAS HAGALPEVVG TDGDAGRLVP PGDPEVLAEA LAALLDDDAE RERMGAAAWR RVQERFTWRA VAELTARRYA STIDAVRGNR PVDGDTRPGP AEPASPPAPE RARANGA
|
| |