Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0619 |
Symbol | |
ID | 9244461 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 759467 |
End bp | 761122 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | glycosyl transferase family 2 |
Protein accession | YP_003678572 |
Protein GI | 297559598 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGACCACGC TTGACATGAC TCCGGAACCG GTCTCCGGCG ACCGCGCGAC CGCGCCGATC CTGCCCCGCG CGGAGATCGA CTACCGAGAC CAGCCCCGCG CGCGGCGCAA CGACTACTCC GTGCTCCAGC CGCCCGCGAT CGGCGAGTGG ACCCCGACCC TGTCGGTGTC GGTGGTCATC CCCGCCCACG GGCACCAGGA AAAGCTCGAA CTCGTCCTGG CCTCCCTGTC CGCGCAGAGC TACCCCGCGC ACCTGATGGA GGTGATCGTG GTGGACGACG GCACGCCCGA ACCCCTCACG CTCCCACCGG TGCGCCCGGA GAACACCCGG CTGATCACCT CGGCCCCCGG CGGCTGGGGC TCGGCGCACG CCGTCAACAG CGGCGTGGCC GTCTCCTCCG GCCAGGTGGT CCTGCGCCTG GACGCGGACA TGCTGGTCTA CCGCGACCAC GTCGAGTCGC AGATGCGCTG GCACCACCTG GTCGACTACG GGGTCGCCCT GGGCCACAAG ATGTTCGTGG ACTTCGACCC GAAGGCCATG ACCGCCGAGT ACGTGGCCAC CGAGGTGCGC GAGGGACGCG CCGCCCAGCT GTTCGACCGC GAGAGCGCCG ACCCGCACTG GGTGGAGCAG ACCATCGACG GCAAGGACAA GCTGCGGACC GCCGACCGCC TGGCCTACAA GGTGTTCATC GGCGCCACCG GCTCCCTGCA CCGCACCCTG TTCGACGCCG CGGGCGGCCT GAACGGGGAG CTGGTCCTGG GCGGCGACAG CGAGTTCGCC TACCGGGTCT CCCAGCAGGG CGCCCTGTTC GTCCCCGACC TGGACACCAG CAGCTGGCAC CTGGGCCGCA CCCAGATGCA GACACGCCGC GACGCGGGCA CCCGCTACCG CGCCCCCTTC GTGGCCAACC GGGTGCCCGA CTTCCACCTG CGGCGCAAGC GCCCCGACCG GCAGTGGGAG GTCCCGCTGG TCGACGTGGT GATGGACGTG GACGGCGCCA CCCTGGAGGA CGTGGACACC ACCGCGTCCG CGCTGCTGTC CGGTACCACT CCCGACATCC GGCTGTGGCT GGTCGGCCCC TGGGACGTGC TGGACGAGGG GCGGCGCTCG CCGCTGGACG AGGAGCGGCT CGACCTGCGG CTGATCCGGG AGACCTTCCG GGGCGACCCC CGGGTGCGCC TGGTGGAGGA CGCGCCTGGC CACGACCCGC TCGTCAACTT CCAGCTGCGT GTGCCCGCCG GTCCGGCCCT GAGGGAGCGG GCGGTCGTCG AGCTGGTCGA CATGGCCAAC AAGAACAAGG CCGGGCTGCT GTGCTCTCCC GTGCCCGGCG CCACACGCGG TGACGGCGTC ATGCGCCTGG AGCGGATCGC CGCCTACGCC AGGGCCCGCC ACCTGTGGCC CGAGGCGACC TCCGAGGAGC TGGACCGCCG GGTGGAGGAG GTCTACGGCA CCCACTGGGT CCCCGGGACG GACTTCGTGT TCCCGGAGGA GGGGGAGCGG ACCAAGCCCG AGAACCCCGA GACGCTGCGG CGCAAGCTCG ACCAGGCGCT CGCCGAGGTG GAGCGCATGC GTGCCCGGGC CAGGCGCGCC GAGCGCAAGC TGCGCTGGTT CACCCCGGGG CTCACCCGCA GGGCGCTGCG CAAGCTGGCG CGCTGA
|
Protein sequence | MTTLDMTPEP VSGDRATAPI LPRAEIDYRD QPRARRNDYS VLQPPAIGEW TPTLSVSVVI PAHGHQEKLE LVLASLSAQS YPAHLMEVIV VDDGTPEPLT LPPVRPENTR LITSAPGGWG SAHAVNSGVA VSSGQVVLRL DADMLVYRDH VESQMRWHHL VDYGVALGHK MFVDFDPKAM TAEYVATEVR EGRAAQLFDR ESADPHWVEQ TIDGKDKLRT ADRLAYKVFI GATGSLHRTL FDAAGGLNGE LVLGGDSEFA YRVSQQGALF VPDLDTSSWH LGRTQMQTRR DAGTRYRAPF VANRVPDFHL RRKRPDRQWE VPLVDVVMDV DGATLEDVDT TASALLSGTT PDIRLWLVGP WDVLDEGRRS PLDEERLDLR LIRETFRGDP RVRLVEDAPG HDPLVNFQLR VPAGPALRER AVVELVDMAN KNKAGLLCSP VPGATRGDGV MRLERIAAYA RARHLWPEAT SEELDRRVEE VYGTHWVPGT DFVFPEEGER TKPENPETLR RKLDQALAEV ERMRARARRA ERKLRWFTPG LTRRALRKLA R
|
| |