Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3774 |
Symbol | |
ID | 9247643 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4535753 |
End bp | 4536871 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | |
Product | glycosyl transferase group 1 |
Protein accession | YP_003681678 |
Protein GI | 297562704 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.291858 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.904998 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCCCGG CGGCGCGGGT GGCCCTGGTC GGGCCCGCCC ACCCCTACAA GGGCGGCGGC GCCCGGCACA CCACCGAGCT GGCGCACCGG CTGAGCGCGC TGGGCCACCC CACGGCCGTG GAGTCCTGGC GGGCGCAGTA CCCGGCCGCC CTCTACCCCG GACAGCAGAC CATCGAGGTC CCCGAGGGCG AGCCCTACCC CGGCACCCGC CACGAGCTGG CCTGGTACCG GCCGGACGGG TGGTGGCGGA CGGGGCGGCG CCTGGCCCGC GAGGCCGACC TGGTGGTGCT CACCCTGTTC TCCCCGGTGC AGGTGCCCGC CTACCTCGGC GTCCTCGCCG GGGTGCGCTC CGTGCGGCGT TCCGGGGGCG CGCGGGTGGT GGCGCTGTGC CACAACGTGC TGCCCCACGA ACGGCGTGCC GTGGACGTGT CCCTCGTGCG GGCCCTGCTG CGGCGGGTGG ACGGCGTGGT CGCCCACTCG CCCGAGCAGG CGCGCCTGGC CGAGGGGCTC GGGGGCCCCG GGGCGCGGCG GCCCGTGGTC GCCCAGATGC CCCCGCACCT GCCCGAGACC GGCGGCGCGG TCGCCCCGGC GCCGGGGGAG CGGCGGCACC TGCTCTTCCT GGGCATCGTC CGCCCCTACA AGGGGGTGGA CCTGCTGCTG CGCGCCCTGG CCGACGGAGC GCCCGGGGAC GTGGCGCTCA CCGTGGCCGG GGAGTTCTGG GGCGGCACAG CGGAGCTGGA GGAACTGGCC GCCGGGCTGG GGATCGCCGA CCGGGTGCGG CTGCGGGACG GGTACGTCCC CGCCGCCGAG CTGCCGGAGC TGTTCGCCTC GGCCGACGCG GTGGTGCTGC CCTACCGCAC CGCCACCGCC ACCCAGAACG TGTGGCTGGC GCACGAGCAC GGGATTCCGG TGGTGGCCAC CCGGGCCGGG ACCCTCCCCG ACCACGTGCG CGAGGGGGTC GACGGCCTGC TGTGCGCGCC GGGCGACGCC GCCGACCTGG CCCGCGCGCT GGGGGAGTTC TACGCGCCGG GGGAGCCCGA GCGGCTGCGC GCCGGGGTCC GCCCGGTGGA GACCGAGCCG TACTGGCGGG CCTACACCGA GCGGTTGCTC GGGGCCTGA
|
Protein sequence | MSPAARVALV GPAHPYKGGG ARHTTELAHR LSALGHPTAV ESWRAQYPAA LYPGQQTIEV PEGEPYPGTR HELAWYRPDG WWRTGRRLAR EADLVVLTLF SPVQVPAYLG VLAGVRSVRR SGGARVVALC HNVLPHERRA VDVSLVRALL RRVDGVVAHS PEQARLAEGL GGPGARRPVV AQMPPHLPET GGAVAPAPGE RRHLLFLGIV RPYKGVDLLL RALADGAPGD VALTVAGEFW GGTAELEELA AGLGIADRVR LRDGYVPAAE LPELFASADA VVLPYRTATA TQNVWLAHEH GIPVVATRAG TLPDHVREGV DGLLCAPGDA ADLARALGEF YAPGEPERLR AGVRPVETEP YWRAYTERLL GA
|
| |