Gene Information |
Locus tag | Ndas_3981 |
Symbol | |
ID | 9247852 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4761688 |
End bp | 4763256 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase |
Protein accession | YP_003681884 |
Protein GI | 297562910 |
COG category | |
COG ID | |
TIGRFAM ID | |
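The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information table above.
START_BP = 4761688
END_BP = 4763256


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, protein_length(n))  # prints "1569 522"
```

Under those assumptions the result agrees with the listed Gene Length (1569 bp) and Protein Length (522 aa).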
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTCACAA CCAGCCGGAC GAAGACCGTC CACGAACAGT CGACGCGGAT CGTGGTCGGC CCCGGAGCCC CAGGGGCCCC AGGGGTCCCA GGAATCCCTG GGGCCGCGGG CGCCACCGGG CGGCGCGCGG CGGTCGCCTC CGGCGGTGGG ACCAACGGCG CCTGGGTGCG CCCCTACGTC GTGAGCCTGG TCGGCCTGGA CCTGGCCGCC GCCCTGACGG CCACCCTCAG CGGCGCGGCG GTCCGCTTCC CCTCCGCTCT CGGAACCCTC ACCACCCTGC CCTACCTGGC CCTCTCCCTC CTGCTGCCCC CGGTGTGGAT CCTCTTCGTC TACCTGGGCG GCGGCTACGC CCGCCGCTTC CTGGGCGTGG GCACGGAGGA GTACCGCCGC GTCGCCACGG CCGGGATCGC CCTCGCCGCC GCCGTGGCCG TCGGCGCCTA CGCCCTGCGC TTCGACCTCG CCCGCGGCTA CGCGCTGGTC ACCCTGCCGC TCATCCTCCT GCTCACCCTC GGTCTGAGGT ACGCGCGCCG CAAGGCCCTG CACCGGCGCC GCGGTTCGGG CCAGTGCATG AGCGGGGTCG TGGTCGTCGG CTACCGCGTG GCCGTCCGCG ACCTGGTCCG CCGGTTCCGC GGCGAGGTCT ACCACGGCAT GCGCGTGGTG GGCGTGTGCC TGCCCCAGGA GGAGGTCGCC TCCGGTCCGG GCGCCGACGA GGTGGAGGGC TGTCCGGTCC TGGGCACCTT CACCGGCGCG GCCGAGGCCG CCGCCCTGGC CGGGGCCGAC ACCGTCGCGG TCCTGGCCTG TCCGGAGATG GACGGGGCCG AGCTGCGCCG CCTGGCCTGG CGGCTGGAGG AGACCGGCAC CGACCTCATC GTCGCCTCCG CGCTCATGGA CGTGGCCGGA CCGCGCACCT CCATCCGGCC GGTCGCGGGG CTGCCCCTGC TGCACGTGGA GCACCCCGAA CTGGTGGGCG CGCGCCGCGT CCTCAAGGGC GCCTTCGACC GCTGCGCCGC CGCCCTGGCC CTGATACTGC TGTCGCCGCT GTTCCTCGCG CTGTGCGTCC TCGTCCGGGC CGAGGGCGGC GGGCCCGCCC TCTTCACCCA GACGCGGGTG GGCAGGGGCG GCCGCGAGTT CACCGTCTAC AAGTTCCGTA CGATGGTGGT GGGGGCCGAG GCGTTGAAGG CGATGCTCCA GCCCCGCAAC GAGCACGAGG GCGTGCTGTT CAAGATGCGC CGCGACCCCA GGGTGACGGC CGTGGGGGCC TGGCTGCGCC GGTACTCGCT CGACGAGCTT CCCCAGCTCG TCAACGTGGT CCGGGGGGAG ATGTCGCTCG TCGGCCCGAG GCCGCCGCTT CCGGAGGAGG TCGCCCGCTA CGGGGACGAC GTCCGCCGCA GGTTGGTGGT CAAGCCGGGT ATGACGGGTC TGTGGCAGGT GAGCGGCCGC TCCGACCTCT CCTGGGAGGA ATCGGTCCGC CTCGACCTGC GGTACGTGGA AAACTGGTCG CTGACACTGG ACGTCCAGAT CTTGTGGAAG ACGTGGTCAG CGGTGATCCG TGGGGCGGGA GCATACTAG
|
Protein sequence | MVTTSRTKTV HEQSTRIVVG PGAPGAPGVP GIPGAAGATG RRAAVASGGG TNGAWVRPYV VSLVGLDLAA ALTATLSGAA VRFPSALGTL TTLPYLALSL LLPPVWILFV YLGGGYARRF LGVGTEEYRR VATAGIALAA AVAVGAYALR FDLARGYALV TLPLILLLTL GLRYARRKAL HRRRGSGQCM SGVVVVGYRV AVRDLVRRFR GEVYHGMRVV GVCLPQEEVA SGPGADEVEG CPVLGTFTGA AEAAALAGAD TVAVLACPEM DGAELRRLAW RLEETGTDLI VASALMDVAG PRTSIRPVAG LPLLHVEHPE LVGARRVLKG AFDRCAAALA LILLSPLFLA LCVLVRAEGG GPALFTQTRV GRGGREFTVY KFRTMVVGAE ALKAMLQPRN EHEGVLFKMR RDPRVTAVGA WLRRYSLDEL PQLVNVVRGE MSLVGPRPPL PEEVARYGDD VRRRLVVKPG MTGLWQVSGR SDLSWEESVR LDLRYVENWS LTLDVQILWK TWSAVIRGAG AY
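The gene and protein sequences above can be checked against each other. The following is a minimal sketch, not the database's own method: it strips the 10-character grouping, computes GC content, and translates the CDS using the codon assignments of translation table 11, with an alternative start codon such as GTG read as M. The helper names (`clean`, `gc_content`, `translate_cds`) are hypothetical. The gene sequence as listed begins with GTG and ends with TAG, so it appears to be given in coding orientation even though the gene lies on the minus strand.

```python
# Codon table built in the conventional TCAG order; table 11 shares these
# amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove whitespace (the 10-character grouping) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate_cds(seq: str) -> str:
    """Translate a CDS; a recognised start codon is read as M, stop ends translation."""
    s = clean(seq)
    if len(s) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(s), 3):
        codon = s[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 nt of the Gene sequence field above.
    print(translate_cds("GTGGTCACAA CCAGCCGGAC GAAGACCGTC"))  # prints "MVTTSRTKTV"
```

To check the whole record, paste the full Gene sequence value into `translate_cds` and compare the result with the Protein sequence field after removing its spaces. `gc_content` on the same input can be compared with the listed GC content.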