Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2271 |
Symbol | |
ID | 9246121 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 2716961 |
End bp | 2718628 |
Gene Length | 1668 bp |
Protein Length | 555 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | |
Product | glycosyl transferase family 2 |
Protein accession | YP_003680199 |
Protein GI | 297561225 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.464934 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.787256 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGAACGCG TCCTCACCCG CGTCTTCCGC AACGACTGGA GTGCGCTGAC ACCGCCCGAC ATCGGACGCT GGACCCCCGA CCTGCGGGTC AGCGTCGTCA TCCCGGCGCG CGGCGGGCAG CGCCGCCTCG ACCTCGCCCT GGCCTCCCTG GCCGCGCAGA CCTACCCCGA GGACCTGATG GAGGTCGTGG TCGTGGACGA CCACTCCTCC CCGGCGCTGC GCCTGCCCGA CCTGCGCCCC GCGCACTGCC GCGTCCTGAC CGTCCCCGAC GGCGGCTGGG GCGCCGGGTA CGCCCGCGCC TACGGTGCCC ACTCCAGCAC CGGCGACGTC CTGCTGTGGA TGGACGCCGA CATGGTGGTG TGCCGCGAGT TCGTCGAGGC CCAGGCCCGC TGGCACCACG TGCACGCCGA GGCCGTCACG CTCGGCCGGG TCCGCTTCGC CGACACCGGG CCCCAGAGCC CCACCGACGT CCTGGCCCTG GCCCGCACCG ACGCCCTGCA CGGCGCCCTG GACACCGGCC GCCACCACGC GTGGGTCGAG CGCGTCCTCA CCGGGAGCGA CGGCCTGCGC GACGCCGACC ACCTGGGCTT CCACGCCTAC GTCGGCGCGG CGGCGGCCGT GCGCCGCAGC CTGTACGAGG CCGCCGGGGG AGTGGACCCC GACCTCGACC TGGGCCAGGA CACCGAGTTC GGCTACCGCC TCTGGCAGGC GGGCGCCGTC CTGCTGCCCG AACCCGCCGC CACCGGCTGG CACGTGGGCC GCGCGGGCAC CGCGCGCACC CGGCTGCCCT CCGAACGCTT CCGCACCGAG GTCCTCGCCG AGTTCATGCC GCACCCGCAC GCCTACCGCG AGCGGGTACC GGCCCACCGG CGCCGCATCC CGCTCGTGCA CGCCGTGGTC GAGGTCTCCG GCGCGCCCTA CGACCTGGTC CGCGGCTGTG TGGACCGGCT CCTGGACAGC GCCGAGACCG ATCTGGCCCT GACCCTGGTC GCCGACTGGG AGGGCGCCGA GGAGGGAGGG GGAGCGCGCG GGGCGCGGCG GCTGCGGGCG GTGGACGGCC GGGCCCGGCG CGACGTCGGC GGACCCCACC TGGACCTGCG GCTGATCCAG GCCAACTACC TGCGCGAACC CCGGATCTCC TTCGCCACGT CCGCGCCCCG CACCGGTTTC CCCTCACCCT TCCTGCTCCA GGTCCCGGTC TCCTGGGGCC TGGGCCAGGT GGCCCTGTCG CGCCTGCTGG CCAGCGCCGA GCGCGCCCGG GCGGGACTCA CCGAACTCTT CCCGGCCGCC TCGCCCACCC GCGACGCCGG GGTGAGGCTG TGGCGCACCC GGGCGCTGGC CCGGGCCCTG CGGGTGCGCG AGGAGGACGA GGACCTGGGC GACGTGGTCG CCGCGCTGCA CGGCCGCTAC CGGATCCACG CCGGGGAGGA GACGCTGACC GACCTGTCGC TGTACCGCTC GGTGCCGCCG CCCCCGCGCA CCGAACCCGA ACCGGCGGAG TTGGCCGCAC CGTCCCCGCG GGCCGGGACC GAGGAGCGCC CGGCGGGCGA GTGCGGGACG TGGGAGTGCG GGACGGGGGA GGAGCGCACC GGCGCCGCGC GGTCCGGAGG GTGGCTGCGC TCGGGATGGG CGCGGGCCCG CCAGCGGCTG CGCCGCGAGC GCGGCTGA
|
Protein sequence | MERVLTRVFR NDWSALTPPD IGRWTPDLRV SVVIPARGGQ RRLDLALASL AAQTYPEDLM EVVVVDDHSS PALRLPDLRP AHCRVLTVPD GGWGAGYARA YGAHSSTGDV LLWMDADMVV CREFVEAQAR WHHVHAEAVT LGRVRFADTG PQSPTDVLAL ARTDALHGAL DTGRHHAWVE RVLTGSDGLR DADHLGFHAY VGAAAAVRRS LYEAAGGVDP DLDLGQDTEF GYRLWQAGAV LLPEPAATGW HVGRAGTART RLPSERFRTE VLAEFMPHPH AYRERVPAHR RRIPLVHAVV EVSGAPYDLV RGCVDRLLDS AETDLALTLV ADWEGAEEGG GARGARRLRA VDGRARRDVG GPHLDLRLIQ ANYLREPRIS FATSAPRTGF PSPFLLQVPV SWGLGQVALS RLLASAERAR AGLTELFPAA SPTRDAGVRL WRTRALARAL RVREEDEDLG DVVAALHGRY RIHAGEETLT DLSLYRSVPP PPRTEPEPAE LAAPSPRAGT EERPAGECGT WECGTGEERT GAARSGGWLR SGWARARQRL RRERG
|
| |