Gene Ndas_1567 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1567 
Symbol 
ID9245417 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1917910 
End bp1918986 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content79% 
IMG OID 
Productglycosyl transferase family 9 
Protein accessionYP_003679502 
Protein GI297560528 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.029243 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAACCGG TGGGAGAGAA CCCGGGCCCC GGCCACGGGG GCCCGGACGG GGGCCGGGTC 
CCCGGCACGG GAGGGCGGCC GACCCTGCTG GCGCTGCGGG CGCTGGGGCT GGGCGACTTC
GCCACGGCCG TTCCCGCGCT GCGCGCGCTG GAGCGGGCGC TGCCGTCCTG GCGGCGCACC
CTGGCGGGCC CCTCCTGGTA CCGGCACCTG GTCGCGCTGG CCGGGCTGGA CTGGGAGGTC
CTGCCGACCG AGCCGCTGCG CGCGCCCGAC ACGCGCTCTC CGGACCTGGC GGTCAACCTG
CACGGCCGGG GGCCGCAGAG CACCGCGGCC CTGGCGGCGC TGACGCCGGA GCGGCTGTGG
ACGCACGGCC ACCCGTCCGC CCCGCAGTGG CCGGGCCCGG AGTGGCCCGA GGGGGTCCAC
GACGCCGAGA TCTGGTGCCG CCTGCTGCTC GCGCACGGTG TGGCGGCCGA CCCGGACGAC
CTGCGGTGGC CGGACCCGTC GCGGGGGCGG GGCGCGGTGG TCGAGGGGGA CACCGCGATC
GTCCACCCGG GGGCCGCCTC CGGGTCGCGG CGCTGGCCCC CGGAGCGTTT CGCGCGCGTG
GCGGGGGCCC TGGCGGCCTC GGGGCTGCGG GTGCTGGTGA CCGGCTCGCC GAACGAGACC
GCGCTGGCCG AGCGGGTGGC CGAGGCCGCC GGGCTCGGCG GTCGGGCGGT GCTGGCCGGG
CGCACCTCCC TGGACCTGCT GGCGCGGCTG GTCGGCGAGG CTCGGCTGGT GGTGTGCGGG
GACACCGGGG TCGGTCACCT GGCCACGGCC TACGGCACGC CGTCGGTGCG CCTGTTCGGG
CCGGTCTCCC CGCGGCTGTG GGGCCCGCGG GTGGACCGGG ACGTCCACGT GTGCCTGTGG
GCGGGTCGCC TGGGCGATCC GCACGCCGCC GCACTGGACC CGGGACTGGA CGAGATCGGC
GTGGAGGAGG TCGTCGCCGC GTGCCGGAGC GTGTGCGCCT CCGACCGCCC CACCGCCGAA
CAGGCGGTCC CCGGCTCCCG AGCGCCGACC CCCTGCCCGA AAGCGTGGAC GATGTGA
 
Protein sequence
MEPVGENPGP GHGGPDGGRV PGTGGRPTLL ALRALGLGDF ATAVPALRAL ERALPSWRRT 
LAGPSWYRHL VALAGLDWEV LPTEPLRAPD TRSPDLAVNL HGRGPQSTAA LAALTPERLW
THGHPSAPQW PGPEWPEGVH DAEIWCRLLL AHGVAADPDD LRWPDPSRGR GAVVEGDTAI
VHPGAASGSR RWPPERFARV AGALAASGLR VLVTGSPNET ALAERVAEAA GLGGRAVLAG
RTSLDLLARL VGEARLVVCG DTGVGHLATA YGTPSVRLFG PVSPRLWGPR VDRDVHVCLW
AGRLGDPHAA ALDPGLDEIG VEEVVAACRS VCASDRPTAE QAVPGSRAPT PCPKAWTM