Gene Ndas_0580 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0580 
Symbol 
ID9244422 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp721827 
End bp723026 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content74% 
IMG OID 
Productglycosyl transferase group 1 
Protein accessionYP_003678533 
Protein GI297559559 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.55279 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGAAC GCCCGCTGCG CGTCCTGATC GCCAGCGACA CCTATCCCCC CGACGTCAAC 
GGGGCCGGAT ACTTCACCCA CCGCCTGGCC GAGGGACTGG CCGGACGGGG GCACCGGGTC
CACGTGGTGT GCCCCTCCGA GCGGGGCGAA CCCCACGTCA CGGTCAACGG CGCGGTGACC
GAGCACCGCC TGCGCTCGGC CCCCATCCCC TTCATGCGCG CCGCCGTCCC GCTGGGGATG
GGCGGGCACA TCGCCAGGGT CATCGAGCGC CTGGACCCCG ACGTCGTGCA CGCGCAGAGC
CACTTCCCGC TCAGCCGCTC GGCGATGCGC AGGGGCCGTG CCGCAGGCGT CCCCGTCGTG
CTCACCAACC ACTTCATGCC GGACAACCTG TACGCGCACG CCCGGATCCC CGCGCCGATG
CAGGAACTCG CGGGCCTCCT GGCCTGGAGG GACATGGTCC GGGTGGCGGG GGAGGCAGAC
CACGTGACCA CGCCGACCCC GCGGGCGGCC CGGCTCCTGC GGGAGAAGGG GTTCACGCGT
GAGGTCGAGG CCATCTCGTG CGGGATCGAC CTGGAGCGGT TCCGCCCCCA CGAGGACCCC
GCCGCGGCCC GGCGCCGGTT CGGTCTGCCC GACCGGGACA CGATCGTGTT CGTGGGCAGG
CTGGACGCGG AGAAGAGGAT CGACGACACC ATCCGGGCGC TCGCGCGGAT CGTGCCCGAA
CGCGACGCCC AGCTGGCCCT GGCCGGGACC GGTCAGCGCG AGGAGGAGCT GCGCGCGCTG
GCCGCGGAGC TGGGGGTGGC GGACCGGGTG TTCTTCCTGG GGTTCGTCCC GGACGAGGAC
CTGCCCCTGG TCTACGCGGC GGGGGACGCC TTCGCCATCG CGGGTGTGGC CGAGTTGCAG
AGCATCGCCA CCCTGGAGGC GATGTCCACC GGTCTGCCGG TGGTGGCCGC GGACGCGATG
GCGCTGCCGC ACCTGGTGGA GGAGGGGCGC AACGGGTTCC TCTTCCCGCC GGGCGACCCC
GTGCGCCTGG CGGACCGGCT GCTCCTCGTG CTCGGCCCCG GTCGGCGCGC GCTCGGCGCG
GCCAGCCGTG AGCTCGCGCG GCGGCACGAC CACCACCGGT CCCTGGAGCG GTTCGAGGAG
GTCTACACCC GTCTGCGGGG CACGGCTCCC GCGCGGGTGG AGGGGCTGGT CGCCGCCTGA
 
Protein sequence
MAERPLRVLI ASDTYPPDVN GAGYFTHRLA EGLAGRGHRV HVVCPSERGE PHVTVNGAVT 
EHRLRSAPIP FMRAAVPLGM GGHIARVIER LDPDVVHAQS HFPLSRSAMR RGRAAGVPVV
LTNHFMPDNL YAHARIPAPM QELAGLLAWR DMVRVAGEAD HVTTPTPRAA RLLREKGFTR
EVEAISCGID LERFRPHEDP AAARRRFGLP DRDTIVFVGR LDAEKRIDDT IRALARIVPE
RDAQLALAGT GQREEELRAL AAELGVADRV FFLGFVPDED LPLVYAAGDA FAIAGVAELQ
SIATLEAMST GLPVVAADAM ALPHLVEEGR NGFLFPPGDP VRLADRLLLV LGPGRRALGA
ASRELARRHD HHRSLERFEE VYTRLRGTAP ARVEGLVAA