Gene Ndas_5104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5104 
Symbol 
ID9248996 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp250809 
End bp251873 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content74% 
IMG OID 
ProductUDP-N-acetylenolpyruvoylglucosamine reductase 
Protein accessionYP_003682991 
Protein GI297564018 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.486269 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGCAC TCTCCGAGTA CACCACCCTG CGTCTGGGCG GCCCGGCGCG GACCTTCCTG 
GTCGCGGGCA CCACCGACGA ACTGGTCGCC GCCGTGACCC GGGCCGACGC CGCCGGCGAG
CCCGTCCTCG TCCTGGGCGG CGGCAGCAAC CTCGTGGTCT CCGACGACGG GTTCCCCGGA
ACGGTGGTCC TCGTGGACTC CAAGGGCGTC TCCTTCGAGG AGGCGGGCAC CGACGACGAG
GGCGAACCGG TCGTGCTGCT GCGCGCCGAC GCGGGCGTGG AGTGGGACCC GCTGGTGGAG
CGCGTCGTGG CCGAGGGCCT CAGCGGGCTG GAGTTCCTGT CGGGCATCCC CGGCCGGGTC
GGCTCCACGC CCATCCAGAA CGTGGGCGCC TACGGCCAGG ACGTCAGCCA GACCATCCGC
GAGGTGCTCG TCCACGACCG GCGCACCGGC GAGCGGCGCC GGATGACCAA CGCCGAGTGC
GGCTTCAGCT ACCGCGACAG CGTCTTCAAG GGCGACGACC GCCACGTGGT GTGCGAGGTG
GTCTTCGCGC TGCGCCGCTC CAAGCTCAGC CGTCCGGTCG CCTACGCCGA GGTGGCCCGC
ACCTTGGGGG CCGAGGCGGG CACCCGGGTG CCGCTGGAGC GGGCGCGCGA GACCGTCCTG
GGGCTGCGCA GGGGCAAGGG CATGGTCCTG GACCCCGCCG ACCCCGACAC CCGCAGCGCC
GGGTCGTTCT TCACCAACCC GGTGGTCACC GCCGAGGAGT TCGCGGCCGT GCGCGAACGG
GCGGCCGCCC GCCTGGGCGC CGACGTACAG GTGCCCGGCC ACCCCGACGC ACGGGGGAAC
GTCAAGCTCT CCGCGGCCTG GCTGATCGAC CGCGCCGGGT TCACCAAGGG GTACGGCGAC
GGCCCCGCGC GCATCTCCGG CAAGCACAGC CTGGCCCTGA CCAACCCCGG CGGCGCCACC
ACGAAGGACC TGCTGGAGCT GGCCCGCGAG GTGCGGGCGG GAGTGGAGGA GGCCTTCGGG
GTCCGCCTGG TCAACGAGCC GGTGATGGTC GGCGTCTCCC TCTGA
 
Protein sequence
MTALSEYTTL RLGGPARTFL VAGTTDELVA AVTRADAAGE PVLVLGGGSN LVVSDDGFPG 
TVVLVDSKGV SFEEAGTDDE GEPVVLLRAD AGVEWDPLVE RVVAEGLSGL EFLSGIPGRV
GSTPIQNVGA YGQDVSQTIR EVLVHDRRTG ERRRMTNAEC GFSYRDSVFK GDDRHVVCEV
VFALRRSKLS RPVAYAEVAR TLGAEAGTRV PLERARETVL GLRRGKGMVL DPADPDTRSA
GSFFTNPVVT AEEFAAVRER AAARLGADVQ VPGHPDARGN VKLSAAWLID RAGFTKGYGD
GPARISGKHS LALTNPGGAT TKDLLELARE VRAGVEEAFG VRLVNEPVMV GVSL