Gene Ndas_5160 details

Gene Information

Locus tag: Ndas_5160
Symbol:
ID: 9249053
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 299738
End bp: 301075
Gene Length: 1338 bp
Protein Length: 445 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: phosphoglucosamine mutase
Protein accession: YP_003683046
Protein GI: 297564073
COG category:
COG ID:
TIGRFAM ID:
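
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of the record.

```python
# Values copied from the record above.
START_BP = 299738
END_BP = 301075
GENE_LENGTH_BP = 1338
PROTEIN_LENGTH_AA = 445


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1338
    print(expected_protein_length(GENE_LENGTH_BP))  # 445
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields.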


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.24824
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCTCGGC TGTTCGGAAC AGATGGTGTA CGCGGTGTCG CCGGTCGGGA CCTGACCGCC 
GCCCTCGCGC TGGAGCTGTC CGTCGCGGCG GCGCGGGTGC TGACCCGCGG CGGCGACGGG
GCGCGGCCCA AGGCCGTGGT GGGCCGCGAC CCGCGCGCCT CCGGCGAGTT CCTCGAGGCC
GCCGTCGTCG CGGGCCTGGC CAGCACCGGT GTGGACGTGG TGCGCGTGGG CGTGCTGCCC
ACCCCGGCCG TGGCCTACCT GACCGGCGCG CTCGGCGCCG ACTTCGGCGT GATGCTCTCC
GCCAGCCACA ACCCGGCCCC CGACAACGGC ATCAAGTTCT TCGCGCGGGG CGGCCAGAAG
CTGGAGGACG CCCTGGAGGA GGAGATCGAG GGCATGCTCG GCGTGCCCTC CACCCCCGCC
GTGGGCGACC TCGTCGGCCG GGTCACCGAC GCCGACGACG GCGCCGAGCG CTACGTCGAG
CACGTGCTGG CCTCGGTGCC CCACAGCCTC AAGGGCCTCA AGGTCGTGGT GGACTGCGCC
AACGGCGCCT CCGCCCTGGT CGCGCCCGAG GCGCTGCGCC GCGCGGGCGC GGAGGTCGTG
GCGATCGGTG ACCAGCCGGA CGGCCACAAC ATCAACGACG GGTGCGGCTC CACCCACCTG
GAGGTGCTCC AGGAGGCCGT CCGCCTGCAC GGCGCGGACG CCGGTATCGC CAACGACGGC
GACGCCGACC GCTGCCTGGC CGTGGACGCC GAGGGCCGCG TGGTGGACGG CGACCAGATC
CTGGCGATCC TGGCCACCGA GGCCAAGGAG GAGGGCAGGC TCGCCCAGGA CACGCTCGTG
GTCACGGTCA TGTCCAACCT GGGCCTGAAG CTGGCGATGG AGCGGGAGGG CATCACCGTG
GTGGAGACCG CGGTGGGCGA CCGCTACGTG CTGGAGGAGA TGAAGCGCGG CGGGTTCGGC
CTGGGCGGTG AGCAGTCCGG GCACGTGATC CTGCTGGAGC ACGCCACCAC CGGAGACGGC
GTCCTGACCG GCCTGCACCT GCTGGCCGCG ATGGCCCACC GCGAGCAGGG CCTGGCGGAG
CTGGCCAAGG TGATGACCCG CCTGCCGCAG GTGCTCGTCA ACGTGCCCGA CGTGGACAAG
GCCCGCGCCA AGGACTCCGC GGAGCTCGCG GCGGCCGTGC GCGAGGCCGA GGAGGAGCTG
GGCGAGACCG GCCGGGTGCT GATCCGGCCC AGCGGGACCG AGCCCAAGGT CAGGGTCATG
GTCGAGGCGC CGGAGCAGGA GCAGGCCACC GCGGTGGCCG AGCGGCTGGC CGCCGTGGTG
CGCTCCGCAC TCGGCTGA
 
Protein sequence
MARLFGTDGV RGVAGRDLTA ALALELSVAA ARVLTRGGDG ARPKAVVGRD PRASGEFLEA 
AVVAGLASTG VDVVRVGVLP TPAVAYLTGA LGADFGVMLS ASHNPAPDNG IKFFARGGQK
LEDALEEEIE GMLGVPSTPA VGDLVGRVTD ADDGAERYVE HVLASVPHSL KGLKVVVDCA
NGASALVAPE ALRRAGAEVV AIGDQPDGHN INDGCGSTHL EVLQEAVRLH GADAGIANDG
DADRCLAVDA EGRVVDGDQI LAILATEAKE EGRLAQDTLV VTVMSNLGLK LAMEREGITV
VETAVGDRYV LEEMKRGGFG LGGEQSGHVI LLEHATTGDG VLTGLHLLAA MAHREQGLAE
LAKVMTRLPQ VLVNVPDVDK ARAKDSAELA AAVREAEEEL GETGRVLIRP SGTEPKVRVM
VEAPEQEQAT AVAERLAAVV RSALG