Gene Ndas_4149 details


Gene Information

Locus tag: Ndas_4149
Symbol:
ID: 9248023
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4953819
End bp: 4955666
Gene Length: 1848 bp
Protein Length: 615 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: glucosamine/fructose-6-phosphate aminotransferase, isomerizing
Protein accession: YP_003682050
Protein GI: 297563076
COG category:
COG ID:
TIGRFAM ID:
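
The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the gene length includes one terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 4953819
END_BP = 4955666
GENE_LENGTH_BP = 1848
PROTEIN_LENGTH_AA = 615


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def residues_from_cds(length_bp: int) -> int:
    """Amino-acid count, assuming the CDS ends with one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_residues = residues_from_cds(computed_length)

print(computed_length == GENE_LENGTH_BP)        # coordinates agree with Gene Length
print(computed_residues == PROTEIN_LENGTH_AA)   # length agrees with Protein Length
```

Under these assumptions the span is 1848 bp, which is 616 codons: 615 residues plus the stop codon.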


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGCGGAA TCGTTGGCTA CGTCGGGCCG CAACCGGCGC TTGAAGTCGT CGTGGACGGC 
CTCGCAAGGT TGGAGTACCG CGGATATGAC TCCGCGGGTG TGGCTGTGCT GGGCGACGGC
GCTCTGCGCA CCGAGAAGCG CGCGGGCAAG CTCGCGAACC TGCGCCGGGC GCTGGAGGAG
CGGCCCGTCA GCGGCGACGG CGCCGGGATC GGACACACGC GCTGGGCCAC GCACGGCGCG
CCGAACGACG TCAACGCCCA CCCCCACGTG GACAACGACA ACCGCGTCGC CATCGTCCAC
AACGGCATCA TCGAGAACTT CGCCGCGCTG CGCCTGGAGC TGGAGGAGCA GGGCTGCAAG
TTCCTCTCCG AGACCGACAC CGAGGTCGCC GCGCACCTGC TCAACGCCGA GCTGGCCCGG
ACCGGCGAGC TGCCCTCGGC CATGCGCGCG GTGTGCAAGC GCCTGGAGGG CGCGTTCACC
CTCGTGGCCG TCTCCGTGGA CGACCCCGAC CTGGTCGTGG CGGCCCGCCG CAACTCCCCG
CTGGTCGTCG GCCGCGGCGA GGGCGAGAAC TTCCTGGCCA GCGACGTGGC GGCGTTCATC
GCCCACACCC GCGAGGCGGT CGAGCTGGGC CAGGACCAGG TCGTGGAGCT GCGCGCGGAC
TCGATCACGG TGACCGACTA CGACGGCAAC CCCGTCGACG TGCGCGAGTA CCACGTGGAC
TGGGACGCCT CCGCCGCCGA GAAGGGCGGT TACGACTACT TCATGCTCAA GGAGATCGTC
GAGCAGCCGC GCGCGGTGGC CGACACCCTC CTGGGCCGCG TGACGGTCGA CGGCCAGCTC
ACCCTGGACG AGATGCGCCT GTCCCCCGAG GACCTGCGCT CGGTCGAGAA GATCGTCATC
ATCGCCTGCG GCACCTCCTA CCACGCGGGC CTGATCGCCA AGTACGCCAT CGAGCACTGG
TGCCGCATCC CGTGCGAGGT CGAGGTGGCC AGCGAGTTCC GCTACCGGGA CCCGATCCTG
GACCAGCAGA CCCTGGTCAT CGCCATCTCC CAGTCCGGCG AGAGCATGGA CACCCTGATG
GCGGTCCGCT ACGCCCGCGA GCAGCGCGCC CGGGTGCTGG CCATCTGCAA CGTCAACGGG
TCCACCATCC CGCGCGAGTC CGACGGCGTG CTGTACACGC ACGCGGGCCC CGAGGTCGGC
GTCGCCGCCA CCAAGACCTT CCTCACCCAG CTGGCCGCCT GCTACCTGAT CGGCCTGTAC
CTGGCGCAGG TGCGCGGGCT GAAGTTCGGC GACGAGATCA ACGCCGTGAT CGCCCAGCTG
GCCACCATGC CCGAGCAGAT CGAGCGGGTC CTGGAGACGG CCGAGCCGGT CCGCGAGCTG
GCCCGGTCGC TGGCCGACGC GGACACGGTG CTGTTCCTGG GCCGCCACGT GGGCTACCCC
GTGGCGATGG AGGGCGCGCT CAAGCTCAAG GAGCTGGCGT ACATGCACGC CGAGGCGTTC
GCCGCCGGTG AGCTCAAGCA CGGGTCGATC GCGCTGATCG AGGACGGTGT GCCGGTCGTC
GTGGTGGTGC CCTCCCCCGA GGGCCGCAGC GTGCTGCACG ACAAGATCGT GTCCAACATC
CAGGAGGTGC GCGCCCGCGG CGCCCGCACC ATCGTCATCG CCGAGGAGGG CGACGAGGTG
GTCCGCCCGT ACGCGGACGT GCTCATCCCC ATCCCGGCCG TGCCGACCCT GCTCCAGCCG
CTGGTGTCCA CGATCCCCAT GCAGGTGTTC GCCTGCGAGC TGGCCCTGGC CAAGGGCAAC
GACGTGGACC AGCCGCGCAA CCTGGCCAAG AGCGTGACGG TCGAGTAG
 
Protein sequence
MCGIVGYVGP QPALEVVVDG LARLEYRGYD SAGVAVLGDG ALRTEKRAGK LANLRRALEE 
RPVSGDGAGI GHTRWATHGA PNDVNAHPHV DNDNRVAIVH NGIIENFAAL RLELEEQGCK
FLSETDTEVA AHLLNAELAR TGELPSAMRA VCKRLEGAFT LVAVSVDDPD LVVAARRNSP
LVVGRGEGEN FLASDVAAFI AHTREAVELG QDQVVELRAD SITVTDYDGN PVDVREYHVD
WDASAAEKGG YDYFMLKEIV EQPRAVADTL LGRVTVDGQL TLDEMRLSPE DLRSVEKIVI
IACGTSYHAG LIAKYAIEHW CRIPCEVEVA SEFRYRDPIL DQQTLVIAIS QSGESMDTLM
AVRYAREQRA RVLAICNVNG STIPRESDGV LYTHAGPEVG VAATKTFLTQ LAACYLIGLY
LAQVRGLKFG DEINAVIAQL ATMPEQIERV LETAEPVREL ARSLADADTV LFLGRHVGYP
VAMEGALKLK ELAYMHAEAF AAGELKHGSI ALIEDGVPVV VVVPSPEGRS VLHDKIVSNI
QEVRARGART IVIAEEGDEV VRPYADVLIP IPAVPTLLQP LVSTIPMQVF ACELALAKGN
DVDQPRNLAK SVTVE
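
The gene and protein sequences can be checked against each other by translating the nucleotides with the record's translation table (11). The sketch below is a minimal illustration that uses only the first row of the gene sequence; the helper names (`clean`, `translate`, `gc_content`) are hypothetical and not part of any database tool. Table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons it permits, so a plain 64-codon lookup is enough here.

```python
from itertools import product

# Amino acids for NCBI translation table 11, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spacing and line breaks used in the listing; upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at a stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence and first two blocks of the protein sequence.
first_row = "ATGTGCGGAA TCGTTGGCTA CGTCGGGCCG CAACCGGCGC TTGAAGTCGT CGTGGACGGC"
protein_start = clean("MCGIVGYVGP QPALEVVVDG")

print(translate(first_row) == protein_start)
```

Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the reported GC content.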