Gene Ndas_4359 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4359 
Symbol 
ID9248234 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5193252 
End bp5194547 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content73% 
IMG OID 
Productglutamate-1-semialdehyde-2,1-aminomutase 
Protein accessionYP_003682254 
Protein GI297563280 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.803676 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGTACTC AACAGACTTC AGCAGAGCTT TTCCAGCGTG CCTCCGCCGT CGTGCCGGGT 
GGCGTGAACT CCCCCGTCCG CGCCTTCGGC GCCGTCGGCG GCACCCCGCC CTTCTTCGTC
AAGGGTGAGG GCCCCTACCT CACCGACGCC GACGGCCGCC AGTACGTCGA CCTCGTGTGC
TCGTGGGGGC CGTTGATCCT CGGCCACGCG GCCCCCGCCG TCGTCGAGGC CCTCCACGGC
GCGGTCGACG CCGGGACCTC CTACGGCGCG CCGACCCCCG GCGAGGTCGA GCTGGCCGAG
CTGATCGTCG AGCGCACCCC GGTGGAGAAG GTCCGCCTCG TCAACTCCGG CACCGAGGCC
ACCATGTCCG CGATCCGCCT CGCGCGCGGC TTCACCGGGC GCAGCAAGAT CGTCAAGTTC
GCGGGCAACT ACCACGGCCA CGTCGACGCC CTCCTGGCCT CCGCCGGTTC GGGCCTGGCC
ACCTTCGCCC TGCCCGACTC CCCGGGCGTG ACCGGGGCCA GCGCCGCCGA CACCCTCGTG
CTGCCCTACA ACGACCCCGA GGCCGTCGAG CGAGCCTTCG CCGAGCACGG CGACGAGATC
GCCTGCGTGA TCGCCGAGGC CTGCCCCGCC AACATGGGCG TCGTCGCACC CCGGGACGGG
TTCAACGCCC GGATCAAGGA GATCGCGCAC GCCAACGGCG CCCTCCTCAT CCTCGACGAG
GTCCTCACCG GCTTCCGCGT CAGCGCCTCG GGCTGGTTCG GCCTGGAGGG CGTCGCCCCC
GACCTCATGA CCTTCGGCAA GGTCATGGGC GGCGGCCTGC CCGCCGCCGC GTTCGGCGGA
CGCGCCGAGA TCATGGACCG CCTCGCGCCG AACGGTCCCG TCTACCAGGC GGGCACCCTG
TCCGGGAACC CGCTTGCCAC CGCCGCCGGC CTGGCCACAC TGCGGGGGGC CACCCCCGAG
GTCTACGCCC GCATCGACGA GGTCTCCGCC CGGGTGGCGG CCGAGGTCTC CAAGGCGCTC
GGCGAGGCCG GGGTCGTCCA CCGGCTCCAG AACGGCGGCA ACCTCTTCAC GGTGTTCTTC
ACCGGCCAGG AGGCCGTCGA CTTCGACACC GCGCGCACCA CCGACACCGC GGTCTTCTCC
GCGTTCTTCC ACGCCATGCT CGACCAGGGC GTGTACCTGC CGCCCGCCGC CTTCGAGGCC
TGGTTCTTCT CCGCCGCGCA CGACGACGCC GCCGTGGACC GGGTGGTCTC GGCGCTGCCC
AGGGCGGCCC GCGCCGCGGC CGAGGCCCAG GGCTGA
 
Protein sequence
MGTQQTSAEL FQRASAVVPG GVNSPVRAFG AVGGTPPFFV KGEGPYLTDA DGRQYVDLVC 
SWGPLILGHA APAVVEALHG AVDAGTSYGA PTPGEVELAE LIVERTPVEK VRLVNSGTEA
TMSAIRLARG FTGRSKIVKF AGNYHGHVDA LLASAGSGLA TFALPDSPGV TGASAADTLV
LPYNDPEAVE RAFAEHGDEI ACVIAEACPA NMGVVAPRDG FNARIKEIAH ANGALLILDE
VLTGFRVSAS GWFGLEGVAP DLMTFGKVMG GGLPAAAFGG RAEIMDRLAP NGPVYQAGTL
SGNPLATAAG LATLRGATPE VYARIDEVSA RVAAEVSKAL GEAGVVHRLQ NGGNLFTVFF
TGQEAVDFDT ARTTDTAVFS AFFHAMLDQG VYLPPAAFEA WFFSAAHDDA AVDRVVSALP
RAARAAAEAQ G