Gene Ndas_4757 details


Gene Information

Locus tag: Ndas_4757
Symbol:
ID: 9248639
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 5643373
End bp: 5644836
Gene Length: 1464 bp
Protein Length: 487 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: Succinate-semialdehyde dehydrogenase
Protein accession: YP_003682648
Protein GI: 297563674
COG category:
COG ID:
TIGRFAM ID:
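
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon; the record does not state either convention, and the function names are illustrative only.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive (not stated in the record).
START_BP = 5643373
END_BP = 5644836


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumption)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1464 487
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.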


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.613804
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGGATC AGGAAGCACG GGTCGTCGCG CAGGTGGACA AGAGGCTGTT CATCGGCGGG 
ACGTGGCGCG ACGCCGCCTC CGGGAACGCC TTCGAGGTGG AGGACCCCTC CACCCGTGAG
GTGCTCTGCG AGGTCGCGGA CGCGGGGAAG GAGGACGCGC TCGACGCCCT GGACGCGGCG
CACCGGGCCC AGGCCGACTG GATGCGCACG GCGCCCCGGG ACCGCGGCGA GATCCTGTAC
CGGGGCTACG AGCTGCTCAT GGAGCGCCGG GAGGACCTGG CGGTCCTGAT GACCCTGGAG
ATGGGCAAGC CCCTCGCCGA GGCCCGGGGC GAGATCGCCT ACGCCGCCGA GTTCCTGCGC
TGGTTCTCGG AGGAGGCCGT GCGCATCGAG GGAGGGTTCT CCACCTCCCC CGACGGCAAG
TCCCGCTTCC TGGTCATGCG CCAGGCGGTC GGCCCCTGCA TGCTCATCAC CCCCTGGAAC
TTCCCCATGG CGATGGGCAC CCGCAAGATC GGCCCCGCCA TCGCCGCGGG GTGCACCATG
ATCCTCAAGC CCGCCCACCA GACGCCCCTG TCCGCCCTCG CGCTGGCGGG CATCCTCTCC
GAGGCCGGGC TCCCCGAGGG CGTCCTCAGC GTCATCCCCA CCACGGATCC GGGCTCGGTC
ACCGAGCCCC TGCTCAGCGA CGGCCGCATC CGCAAGGTCT CCTTCACCGG CTCCACCGCC
GTGGGCCGCA AGCTCCTCGA ACAGAGCGCC GGGCAGGTTC TGCGCACCTC CATGGAGCTG
GGCGGCAACG CGCCCTTCCT GGTCTTCGAC GACGCCGACA TGGACGCCGC CGTGGACGGC
GCGATGCTCG CCAAGATGCG CAACATCGGC GAGGCCTGCA CCGCCGCCAA CCGCATCTAC
GCCCAGGCGG GCATCGCCGA GGAGTTCGCC GAGCGCCTCA GCCGCCGCAT GGGCGCGCTG
CGCCTGGGCC GCGGCGTGGA CGAGGGCGTC GACGTGGGCC CGCTCATCGA CGACAAGGCC
CGCGACAAGG TGCAGGGCCT GGTGGACGAC GCCGTCGGCA AGGGCGCCCG CGTGCTGGTC
GGCGGCGGCC CCGGCGACGG TCCGGGCCAC TTCTACAAGC CCACCGTGCT CGCCGACGTG
CCCTTCGAGG CCGAGCTGTC CACCACCGAG ATCTTCGGCC CGGTGGCCCC CGTGCTGCCC
TTCGAGACGG AGGACGAGGT GCTGCGGGCC GCCAACGACA CCGAGTACGG ACTGGTCAGC
TACGTCTACA CCCGCGACCT CAACCGGGCG CTGCGGGTCA GCGAGAACCT GGAGACCGGC
ATGGTCGGCC TCAACCAGGG CGTGGTCTCC AACCCGGCCG CGCCCTTCGG CGGGGTCAAG
CACTCCGGTC TGGGCCGCGA GGGCGGCCGG GTCGGCATCG ACGAGTTCCT GGAGACCAAG
TACGTCGGCA TCGGCGGAAT CTGA
 
Protein sequence
MSDQEARVVA QVDKRLFIGG TWRDAASGNA FEVEDPSTRE VLCEVADAGK EDALDALDAA 
HRAQADWMRT APRDRGEILY RGYELLMERR EDLAVLMTLE MGKPLAEARG EIAYAAEFLR
WFSEEAVRIE GGFSTSPDGK SRFLVMRQAV GPCMLITPWN FPMAMGTRKI GPAIAAGCTM
ILKPAHQTPL SALALAGILS EAGLPEGVLS VIPTTDPGSV TEPLLSDGRI RKVSFTGSTA
VGRKLLEQSA GQVLRTSMEL GGNAPFLVFD DADMDAAVDG AMLAKMRNIG EACTAANRIY
AQAGIAEEFA ERLSRRMGAL RLGRGVDEGV DVGPLIDDKA RDKVQGLVDD AVGKGARVLV
GGGPGDGPGH FYKPTVLADV PFEAELSTTE IFGPVAPVLP FETEDEVLRA ANDTEYGLVS
YVYTRDLNRA LRVSENLETG MVGLNQGVVS NPAAPFGGVK HSGLGREGGR VGIDEFLETK
YVGIGGI
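
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how that might be done, together with a GC-content calculation and a stop-codon check; the helper names (`clean_sequence`, `gc_percent`, `ends_with_stop`) are hypothetical and not part of any database tooling.

```python
# Stop codons shared by the standard code and translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(blocked: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Last line of the gene sequence, copied from the record.
tail = clean_sequence("TACGTCGGCA TCGGCGGAAT CTGA")
print(len(tail), ends_with_stop(tail))  # prints: 24 True
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `gc_percent` would allow the GC content field to be compared against the sequence itself.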