Gene Ndas_3902 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3902 
Symbol 
ID9247773 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4674540 
End bp4675646 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content77% 
IMG OID 
Productsuccinyldiaminopimelate transaminase 
Protein accessionYP_003681805 
Protein GI297562831 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.116371 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGAGC GGCGACCCGT GGCGAAGCGG CTCCCGACCT TCCCGTGGGA CCGGCTGGCG 
CCGTACAAGC GCAGGGCCGC CGAGCACCCC GGCGGCATCG TCGACCTGTC CGTCGGCACC
CCCGTGGACC CGGTGCCCGC CCTGCTGCGC AAGGCCCTCG CCGACGCCGC CGACGCGCCC
GGCTACCCCC AGACCTGGGG CACGCCCGCG CTGCGCGCCT CCGTCGCGGG CTGGCTGGAG
CGCCGCCACG GCGTGCGCGT GGCGGAGGGC GCCGTCCTGC CCACCGTGGG CTCCAAGGAG
CTCGTGGCCT GGCTGCCCAC GCTCCTGGGC CTCGGCGCGG GCGACACGGT CGTCCACCCC
GAGCTGGCCT ACCCCACCTA CGACATCGGC GCGCGCGTGG CCGGGGCCAC CCCGGTGGCC
TCCGACGGCC TCACCTCCCT CGGCCCCGCC CCCGTCGGAC TGGTGTGGGT GAACTCGCCG
AGCAACCCGA CCGGCCGGGT CCTGGGCACC GCGCACCTGC GCAAGGTGGT GGAGTGGGCC
CGCGAGCGCG GCGCGATCGT GGCCTCCGAC GAGTGCTACC TCGACCTGGG CTGGGACGGC
GCCGAACCGG TGTCCATCCT GCACCCGGAC GTGTGCGGCG GATCCCACGA CAACCTGCTG
GCCGTGCACT CGCTGTCCAA GCGCTCCAAC CTGGCCGGGT ACCGCGCGGC CTTCGTCACC
GGGGACCCCG CGCTGGTCGA GGAGCTGCTG GCGGTGCGCA AGCACGCCGG GATGATCGTC
CCCGCGCCCG TCCAGGCGGC CATGGGCGCC GCCCTGGACG ACGACGCGCA CGCCACCGAG
CAGAAGGAGC GCTACCGGTC CCGCCGTGCC AGGCTGCGCG AGGCCCTGGA GGGCGCGGGC
TGGCGCATCG AGCACTCCGA CGCCGGGCTG TACCTGTGGG CCAGCCACCC CGACCACGAC
GCCTGGGGCG CGGTGGCCCA CCTGGCCGAA CGCGGCGTGC TGGTCGCTCC CGGGGACTTC
TACGGCCCGG CGGGCGCCGG GCACGTGCGC GTGGCGTTCA CCGCCACCGA CGAGCGGGTC
GAGGCCGCGG CCGAGCGCCT GGCCTGA
 
Protein sequence
MAERRPVAKR LPTFPWDRLA PYKRRAAEHP GGIVDLSVGT PVDPVPALLR KALADAADAP 
GYPQTWGTPA LRASVAGWLE RRHGVRVAEG AVLPTVGSKE LVAWLPTLLG LGAGDTVVHP
ELAYPTYDIG ARVAGATPVA SDGLTSLGPA PVGLVWVNSP SNPTGRVLGT AHLRKVVEWA
RERGAIVASD ECYLDLGWDG AEPVSILHPD VCGGSHDNLL AVHSLSKRSN LAGYRAAFVT
GDPALVEELL AVRKHAGMIV PAPVQAAMGA ALDDDAHATE QKERYRSRRA RLREALEGAG
WRIEHSDAGL YLWASHPDHD AWGAVAHLAE RGVLVAPGDF YGPAGAGHVR VAFTATDERV
EAAAERLA