Gene Ndas_1451 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1451 
Symbol 
ID9245301 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1775643 
End bp1776857 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content74% 
IMG OID 
ProductDegT/DnrJ/EryC1/StrS aminotransferase 
Protein accessionYP_003679389 
Protein GI297560415 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.362156 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0812638 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAGAAGT TGAAGAGAAC GCTCGAACCC CCCGAGGCGG CGCCCCCCGG CAGAACACGC 
CTCCCCGTCC CCTTCTTCGA CCAGTCCCGG AGCTTCGCGG AACTGTGGCC GCGCATCCGG
GACAACTGCC TGCGGGTCAT GGACCGGGGC AAGTTCTCGC ACGGGGCCAT GGTCGCCGAG
TTCGAGGACG CCCTGGCCCG CTGGACCGGC GCCCGGCACG TGGTCGGCGT CAACTCCGGA
ACCGACGCCC TCGTCATCCT GCTGCGCGCC GCCGGGCTGC GCCCCGGCGA CGAGGTGATC
GTCCCCGCCT ACTCCTTCGT CGCCACCGCC AGCTCCGTCG TCCTCGCCGG AGGCGTCCCG
GTCTTCGCCG ACATCGAGGA GCACGGGTAC GGCATCGACC CGGCCTCGGT GGACGCGGTG
GCCACCTCCC GCACCCGGAT GGTCATGCCG GTCCACCTGT TCGACCGCCT GGCCGACATG
GAGGGCGTGC GCGAGGTCGC CCGGCGCCGC GGCCTGACCG TGCTGGAGGA CAGCGCCGAG
GCCATCGGCA TGCGGCTGCG CGGCGTGCAC GCCGGGCTGC TGGGCACCGG CGGCGTGCTG
TCCTTCTTCC CCTCCAAGAC CCTCGGCGCC ATCGGCGACG CGGGCGCGCT GCTCACCGAC
GACGACGCCG TCGCCGAGAC CGCGCGGGCG CTGCGCCACC ACGGCCGCTC CGGACGCACC
CTGGACGACT TCCCCGGCAT CGCCAACCCG ACGGTCGTCG CGGGCTGCAA CAGCAAGATG
GACGACCTCC AGGCCGCCGT GCTGCTGGCC AAGCTCTCCC GCCTCGACGC CGACATCGCC
CGCCGCGCCG AGCTGTCCGC GCGCTACGAC GCCCGCCTGC GCGACCTGCC CGGGATACGC
GCCGTGCCCG GCGCCGTTCC GCCCCACCCC GGCGGCAACC GGGTCGTCTA CGTCCACCTG
GTCGAGGCCG ACGACCGCGA CGCCCTGGTC GCCCACCTGG CCGAGGCCGG GATCGGCACC
GAGACCTACT ACCCGATCCC GCTGCACCTG CAACCCTGCT TCACCCACCT GGGACACGCG
CCCGGCGACT TCCCGCGCGC CGAGGCGGCC TGCGAGGGGG CGGTGGCCCT ACCGCTCTAC
CCCGACCTGA CCGACGCCCA GGCGGACCGG GTCTGCGAGG AGATCGAGGA CTTCTGCCTT
CGGAGGCACG GATGA
 
Protein sequence
MEKLKRTLEP PEAAPPGRTR LPVPFFDQSR SFAELWPRIR DNCLRVMDRG KFSHGAMVAE 
FEDALARWTG ARHVVGVNSG TDALVILLRA AGLRPGDEVI VPAYSFVATA SSVVLAGGVP
VFADIEEHGY GIDPASVDAV ATSRTRMVMP VHLFDRLADM EGVREVARRR GLTVLEDSAE
AIGMRLRGVH AGLLGTGGVL SFFPSKTLGA IGDAGALLTD DDAVAETARA LRHHGRSGRT
LDDFPGIANP TVVAGCNSKM DDLQAAVLLA KLSRLDADIA RRAELSARYD ARLRDLPGIR
AVPGAVPPHP GGNRVVYVHL VEADDRDALV AHLAEAGIGT ETYYPIPLHL QPCFTHLGHA
PGDFPRAEAA CEGAVALPLY PDLTDAQADR VCEEIEDFCL RRHG