Gene Ndas_3433 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3433 
Symbol 
ID9247300 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4109925 
End bp4110962 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content77% 
IMG OID 
Productaminotransferase class I and II 
Protein accessionYP_003681344 
Protein GI297562370 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0324026 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.394034 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGGGAGT ACATGGACGC GGGCCACGAC CTGCGCCACC ACGGTGACGC CGAGGTCGGC 
GGCGGGCTGC TCGACCTGGC CGTCAACGTG CGGGGCCGGA CGCCGCCCGC CTGGCTGGGG
CGGCTCCTCG CCGACTCGCT GACCGGCCTG GGCGCCTACC CCGACCCCTC CCGCGCGCGG
AAGGCGGTCG CCCGGCGCCA CGGACGGGAG CCGGGCGAGG TCCTGCTCAC CGCCGGGGCG
GCCGAGGCCT TCGTCCTCCT GGCGCGGGTC CTGAACCCCC GGCGGGCGGT CGTCGTGCAC
CCCCAGTTCA CCGAGCCCGA GGCCGCCCTG CGCGCCGCCG GGCACGCCGT GGACCGGGTG
CTGCTGGAAC CGGACTTCAC CCTCGACCCC GCACTGGTCC CCGAGGACGC CGACCTGGTC
GTGGTGGGCA ACCCGACCAA CCCGACCTCC GTGCTCCACC CCGGACCGGT CCTGGCCGGG
CTGGCCCGCC CCGGGCGCGT GCTGGTCGTC GACGAGGCCT TCGCCGACTG CGTTCCCGGC
GAAACGGAGT CGCTGGCCTC CCGCGGGGAC CTGCCGGGCC TGGTGGTGGT GCGCAGCCTC
ACCAAGACCT GGTCCCTCGC CGGGCTGCGC GCCGGCTACC TGCTCGCCGA ACCCGACCTG
GTGGCCAGGT TCTCCGAGGC ACAGCCCCTG TGGTCGGTGT CCACGCCCGC CCTGGTCGCG
GTGGAGGCGT GCTGCAGGCC GGAGGCCCTC GCCGAGGCCG ACGCCTGGGC GACGTCCCTG
ACCGAGCACC GCGACGACCT CGCCGCGGGC CTGCGGAACC TGGGCCTGCG GGTGGTCCCG
GGAGCCCGGG CCTCGTTCCT GCTGGTAGCG GACCCTGAGG CGGACCGGCT GCGGGCCCGC
CTCAGGGAAG GGGGGATCGC CGTCCGGCGC GGTGACACCT TTCCCGGCCT CGGCCCGGAG
TGGTTCCGGG TGGCGGTCCG CGAACCCGCC GTCCACCGGG TCCTGACGGA CGCGCTGGGG
GAGTTGCTCG ACCGGTGA
 
Protein sequence
MGEYMDAGHD LRHHGDAEVG GGLLDLAVNV RGRTPPAWLG RLLADSLTGL GAYPDPSRAR 
KAVARRHGRE PGEVLLTAGA AEAFVLLARV LNPRRAVVVH PQFTEPEAAL RAAGHAVDRV
LLEPDFTLDP ALVPEDADLV VVGNPTNPTS VLHPGPVLAG LARPGRVLVV DEAFADCVPG
ETESLASRGD LPGLVVVRSL TKTWSLAGLR AGYLLAEPDL VARFSEAQPL WSVSTPALVA
VEACCRPEAL AEADAWATSL TEHRDDLAAG LRNLGLRVVP GARASFLLVA DPEADRLRAR
LREGGIAVRR GDTFPGLGPE WFRVAVREPA VHRVLTDALG ELLDR