Gene Ndas_4697 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4697 
Symbol 
ID9248579 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5575567 
End bp5576889 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content70% 
IMG OID 
ProductDegT/DnrJ/EryC1/StrS aminotransferase 
Protein accessionYP_003682589 
Protein GI297563615 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0177316 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.674112 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGACGA GTGACGAGGC GGCGGCGGTT CTCGATGCGG TCCGTGAGCA CCACCGGTCC 
ACCCGCACCG AACGCGAGTT CGTTCCGGGT GTGACGGAGA TCTGGCCCTC CGGGGCGGTC
CTGGACGAGG AGGACAGGGT GGCCCTGGTC GCGGCCGCCC TCGACATGCG CATCGCGGCG
GGCCCCAGCG CACGCCGCTT CGAGTCCGCG TTCGCCCGCA GGCTCGGAAG GCGCAAGGCC
CACCTCACGA ACTCCGGCTC CTCGGCCAAC CTGCTCGCGC TGTCCTCCCT GACCTCGCAC
CTGCTGGAGG ACCAGCGGCT GCGCCCCGGC GACGAGGTCG TCACGGTCGC CGCCGGGTTC
CCGACGACCG TCAACCCGAT CCTCCAGAAC GGGCTCGTGC CCGTGTTCGT GGACATCGAG
CTGCGCACGT ACAACACCAC GGTCGAAAGG GTGGAACGGG CCATCGGCCC CCGTACCCGG
GCGATCATGA TCGCCCACGC GCTGGGCAAC CCGTTCGAGG CGCGGGAGAT GGCGCGCCTG
GCGGAGGAGC GCGACCTGTT CCTGATCGAG GACAACTGCG ACGCCGTGGG CTCCCTCTAC
GACGGGCAGG TGACGGGCTC CTTCGGCGAC CTGTCGACGG TCAGCTTCTA CCCGGCGCAC
CACCTCACCA TGGGCGAGGG CGGCTGCGTG CTCACGTCCA ACCTCATGCT GGCCCGTGTG
GTGGAGTCCA TGCGCGACTG GGGGCGGGAC TGCTGGTGCG AGCCCGGTGA GAGCGACACC
TGCCGCAAGC GCTTCAGCTA CCAGCTCGGG ACCCTGCCCC CCGGCTACGA CCACAAGTAC
ACGTTCTCCC ACGTGGGCTA CAACCTGAAG GGGACCGACC TCCAGGCGGC GCTCGGGCTG
AGCCAGCTCG ACAAGCTCGA CTCCTTCGGC GAGGCCCGGC GCCGCAACTG GCGCCGGATG
CGGGAGGGGC TGGACGGGCT TCCGGGCCTG ATCCTGCCGG AGGCCACGCC CAACAGCGAT
CCGAGCTGGT TCGGCTTCGT GGTCACCGTG GACCCCGGGG CGCCGTTCGA CCGGGCCGAG
CTGGTCCACT TCCTGGAGTC CCGGCGGATC GGCACCCGCC TGCTCTTCGC GGGCAACCTG
ACCCGGCACC CCGCCTACCT GGACCGGCCG CACCGCGTGT CCGGAGAGCT GGAGAACAGC
GACATCGCCA CGGAGCGGAC CTTCTGGACC GGGGTCTACC CCGGGCTCAC GGACGAGATG
ATCGACTACG TGGTCTCCTC GGTCACCGAG TTCGTCAAGG AGCGGCACAA GGGCGTCTTC
TGA
 
Protein sequence
MATSDEAAAV LDAVREHHRS TRTEREFVPG VTEIWPSGAV LDEEDRVALV AAALDMRIAA 
GPSARRFESA FARRLGRRKA HLTNSGSSAN LLALSSLTSH LLEDQRLRPG DEVVTVAAGF
PTTVNPILQN GLVPVFVDIE LRTYNTTVER VERAIGPRTR AIMIAHALGN PFEAREMARL
AEERDLFLIE DNCDAVGSLY DGQVTGSFGD LSTVSFYPAH HLTMGEGGCV LTSNLMLARV
VESMRDWGRD CWCEPGESDT CRKRFSYQLG TLPPGYDHKY TFSHVGYNLK GTDLQAALGL
SQLDKLDSFG EARRRNWRRM REGLDGLPGL ILPEATPNSD PSWFGFVVTV DPGAPFDRAE
LVHFLESRRI GTRLLFAGNL TRHPAYLDRP HRVSGELENS DIATERTFWT GVYPGLTDEM
IDYVVSSVTE FVKERHKGVF