Gene Ndas_0847 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0847 
Symbol 
ID9244692 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1039758 
End bp1041107 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content77% 
IMG OID 
Productaminotransferase class V 
Protein accessionYP_003678797 
Protein GI297559823 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.178822 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACTG CCGTTTCCGT CCACACCGCC CGTTCCGCCG AGTCCCCGGC CGACGCCGCC 
GGGTGCACCG TCGTCACCTC CGACGAGGGC GTCCCCCTGG TCACCGGCGA ACGGGTCGAG
TACGCCAACT TCGACTACGC GGCCAGTGCC CCGTGTCTGA CCCCGGTGGC CGAGGCCCTG
TCCGCGGCGC TGCCCTACTA CGCCAGCGTG CACCGCGGCG CGGGCCACCA CTCCCAGGTC
AGCACCGACG CCTACGAGCG CGCCCGCCGC AGCGTCGCCC GCTTCGTGGG CGTGCGCGCA
CAGCCCGCCG ACGCCGTGGT GTTCGTGCGC GGCACCACCG ACGCCCTCAA CCTGCTGGCC
CGCTCGGTGC CCGAGGGCTG CACGGTGGTC GTGTTCGAGT CCGAGCACCA CGCCAGCCTG
CTGCCCTGGG AGCACCCCGC CGCCCGCGCC GGACGGGTGG TGCGCCTGCC GCTGCCCGCG
TCCCCCGAGG ACGCGGTGGA GGCCGCGCGC GCCGCGCTGC GCGGGGCCCC CGAGGGCCCG
CGCCTGCTGT GCGTGACCGC CGCCTCCAAC GTCACCGGCG AGATCTGGCC GGTGGAGGAA
CTGGTGCGGG CCGCCCACGC CGAGGGCGCG CGCGTGGTCG TGGACGCCGC CCAGTACGTG
CCGCACCTGC CCTTCAGCCT GGAGGAGACC GGCGCCGACT ACGTCGCCCT GTCCGCCCAC
AAGCTCTACG CCCCCTTCGG CTCCGGGGTG CTGGCCGGCC GCGCCGACTG GCTGCGCGAG
GCCGCGCCCT ACCTGTCCGG CGGCGGAGCC ACGCGGCTGG TCGGCGAACA CGACGTGGTG
TGGAACGACC TGCCCGCGCG GCACGAGGCG GGCAGCCCCA ACGTGCTGGG AGCCGTGGCG
CTGGCCGCCG CCTGCGACAC CCTCGCCCCC GCCACCCAGG AACTCCTGCA CGTGCGCGAG
TCGGCGCTGC TGGAGCGGCT GCGCGCGGGG CTGACGGCGG TCGACGGCGT CACCGAGCTG
ACCCTGTGGG GCGGTGACCA CCCGCGGGTG GGCATCGTGT CGTTCACCGT GGACGGCCTG
CCCGCCGACC TGCTGGCCGC GGCCCTGTCC GCCGAGTACG GGATCGGGGT GCGCGACGGC
CTGTTCTGCG CCCACCCCCT CACCAGGCAC CTGCTGCCCC GGGGGCACGG CCAGGCGGTG
CGGGCGAGCC TGGGAGTGGG CACCACGCGG GAGCACGTGG ACCGCCTCGT CGGCGCGGTC
GCCGAACTCG TGGCCCGCGG ACCGCGCTGG GAGTACGCCG ACTGCGACGG CCGCCTCGCC
CCGGTGCCCG ACCCCGCGCG GGTGCTCTGA
 
Protein sequence
MTTAVSVHTA RSAESPADAA GCTVVTSDEG VPLVTGERVE YANFDYAASA PCLTPVAEAL 
SAALPYYASV HRGAGHHSQV STDAYERARR SVARFVGVRA QPADAVVFVR GTTDALNLLA
RSVPEGCTVV VFESEHHASL LPWEHPAARA GRVVRLPLPA SPEDAVEAAR AALRGAPEGP
RLLCVTAASN VTGEIWPVEE LVRAAHAEGA RVVVDAAQYV PHLPFSLEET GADYVALSAH
KLYAPFGSGV LAGRADWLRE AAPYLSGGGA TRLVGEHDVV WNDLPARHEA GSPNVLGAVA
LAAACDTLAP ATQELLHVRE SALLERLRAG LTAVDGVTEL TLWGGDHPRV GIVSFTVDGL
PADLLAAALS AEYGIGVRDG LFCAHPLTRH LLPRGHGQAV RASLGVGTTR EHVDRLVGAV
AELVARGPRW EYADCDGRLA PVPDPARVL