Gene Ndas_0176 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0176 
Symbol 
ID9244007 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp224648 
End bp225763 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content70% 
IMG OID 
Productbranched-chain amino acid aminotransferase 
Protein accessionYP_003678132 
Protein GI297559158 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.473738 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAACG ACACCACCAC AAGCGGGTTG ACGTTCGACA TCCAGCTCTC CGACAGGCGG 
AAGACCCCGC AGGAACGAGA GGCGCTGCTG GAGAGTCCCG GGTTCGGCAA GGTGTTCACC
GACCACATGG TGAGCATCCA CTACACCGAG GGGAAGGGGT GGCACGACGC TAAGCTGGAG
CCGTACGGCC CGCTGAGCCT GGACCCGGCC ACCGCCGCCC TCCACTACGC CCAGGAGATC
TTCGAGGGCC TCAAGGCCTA CCGGCACCCC GACGGCTCGC TCGCCTCCTT CCGCCCCGAG
TCCAACGCGG CCCGCTTCAA CCGCAGCGCG GCGCGCATGG CGATGCCCGA GCTCCCCGAG
GAGCTCTTCC TGAAGTCCAT CGAACTCCTC CTGGAGCACG ACGGCGACTG GGTGCCGACC
AAGGAGGACT TCAGCCTGTA CCTGCGCCCG TTCATGGTCG CCACCGACGT CGGCCTGGGC
GTCAACCACC CGTCCCGCTC CTACGTCTAC CTGCTGATCG CCTCGCCGGT CGGCTCCTAC
TTCTCGGGCG GCGTCCAGCC GGTGACGGTG TGGCTGTCCA GGGACTACAC GCGCGCCGCG
CCGGGCGGCA CGGGCGCGGC CAAGTTCGCG GGCAACTACG CGGCGAGCTT CCTCGCCCAG
GCGCAGGCGG TGGAGCAGGG CTGCGACCAG GTGGTCTGGC TCGACGCCCG CGAGCACCGC
TGGGTCGAGG AGATGGGCGG CATGAACCTG TGGTTCGTGT TCGGCTCGGG TGAGAACGCG
CGTCTGCGCA CGCCCCCGCT GACCGGGACC CTGCTGCCGG GCATCACCCG CGAGTCGCTG
CTGACCCTGG CCCCCGACCT CGGCATCCCG GCCGAGGAGG CGCCCATCTC CACCGACGAG
TGGCGTGAGG CGGCCGAGTC CGGCGAGCTC ACCGAGGTGT TCGCCTGCGG CACCGCGGCC
GTCATCACCC CCGTCGGCCG GGTCAAGGGC GACGACGGCG AGTTCACCGT CGGCGACGGC
ACCCCGGGCC CGGTCACCAT GCGCCTGCGC GAGGAGCTGG TGGGCATCCA GACGGGTCTG
CGCGCCGACA AGCACGACTG GATCACCCGG TTCTGA
 
Protein sequence
MNNDTTTSGL TFDIQLSDRR KTPQEREALL ESPGFGKVFT DHMVSIHYTE GKGWHDAKLE 
PYGPLSLDPA TAALHYAQEI FEGLKAYRHP DGSLASFRPE SNAARFNRSA ARMAMPELPE
ELFLKSIELL LEHDGDWVPT KEDFSLYLRP FMVATDVGLG VNHPSRSYVY LLIASPVGSY
FSGGVQPVTV WLSRDYTRAA PGGTGAAKFA GNYAASFLAQ AQAVEQGCDQ VVWLDAREHR
WVEEMGGMNL WFVFGSGENA RLRTPPLTGT LLPGITRESL LTLAPDLGIP AEEAPISTDE
WREAAESGEL TEVFACGTAA VITPVGRVKG DDGEFTVGDG TPGPVTMRLR EELVGIQTGL
RADKHDWITR F