Gene Ndas_0221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0221 
Symbol 
ID9244055 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp278655 
End bp279995 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content70% 
IMG OID 
Productammonium transporter 
Protein accessionYP_003678177 
Protein GI297559203 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTGACT CCGGCAACAC GGCGTGGCTG CTGATCAGCG CAGCCCTCGT GATGCTCATG 
ACCCCGGGCC TGGCCTTCTT CTACGGAGGC ATGTCACGGG CCAAGAGCGT CCTCAACATG
ATGCTGATGA GCTTCGCGAG CATCGCGGTC GTGAGCGTGC TCTGGGTCGT CATCGGCCAC
TCGCTCACCT ACTCCGACGG GCCCGGGGCG CTCGACACCT TCATCGGCGG CCTGGACTAC
GTCGGCCTGT CCAACCTGAT CGGTGAGATC GAGCCCGCGG CCGAGGACGG CACCGGCGGC
TACCCGCTGC TGGTCGACGC CGGGTTCCAG ATGATGTTCG CGGTCATCAC CGTGGCCCTC
ATCAGCGGCG CCATCGCCGA CCGCGCCAAG TTCGGCGCCT GGCTGCTGTT CGTCCCGGTC
TGGGCTCTCC TGGTCTACTT CCCCGTCGCC CACTGGGTCT GGGGCGAGGG CTGGATCGAG
CAGCTGGAGA TCGGCGGCTA CACGGTCATC GACTTCGCGG GCGGCACCGC GGTGCACATC
AACGCCGGCG CCGCCGCGCT GGCGCTGACC TTCGTCCTGG GCCGCCGCAA GGGCTTCGGC
TCGGAGTCGA TGCGCCCCCA CAACCTGCCG TTCGTCCTGC TGGGCACCGC GCTCCTGTGG
TTCGGCTGGT TCGGCTTCAA CGCGGGCTCG GCCTACGCCG CCGACGGCAC CGCCGCCCTG
GCCCTGGTCA ACACCCAGGT CGCCACCGCC GCCGCCACCG GCGCCTGGAT GCTCGTCGAG
CGCTTCCGCT ACGGCAAGGT CAGCGCGCTG GGCTTCGCCT CCGGCGCCGT CGCGGGCCTG
GTCGCCATCA CCCCGGCCGC CGCCAACGTC ACGCCGCTCG GCGCCATCGC CGTCGGCCTG
CTCTCCGGTG CGGTCTGCGC CTACGCCATC AGCTGGAAGT TCAAGTTCAA GTACGACGAC
GCGCTCGACG TGGTGGGCAT CCACATGGTC GGCGGCATTG TCGGCTCCCT GATCCTCGGC
CTGGTCGCCG CGGGTGTGGC GGGCGGCTCC GACGGCCTGC TCTACGGCGG CGGCATCGGC
CTGCTGGCCG TCCAGACCAT CGCCGTCATC GGCGTCATGC TCTACTCCTT CGCCGTCACC
TGGGTCATCG CCAAGGTCAT CCACCTCGTC ATCGGGTTCC GCATCCCCGA GGAGGTGGAG
ACCAACGGTC TGGACCACGA GCTGCACGCC GAGTCCGCCT ACGCCTTCGA CGAACTCGAC
GAGCTCGAGG ACGCGACGGA AGCGGTCTCC CTGCCGACGC CTCCGGGCGG GGAGACGGCC
TCGCCGAAGG CCAAGGCCTA G
 
Protein sequence
MIDSGNTAWL LISAALVMLM TPGLAFFYGG MSRAKSVLNM MLMSFASIAV VSVLWVVIGH 
SLTYSDGPGA LDTFIGGLDY VGLSNLIGEI EPAAEDGTGG YPLLVDAGFQ MMFAVITVAL
ISGAIADRAK FGAWLLFVPV WALLVYFPVA HWVWGEGWIE QLEIGGYTVI DFAGGTAVHI
NAGAAALALT FVLGRRKGFG SESMRPHNLP FVLLGTALLW FGWFGFNAGS AYAADGTAAL
ALVNTQVATA AATGAWMLVE RFRYGKVSAL GFASGAVAGL VAITPAAANV TPLGAIAVGL
LSGAVCAYAI SWKFKFKYDD ALDVVGIHMV GGIVGSLILG LVAAGVAGGS DGLLYGGGIG
LLAVQTIAVI GVMLYSFAVT WVIAKVIHLV IGFRIPEEVE TNGLDHELHA ESAYAFDELD
ELEDATEAVS LPTPPGGETA SPKAKA