Gene Ndas_4935 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4935 
Symbol 
ID9248822 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp72582 
End bp73715 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content73% 
IMG OID 
ProductNMT1/THI5 like domain protein 
Protein accessionYP_003682824 
Protein GI297563851 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0962986 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACACCC GACGCGACAT CCTGCGCTAC ACCGCGGCCG CCGGGGCCGC CCTCCCCCTG 
CTCTCCGCAT GCGGACCCCC GGGGAGCGGC GCGGCCGGAG CCTCCCCCGT GATCCGCTAC
CAGGGCTGGA CCGGAGACGT CCTCCTGCCC GAACTCGCCG AGGACCTGGG CTACCTGGAC
GGCATCGGGC TGGAGTGGAT CGGCGACACC ACCAGCGGCC CCCAGGACAT CCAGGCCGCG
GCCACCGGCA GCACCGATGT GGGCGGCGCC TTCAACGGGG CGATCGCCAA GCTGGCCGCC
GCCGGGGCGC CCGTCACCGC CGTCCTGGCC TACTACGGGG CGGACGAGGA GACCCACAAC
GGCTACTACG TCCTGGAGGA CAGCGACATC ACCGGGGCCC GCGACCTCGT CGGCAAGCGG
GTCTCCATGA ACACCCTGGG CGCCCACCAC GAGTTCGTGG TCCGCGAGTG GCTGGCCAGG
GAGGGGCTGA CCAACGAGGA GATCGCCCGG GTGGAGCTGA CGGTGGTCCC GCCGGTCAAC
GCCGAGCAGA CCCTGCGCAA CGGGCAGGTG GAGGTCGCCA CGCTCGGCGA CCTGCTGCGC
GAGGTCGCCC TGGAGCGCGG CGGCATCCGG CCCCTGTTCA CCGACCACGG CCTGTACGGC
GCCTTCAGCT ACGGCTCCCT CGTGCTGCGC GACGACTTCA TCGAGGCACA CGAGGACACC
GTCCAGGCCT TCGTCGGCGG GGTCGCCCGC GCGATCCGGT GGACGCAGAC CACCCCGCGC
GAGGAGGTGG TGGACCGCTA CACCGACATC ATCGGGCGGC GTGGCCGCAA CGAGAGCGCC
GAGGCCGTCC GGTACTGGCG CAGCACCGGC GTCGCCGGAC CCGGCGGCGT CATCGCCCCG
GACGAGTTCC GGACCTGGAT CGACTGGCTG GTCCGCAACG GCGAACTCGA CGAGGGGGCG
GTCGAGGCCG AGGAGCTGTT CACCAACGAC TACAACCCCT ACGCCAACGG GACCTACCCC
GAGGACTCCG GCCCCGACGG CCGACCCCTC GCGGAGGGCG GTGCTCCCGG GGACGGCGCC
TCCGGGGGAG CGGACGACAC ACGGCGCGCG GCCGGGGACG GGGGGAACCG ATGA
 
Protein sequence
MHTRRDILRY TAAAGAALPL LSACGPPGSG AAGASPVIRY QGWTGDVLLP ELAEDLGYLD 
GIGLEWIGDT TSGPQDIQAA ATGSTDVGGA FNGAIAKLAA AGAPVTAVLA YYGADEETHN
GYYVLEDSDI TGARDLVGKR VSMNTLGAHH EFVVREWLAR EGLTNEEIAR VELTVVPPVN
AEQTLRNGQV EVATLGDLLR EVALERGGIR PLFTDHGLYG AFSYGSLVLR DDFIEAHEDT
VQAFVGGVAR AIRWTQTTPR EEVVDRYTDI IGRRGRNESA EAVRYWRSTG VAGPGGVIAP
DEFRTWIDWL VRNGELDEGA VEAEELFTND YNPYANGTYP EDSGPDGRPL AEGGAPGDGA
SGGADDTRRA AGDGGNR