Gene Ndas_0481 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0481 
Symbol 
ID9244322 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp579001 
End bp580620 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content72% 
IMG OID 
ProductTAP domain protein 
Protein accessionYP_003678434 
Protein GI297559460 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.223264 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCACAG CACGCGCAAC CACACTGGCC TCGACCGTCC TGGTGACGGC CCTGGCCGCA 
TCCCTCGCAG CGGCCTCCCC CGCCGCGGCC GATCCCCTGG ACTCCCCCGC CGACGACACC
GCGCTCGCCG AGTTCCACGA CCAGGAACTG GCCTGGGCTC CCTGTACCGA GGAGGTGCTC
CAGGGACTGG AGTGCGCCGA CGTCGTGGTC CCGCTCGACT ACTCCGACCC CGGAGGCGAG
CGCGTCACCG TGGCCATCAG CCGGGCCCGT GCCGCCGACC CCGAACAGCG GCGCGGCATC
CTGCTCACCA ACCCCGGGGG TCCGGGCGGG CACGGGCGGA GCATGCCCCT GCCGCTGGAC
CCGCAGACGG GCACCGGACT CGTCGGCGGC CTGCGGGTCG CCGAGGTGTA CGACGTCATC
GGCATGGACC CCCGCGGCAC CGGCGGGTCC TCTCCCAGGC TGGACTGCGA CTCGGAGCCG
CCGCCCGTGT ACCCGAGTCC CACGGACGAG GAGATCTCCG AGGCCACCCG GTCCGCGATC
CGCTACCAGC GCGCCTGCGA GCAGGCCGAG GGCGACGTCC GCCCGCACAT GACCACCGCC
AACACCGCCC GCGACATGGA CGTCATCCGG GCCGCCCTGG GTGAGGAGAA GACCAACTAC
CTGGGCTACT CCTACGGCAC CTACCTGGGC GCGGTCTACG GGAGCCTGTT CCCCGAGCGG
CTGGACCGCA GCGTGCTGGA CTCGTCGGTG CACCCCGACG GCGTCTGGCG CGACGTCTTC
ATGATGCAGG CGCCGGCCTA CAGCGAGAAC ATGGACCGCT ACACGGCGTG GGTGGCGGGG
TACGACGACG TCTTCGGCTT CGGTTCGACC CGGGAGGAGG TCTACGCGAC CTTCGAGGCG
ACCGCCGAGC GGCTGCGGGA GGAACCGCTC GAGGTCGCGC CCGGCGTGCT CTACGACAAC
CACCTGTTCC ACATGGAGGT GGGCATGTAC TCGCGCTTCC AGGACCGGTG GGACGTGGTC
TCGGAGATCC TGGGGTACAT CGTGCACGAT CAGCCCCTGC CCCAGCCCAT CGCCGCCCAG
GCGGGCGCTC TGGCCGAGGA GGAGTACGGC ACGGCCACCA TCGACCTGCA GACGGCGGTG
CTGTGCGAGG CCGAGTGGCC CGAGCGGATC AGCACCTACC ACGCGGACGC CCGGGAGTAC
CGGGAGGAGC ACCCCTACGG CAGCGGCGCC TACTGGGCGG TTCCCCACCC GTGCACCTTC
AACGGCCTCG ACCGGCCCGA GCCCCCGGTC GAGCTGGAGC GGGAGGGCTA CCCCGAGGCA
CTGGTCATCG CCGGTGAGTT CGACGCCAAC ACCGCCTACG AGGGCGGCCC GGCCATGGCC
GACCGGCTGG ACAGCTCCCT GATCACCGTC GCCGGTGACG GCGGGCACGG CTTCTACCTC
CCCGGCGGCC TGGACTGCGT GGCCGACGCG GTGGACGCCT ACCTGGTGGA CGGGGCCCGG
CCCGAGGACC TCACCTGCCA GGGCCTGCCC CCGGCGGAGG TGGACGGGGA GATCGAGGTG
GAGGCCGAGG ACCTCGTGGC ACTGCGCGAC CGGACCGGCC CGAGCGGTCT GGGGATCTGA
 
Protein sequence
MRTARATTLA STVLVTALAA SLAAASPAAA DPLDSPADDT ALAEFHDQEL AWAPCTEEVL 
QGLECADVVV PLDYSDPGGE RVTVAISRAR AADPEQRRGI LLTNPGGPGG HGRSMPLPLD
PQTGTGLVGG LRVAEVYDVI GMDPRGTGGS SPRLDCDSEP PPVYPSPTDE EISEATRSAI
RYQRACEQAE GDVRPHMTTA NTARDMDVIR AALGEEKTNY LGYSYGTYLG AVYGSLFPER
LDRSVLDSSV HPDGVWRDVF MMQAPAYSEN MDRYTAWVAG YDDVFGFGST REEVYATFEA
TAERLREEPL EVAPGVLYDN HLFHMEVGMY SRFQDRWDVV SEILGYIVHD QPLPQPIAAQ
AGALAEEEYG TATIDLQTAV LCEAEWPERI STYHADAREY REEHPYGSGA YWAVPHPCTF
NGLDRPEPPV ELEREGYPEA LVIAGEFDAN TAYEGGPAMA DRLDSSLITV AGDGGHGFYL
PGGLDCVADA VDAYLVDGAR PEDLTCQGLP PAEVDGEIEV EAEDLVALRD RTGPSGLGI