Gene Ndas_4715 details


Gene Information

Locus tag: Ndas_4715
Symbol:
ID: 9248597
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 5596657
End bp: 5597916
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: protein of unknown function DUF1205
Protein accession: YP_003682607
Protein GI: 297563633
COG category:
COG ID:
TIGRFAM ID:
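
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5596657
end_bp = 5597916
gene_length_bp = 1260
protein_length_aa = 419

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons; the last one is the stop codon,
# which does not contribute a residue to the protein.
codons = span // 3
residues = codons - 1

print(span == gene_length_bp)         # coordinates agree with Gene Length
print(span % 3 == 0)                  # length is a whole number of codons
print(residues == protein_length_aa)  # codons minus stop agree with Protein Length
```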


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.683026
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.640764
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGCGTGC TGATCGCGAC GCAGGCGGAG CGGACCCACT TCCTGGGGCT GGTGCCCCTG 
GCGTGGGCGC TGCGCGCCGC GGGCCACGAG GTCCGGGTGG CCAGCCAGCC CGAACTGGAG
GCGGTGGTCA CCGGGACGGG CCTGCCCTTC TCCCCCGTGG GCAGGGACCA CCTCCTGCGC
AGGGTCATGC GGCAGTACCA CGCGATGACC GGCGGGGAGG ACGACGACTT CGACATGGCC
GAGGACCGTG ACGAGGTCCT GACCTGGGAC TACCTCCTGG AGGGCTACCG CCTGACCGTG
CAGTGGTGGT GGCGGATGGT CAACGACCCC ATGGTCGACG ACCTGGTCGC CCTCTGCCGC
GAGTGGCGCC CCCACCTGGT CGTGTGGGAG CCCATCACCT TCTCCGGGGC GATCGCCGCC
GAGGCCTGCG GGGCCGCGCA CGTGCGCTAC CTGTGGGGGG CCGACATCTT CGCCCGCACC
CGCGCGCGCT TCCTGGCGCG GATGGGCGAA CAGCCCGCCT CACGGCGCGA GGACCCCCTG
GCCGCGTGGC TGGGGACCAG GGCGGCCCGG TACGGGGTGA ACTTCTCCGA GACCCTGGTC
CACGGCCAGG CCACCGTCGA GCAGGTCCCC GCGTCCCTGC GGGTGGACAC GCCCGCGCAC
CTGGAGTACC TGCCGGTGCG CTACGTGCCC TACAACGGAC GCGCCGTCGT CCCCCACTGG
CTGCGCACAC AACCCGACCG CCCCCGGATC GGACTCAGCC TCGGGACCAG CGCGAACGAG
TGGTACGGCG GTCACCGGGT CTCCGCCGGG GAGATCCTGG AGGGTCTGGC CGAGCTGGAC
GTGGAGGTGG TGGCCACCCT GCCCGCCAGT GAGCAGGCCA AGCTCGGCGC CGTCCCCGGC
AACGCCCGCC TGGTCGAGTA CGTCCCCCTG CACGCCCTGG CCCCCACCTG CGCCGCCATG
GTCACCCACG GCGGCCCCGG CACCGTCCTG ACCGGCCTCG CCCACGGAGT CCCCCAACTC
CTGTCACCCA ACGCGCACAT GTTCGACACG GTCCTGCTGT CCGGGCTGGT GGAGGAGGCC
GGGGCGGGCA GGGTCGTGGA CCCCGACCGC CTGGACGCCG CCACCGTCGC CGCAGGCGTG
CGCACCCTCC TGGAGGACCC CCGCCACACA AGCGCCGCCC GCGCCCTGCG CGCACGCATG
GACGCCATGC CCACCCCCGC CGACCTCGCC CACACCCTCG CCGGCCTCAC CCGCACCTGA
 
Protein sequence
MRVLIATQAE RTHFLGLVPL AWALRAAGHE VRVASQPELE AVVTGTGLPF SPVGRDHLLR 
RVMRQYHAMT GGEDDDFDMA EDRDEVLTWD YLLEGYRLTV QWWWRMVNDP MVDDLVALCR
EWRPHLVVWE PITFSGAIAA EACGAAHVRY LWGADIFART RARFLARMGE QPASRREDPL
AAWLGTRAAR YGVNFSETLV HGQATVEQVP ASLRVDTPAH LEYLPVRYVP YNGRAVVPHW
LRTQPDRPRI GLSLGTSANE WYGGHRVSAG EILEGLAELD VEVVATLPAS EQAKLGAVPG
NARLVEYVPL HALAPTCAAM VTHGGPGTVL TGLAHGVPQL LSPNAHMFDT VLLSGLVEEA
GAGRVVDPDR LDAATVAAGV RTLLEDPRHT SAARALRARM DAMPTPADLA HTLAGLTRT
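
The gene sequence begins with GTG while the protein begins with M. This is consistent with translation table 11, in which GTG is one of the permitted start codons and is read as methionine at the initiation position. The following is a minimal sketch of that translation, not the database's own pipeline: the `translate` function and its `at_cds_start` parameter are hypothetical names introduced here, and the two test strings are the first row and the end of the last row of the gene sequence above.

```python
from itertools import product

# NCBI translation table 11 (bacterial, archaeal and plant plastid code).
# Codons are enumerated in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by table 11; each is read as M at the initiation position.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna, at_cds_start=True):
    """Translate DNA to protein; ignores whitespace and stops at the first stop codon.

    Hypothetical helper for illustration. `at_cds_start` says whether the
    first codon of `dna` is the initiation codon of the CDS.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and at_cds_start and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon: end of translation
            break
        protein.append(aa)
    return "".join(protein)


first_row = "GTGCGCGTGC TGATCGCGAC GCAGGCGGAG CGGACCCACT TCCTGGGGCT GGTGCCCCTG"
last_row_tail = "CACACCCTCG CCGGCCTCAC CCGCACCTGA"

print(translate(first_row))                          # MRVLIATQAERTHFLGLVPL
print(translate(last_row_tail, at_cds_start=False))  # HTLAGLTRT
```

Both outputs match the corresponding stretches of the protein sequence listed above. The terminal TGA is the stop codon and yields no residue.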