Gene Ndas_0565 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0565 
Symbol 
ID9244407 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp698590 
End bp700692 
Gene Length2103 bp 
Protein Length700 aa 
Translation table11 
GC content75% 
IMG OID 
ProductProtein of unknown function DUF2510 
Protein accessionYP_003678518 
Protein GI297559544 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.371062 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGGAT CTATCGAGCC GGGTTGGTAC GCGGACCCGC AGGGCGACGC GCAGACGCTC 
AGATGGTGGA ACGGCAGCGA GTGGACACGG CATACCCGTT CCCTGAGCGA GCTCCAGGGC
GGTGGCGATC CGGGAGACGA GGAGGCCGCG ACCACCCACC TGGGCGGATC CGGACAGGGC
GGGTCCGGAC AGGGCGCCCC CGCCTCCGAC GAGGAGGATC CCGGAACGGT GCGGCTGGGC
CCCTCCCAGA CGCAGTCGGC GGCACCGATC GACGACGAGC CCAGCACCAT GCGCATCAAC
CCCTCCTGGG CACAGGGATC CCCCCAGCAG GACACCACCC CGCCCTCTCT CCAGCCCCCG
ATCGACGACG AACCCAGCAC GATGCGGATC AACCCGTCGT GGACACAGGG TTCCTCCTCT
CCCTCTTCCC CCTCCCCCGC CTCACCGCCC CCCGGCGCCC CCATCGACGA CGAGCCCAGC
ACCATGCGCA TCAACCCCTC CTGGGCGCGG GGCGGCGACG ATGACGAGGA GCCCACCGCC
GACCTGAGCG GCACGAACTC CACGATGCGC GTCGACCCGG CCGACCTCCC GGGCCGCGGC
GGCGGCCCGG GGACGGACGG CGGGTACGGG TCGGGCGACG GGGCGACCGT GCGGGTCAAC
CCGGGCGACC TTCCCCGGCG CGACGAGGAC GAACTGCCCA CCGCCGACCT CACCGACGAC
GAGATCCCCA CGTCCAACCT GACCGACGAC GACCTGGGCG GTGAGGCGCC GCGCACCGCG
GTGTTCGACC CGGGTGGTCC CGGTGCTTCC GGCACTCCTG GTGCCCCCGG TGCCCCCGGG
TCGGCGGGGA CTCCGAGTGA GGCGCCGCGC ACCGCCGTGT TCGACCCGAA CGACCCCGCG
CTCCCGGGCG GCGGGACCTC CGGTACGGCC GTGTTCGACC CGGCCGACCC GGCCTTCTCT
GGAGGCGGGG CCGCAGGGGG CGGGACCGCC GGTACCGCGG TGTTCAACCC CTCGGGTGGC
GAACCCTCCG GTACGGCCGT GTTCAACCCG GGCGACCCCG CCTTCTCGGG AGGCGCGGAC
GAGGGGAACA AGAAGGGCGG CAAGTTCAAC AAGTTCCTGG GCAGCCTCAA GGAGGGCTGG
TCCGACCTGT CCGAGGAGCG CAAGGAGCAG CGCCGCCACT CCGAGGAGGA GGCGGCCAAG
CGGCGTGAGG AGGAGCGGGT CCGCAGGGAG GCCGAACTCC AGGAGGAGGC CAAGCGCCGC
GAGGAGGAGG CCCGGCGCCG CGCCGAGGAG CAGCCCCCGG GCGCGCAGGG GCCGGGCGCC
GCTGCCGCCA CCGGTTCGGC GGTCGGTGCG CAGCCGGGAG CCCCGTGGTC CCCGACCGGT
CAGCCCGAGC AGCCCGGCCA GCCCGGTCAG CACTCCTCCG AGCAGCAGGC CCCGGGCCGG
CAGCCCTCGG GGCCGCAGCC CTCCGCCCAG CCGTTCCCCG GTGCCCAGCA GCCCCCGGGC
TTCCCGCCCG CCGCGCAGCA GCCGCCCTCG GGCCCCCAGC AGGGGCACTC CGCTCCGCCC
GCGGCCCCGT CCGGACCGCA GCAGGGCATG TTCTCGCCCC AGCGCCAGTC GGGACCGCAG
CACCCGCCGT CGGGGCCCCA GGCCGGTTAC CGCCAGCCCC CGCCGCCCCA GCAGGGTGGT
TTCGGTCCGC CCGGATACCC GCCCCCGCAC CAGATGGGCG GCCCGAACAC TCCGGGGCGT
CCGCCACAGG GCATGCCCCC CGTCGGTCCG CCGCCCGGCC AGCGGAACAA GCGGTCGGGC
CCGCCGCCCG GGTACCCGCC GGCCGTCGGC CCCATGGCCA ACACCCCGGG CGGACAGCCG
CCGATGGGCG GACACCCCGG GGGCCCGCAG TGGGGCCAGC CGGGTCCGGG GCAGCCCCCC
CGGGCCGGGA CAGCCCTACC AGCAGCACCG CCCGCAAAAG CCCAAGAAGA AGCGGGGAGG
GTGCAGGGGA TGCGGCTGCG GCTGCTTCAC CTTCCTGCTC ATCCTGCTCG TCCTCCTGGT
CCTGGGCTAC CTCCAGTTCT GGGGCGTGTG GGACTGGATG TACGCCATCT TCGGCGTCGG
TGA
 
Protein sequence
MGGSIEPGWY ADPQGDAQTL RWWNGSEWTR HTRSLSELQG GGDPGDEEAA TTHLGGSGQG 
GSGQGAPASD EEDPGTVRLG PSQTQSAAPI DDEPSTMRIN PSWAQGSPQQ DTTPPSLQPP
IDDEPSTMRI NPSWTQGSSS PSSPSPASPP PGAPIDDEPS TMRINPSWAR GGDDDEEPTA
DLSGTNSTMR VDPADLPGRG GGPGTDGGYG SGDGATVRVN PGDLPRRDED ELPTADLTDD
EIPTSNLTDD DLGGEAPRTA VFDPGGPGAS GTPGAPGAPG SAGTPSEAPR TAVFDPNDPA
LPGGGTSGTA VFDPADPAFS GGGAAGGGTA GTAVFNPSGG EPSGTAVFNP GDPAFSGGAD
EGNKKGGKFN KFLGSLKEGW SDLSEERKEQ RRHSEEEAAK RREEERVRRE AELQEEAKRR
EEEARRRAEE QPPGAQGPGA AAATGSAVGA QPGAPWSPTG QPEQPGQPGQ HSSEQQAPGR
QPSGPQPSAQ PFPGAQQPPG FPPAAQQPPS GPQQGHSAPP AAPSGPQQGM FSPQRQSGPQ
HPPSGPQAGY RQPPPPQQGG FGPPGYPPPH QMGGPNTPGR PPQGMPPVGP PPGQRNKRSG
PPPGYPPAVG PMANTPGGQP PMGGHPGGPQ WGQPGPGQPP RAGTALPAAP PAKAQEEAGR
VQGMRLRLLH LPAHPARPPG PGLPPVLGRV GLDVRHLRRR