Gene Ndas_0262 details


Gene Information

Locus tag: Ndas_0262
Symbol:
ID: 9244096
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 326518
End bp: 327783
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: protein of unknown function DUF21
Protein accession: YP_003678217
Protein GI: 297559243
COG category:
COG ID:
TIGRFAM ID:
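
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. Neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Length in aa of the product, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length_bp = gene_length(326518, 327783)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the coordinates reproduce the listed 1266 bp and 421 aa.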


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.558818
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGGGCA TCGGGGCCCA GATCGGCCTC GTCCTGGTCC TGGTCGTGGT GAACGCCGTG 
TTCGCGGGCA GCGAGATCGC CCTGATCACG CTGCGGGAGG GGCAGATCAG GCGGCTGGAG
GAGCGCGGCC CGGGTGGGCG CGCGGTGGCC CGACTGGCCC GCGACCCCAA CCGGTTCCTG
GCCACCATCC AGATCGGCAT CACCCTCGCG GGCTTCCTCG CCTCCGCCAC CGCCGCCGTC
TCCCTGGCCC GGCCGCTGGT CGAGCCGCTG GGCTTCCTCG GCTCCTACGC GGCCCCGGTG
TCGGTCGTGC TGGTCACCGT CGTGCTCACC TTCGTCACCC TGGTCCTGGG CGAACTCGCG
CCCAAGCGCA TCGCCATGCA GCGCGCCGAG CCCTGGGCAC TGCTCGTGGC CCGCCCGCTC
AACGCGCTGG CGCTGCTCTC GCGCCCCGCG ATCTGGCTTC TCAGCGCCTC CACCGACCTG
GTCGTCCGGC TCGGCGGGCT CGACCCGCAC GGCGCCAGGG AGGAGGCCAC CGAGGAGGAG
CTGCGCGACA TGATCGAGGC CCAGGGCCAT ATGACCCCCG AACAACGCAC CATCCTCTCC
GGGGCCTTCG ACATCACCGG GCGGACGCTG CGCCAGGTCC TGGTCCCGCG TCCCGACGTG
GACACCGTCC CGGCCGACCT GCCCGCGTGC GAGACAGCCC TGCTCCTGGC CGAGCACGGC
CACTCGCGCG CCCCGGTGGT CGGCCGGGAC GACGTGGACG ACGTCGTCGG CGTGGTGCAC
TGGTCCGACC TCCTGCGGGG CGAGGGCGCG GCCCGGGAGC TGGCCCGCGA ACCGCTGCTG
CTGCCCGACT CGCTGACGGT CTCCGCCGCC CTGCACCGGC TCACCGTCGA ACGCCAGCAG
CTGGCCGTCG TGATCGGTGA GAGCGGCGAG GTCAGCGGCA TCGTCAGCCT GGAGGACCTG
CTGGAGGAGG TCGTCGGCGA GATCTACGAC GAGACCGACA CCGACGAGCG CGCCCCCGCC
CGGCTGGACG GCGGCGCCCT GCGCCTGCCC GGCGTGTACC CCGTCCACGA GCTGGAGGAC
CTCGGCGTCG TCCTCACCGA CCGGCCCCGG GGCAGTTACG TGACCGTCGC CGGGATGGTC
CTGGTCCTGC TCGGCCACAT CCCCGACGAG CCCGGGGAGA GCGTGGACCT GGGCGGCTGG
ACGGCCACGG TCACCGAGGC CGACGGCCGC GTCGTCAGCG AACTCCTGCT CACCCCCGCC
CGCTGA
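
The gene sequence is printed in space-separated blocks of 10 bases, so it needs cleaning before any computation such as the GC content reported in the record. A minimal sketch follows. The helper names are hypothetical, and it assumes the sequence contains only A, C, G, and T.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Illustration on the first row of the gene sequence only; paste the
# full block from the record to compare against the GC content field.
first_row = "ATGGAGGGCA TCGGGGCCCA GATCGGCCTC GTCCTGGTCC TGGTCGTGGT GAACGCCGTG"
print(round(gc_percent(clean_sequence(first_row)), 1))
```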
 
Protein sequence
MEGIGAQIGL VLVLVVVNAV FAGSEIALIT LREGQIRRLE ERGPGGRAVA RLARDPNRFL 
ATIQIGITLA GFLASATAAV SLARPLVEPL GFLGSYAAPV SVVLVTVVLT FVTLVLGELA
PKRIAMQRAE PWALLVARPL NALALLSRPA IWLLSASTDL VVRLGGLDPH GAREEATEEE
LRDMIEAQGH MTPEQRTILS GAFDITGRTL RQVLVPRPDV DTVPADLPAC ETALLLAEHG
HSRAPVVGRD DVDDVVGVVH WSDLLRGEGA ARELAREPLL LPDSLTVSAA LHRLTVERQQ
LAVVIGESGE VSGIVSLEDL LEEVVGEIYD ETDTDERAPA RLDGGALRLP GVYPVHELED
LGVVLTDRPR GSYVTVAGMV LVLLGHIPDE PGESVDLGGW TATVTEADGR VVSELLLTPA
R
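
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an assumption-laden illustration. It uses the NCBI amino acid string for translation table 11, whose codon-to-amino-acid assignments match the standard code, it ignores alternative start codons, and it stops at the first stop codon. The function name is hypothetical.

```python
from itertools import product

# NCBI codon ordering: bases in TCAG order, first base varies slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(block_formatted_dna: str) -> str:
    """Translate a DNA sequence (spaces and newlines allowed) to protein."""
    seq = "".join(block_formatted_dna.split()).upper()
    protein = []
    # Walk the sequence in whole codons; trailing partial codons are ignored.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the translation
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases of the gene sequence and its final row.
print(translate("ATGGAGGGCA TCGGGGCC"))
print(translate("CGCTGA"))
```

The two fragments translate to MEGIGA and R, which agree with the first six residues and the last residue of the listed protein sequence.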